Introduction


This text presents an introduction to algebra suitable for upper-level undergraduate
or beginning graduate courses. While there is a very extensive offering of textbooks 
at this level, in my experience teaching this material I have invariably felt the 
need for a self-contained text that would start ‘from zero’ (in the sense of not 
assuming that the reader has had substantial previous exposure to the subject) but 
that would impart from the very beginning a rather modern, categorically minded 
viewpoint and aim at reaching a good level of depth. Many textbooks in algebra 
brilliantly satisfy some, but not all, of these requirements. This book is my attempt 
at providing a working alternative. 

There is a widespread perception that categories should be avoided at first 
blush, that the abstract language of categories should not be introduced until a 
student has toiled for a few semesters through example-driven illustrations of the 
nature of a subject like algebra. According to this viewpoint, categories are only 
tangentially relevant to the main topics covered in a beginning course, so they can 
simply be mentioned occasionally for the general edification of the reader, who will 
in time learn about them (by osmosis?). Paraphrasing a reviewer of a draft of the
present text, ‘Discussions of categories at this level are the reason why God created 
appendices.’ 

It will be clear from a cursory glance at the table of contents that I think 
otherwise. In this text, categories are introduced on page 18, after a scant reminder 
of the basic language of naive set theory, for the main purpose of providing a context 
for universal properties. These are in turn evoked constantly as basic definitions 
are introduced. The word ‘universal’ appears at least 100 times in the first three 
chapters. 

I believe that awareness of the categorical language, and especially some appreciation
of universal properties, is particularly helpful in approaching a subject
such as algebra ‘from the beginning’. The reader I have in mind is someone who
has reached a certain level of mathematical maturity (for example, who needs no
special assistance in grasping an induction argument) but may have only been exposed
to algebra in a very cursory manner. My experience is that many upper-level
undergraduates or beginning graduate students at Florida State University and at 
comparable institutions fit this description. For these students, seeing the many in- 
troductory concepts in algebra as instances of a few powerful ideas (encapsulated in 
suitable universal properties) helps to build a comforting unifying context for these 
notions. The amount of categorical language needed for this catalyzing function 
is very limited; for example, functors are not really necessary in this acclimatizing 
stage. 

Thus, in my mind the benefit of this approach is precisely that it helps a true 
beginner, if it is applied with due care. This is my experience in the classroom, 
and it is the main characteristic feature of this text. The very little categorical 
language introduced at the outset informs the first part of the book, introducing 
in general terms groups, rings, and modules. This is followed by a (rather traditional)
treatment of standard topics such as Sylow theorems, unique factorization,
elementary linear algebra, and field theory. The last third of the book wades into
somewhat deeper waters, dealing with tensor products and Hom (including a first
introduction to Tor and Ext) and including a final chapter devoted to homological
algebra. Some familiarity with categorical language appears indispensable to me 
in order to appreciate this latter material, and this is hopefully uncontroversial. 
Having developed a feel for this language in the earlier parts of the book, students 
find the transition into these more advanced topics particularly smooth. 

A first version of this book was essentially a careful transcript of my lectures in 
a run of the (three-semester) algebra sequence at FSU. The chapter on homological 
algebra was added at the instigation of Ed Dunne, as were a very substantial number 
of the exercises. The main body of the text has remained very close to the original 
‘transcript’ version: I have resisted the temptation of expanding the material when 
revising it for publication. I believe that an effective introductory textbook (this 
is Chapter 0, after all. . . ) should be realistic: it must be possible to cover in class 
what is covered in the book. Otherwise, the book veers into the ‘reference’ category; 
I never meant to write a reference book in algebra, and it would be futile (of me) 
to try to ameliorate excellent available references such as Lang’s ‘Algebra’. 

The problem sets will give an opportunity to a teacher, or any motivated reader, 
to get quite a bit beyond what is covered in the main text. To guide in the choice of 
exercises, I have marked with a > those problems that are directly referenced from
the text, and with a -> those problems that are referenced from other problems. A 
minimalist teacher may simply assign all and only the > problems; these do nothing 
more than anchor the understanding by practice and may be all that a student 
can realistically be expected to work out while juggling TA duties and two or three 
other courses of similar intensity as this one. The main body of the text, together 
with these exercises, forms a self-contained presentation of essential material. The 
other exercises, and especially the threads traced by those marked with ->, will offer
the opportunity to cover other topics, which some may well consider just as essential:
the modular group, quaternions, nilpotent groups, Artinian rings, the Jacobson
radical, localization, Lagrange’s theorem on four squares, projective space and
Grassmannians, Nakayama’s lemma, associated primes, the spectral theorem for
normal operators, etc., are some examples of topics that make their appearance in
the exercises. Often a topic is presented over the course of several exercises, placed
in appropriate sections of the book. For example, ‘Wedderburn’s little theorem’
is mentioned in Remark III.1.16 (that is: Remark 1.16 in Chapter III); particular
cases are presented in Exercises III.2.11 and IV.2.17, and the reader eventually obtains
a proof in Exercise VII.5.14, following preliminaries given in Exercises VII.5.12
and VII.5.13. The -> label and perusal of the index should facilitate the navigation 
of such topics. To help further in this process, I have decorated every exercise with 
a list (added in square brackets) of the places in the book that refer to it. For 
example, an instructor evaluating whether to assign Exercise V.2.25 will be immediately
aware that this exercise is quoted in Exercise VII.5.18, proving a particular
case of Dirichlet’s theorem on primes in arithmetic progressions, and that this will
in turn be quoted in §VII.7.6, discussing the realization of abelian groups as Galois 
groups over Q. 

I have put a high priority on the requirement that this should be a self-contained 
text which essentially crosses all t’s and dots all i’s, and does not require that the 
reader have access to other texts while working through it. I have therefore made 
a conscious effort to not quote other references: I have avoided as much as possible
the exquisitely tempting escape route ‘For a proof, see ...’. This is the main reason
why this book is as thick as it is, even if so many topics are not covered in it. Among
these, commutative algebra and representation theory are perhaps the most glaring 
omissions. The first is represented to the extent of the standard basic definitions,
which allow me to sprinkle a little algebraic geometry here and there (for example, 
see §VII.2), and of a few slightly more advanced topics in the exercises, but I stopped 
short of covering, e.g., primary decompositions. The second is missing altogether. 
It is my hope to complement this book with a ‘Chapter 1’ in an undetermined
future, where I will make amends for these and other shortcomings. 

By its nature, this book should be quite suitable for self-study: readers working 
on their own will find here a self-contained starting point which should work well
as a prelude to future, more intensive, explorations. Such readers may be helped 
by the following ‘9-fold way’ diagram of logical interdependence of the chapters: 


[Diagram: logical interdependence of Chapters I through IX]

This may however better reflect my original intention than the final product. For a
more objective gauge, this alternative diagram captures the web of references from 
a chapter to earlier chapters, with the thickness of the lines representing (roughly) 
the number of references: 

[Diagram: web of references from each chapter to earlier chapters]

With the self-studying reader especially in mind, I have put extra effort into
providing an extensive index. It is not realistic to make a fanfare for each and every 
new term introduced in a text of this size by an official ‘definition’; the index should 
help a lone traveler find the way back to the source of unfamiliar terminology. 

Internal references are handled in a hopefully transparent way. For example, 
Remark III.1.16 refers to Remark 1.16 in Chapter III; if the reference is made from
within Chapter III, the same item is called Remark 1.16. The list in brackets 
following an exercise indicates other exercises or sections in the book referring to 
that exercise. For example, Exercise 3.1 in Chapter I is followed by [5.1, §VIII.1.1,
§IX.1.2, IX.1.10]: this alerts the reader that there are references to this problem in
Exercise 5.1 in Chapter I, section 1.1 in Chapter VIII, section 1.2 in Chapter IX, 
and Exercise 1.10 in Chapter IX (and nowhere else). 

Acknowledgments. My debt to Lang’s book, to David Dummit and Richard 
Foote’s ‘Abstract Algebra,’ or to Artin’s ‘Algebra’ will be evident to anyone who 
is familiar with these sources. The chapter on homological algebra owes much to 
David Eisenbud’s appendix on the topic in his ‘Commutative Algebra’, to
Gelfand-Manin’s ‘Methods of homological algebra’, and to Weibel’s ‘An introduction to
homological algebra’. But in most cases it would simply be impossible for me to 
retrace the original source of an expository idea, of a proof, of an exercise, or of 
a specific pedagogical emphasis: these are all likely offsprings of ideas from any 
one of these and other influential references and often of associations triggered by 
following the manifold strands of the World Wide Web. This is another reason 
why, in a spirit of equanimity, I resolved to essentially avoid references altogether. 
In any case, I believe all the material I have presented here is standard, and I only 
retain absolute ownership of every error left in the end product. 

I am very grateful to my students for the constant feedback that led me to
write this book in this particular way and who contributed essentially to its success 
in my classes. Some of the students provided me with extensive lists of typos and 
outright mistakes, and I would especially like to thank Kevin Meek, Jay Stryker, 
and Yong Jae Cha for their particularly helpful comments. I had the opportunity 
to try out the material on homological algebra in a course given at Caltech in the
fall of 2008, while on a sabbatical from FSU, and I would like to thank Caltech 
and the audience of the course for their hospitality and the friendly atmosphere. 
Thanks are also due to MSRI for hospitality during the winter of 2009, when the 
last fine-tuning of the text was performed.

A few people spotted big and small mistakes in preliminary versions of this 
book, and I will mention Georges Elencwajg, Xia Liao, and Mirroslav Yotov for 
particularly precious contributions. I also commend Arlene O’Sean and the staff at
the AMS for the excellent copyediting and production work. 

Special thanks go to Ettore Aldrovandi for expert advice, to Matilde Marcolli 
for her encouragement and indispensable help, and to Ed Dunne for suggestions 
that had a great impact in shaping the final version of this book. 

Support from the Max-Planck-Institut in Bonn, from the NSA, and from Cal- 
tech, at different stages of the preparation of this book, is gratefully acknowledged. 





Chapter I 


Preliminaries: Set theory 
and categories 


Set theory is a mathematical field in itself, and its proper treatment (say via the 
famous ‘Zermelo-Fraenkel’ axioms) goes well beyond the scope of this book and the
competence of this writer. We will only deal with so-called ‘naive’ set theory, which 
is little more than a system of notation and terminology enabling us to express 
precisely mathematical definitions, statements, and their proofs. 

Familiarity with this language is essential in approaching a subject such as 
algebra, and indeed the reader is assumed to have been previously exposed to it. 
In this chapter we first review some of the language of naive set theory, mainly in 
order to establish the notation we will use in the rest of the book. We will then 
get a small taste of the language of categories, which plays a powerful unifying role
in algebra and many other fields. Our main objective is to convey the notion of 
‘universal property’, which will be a constant refrain throughout this book. 


1. Naive set theory 

1.1. Sets. The notion of set formalizes the intuitive idea of ‘collection of objects’. 
A set is determined by the elements it contains: two sets A, B are equal (written 
A = B) if and only if they contain precisely the same elements. ‘What is an element’
is a forbidden question in naive set theory: the buck must stop somewhere.
We can conveniently pretend that a ‘universe’ of elements is available to us, and
we draw from this universe to construct the elements and sets we need, implicitly
assuming that all the operations we will explore can be performed within this uni- 
verse. (This is the tricky point!) In any case, we specify a set by giving a precise 
recipe determining which elements are in it. This definition is usually put between 
braces and may consist of a simple, complete, list of elements: 

A := {1, 2, 3}

is¹ the set consisting of the integers 1, 2, and 3. By convention, the order² in which
the elements are listed, or repetitions in the list, are immaterial to the definition.
Thus, the same set may be written out in many ways:

{1, 2, 3} = {1, 3, 2} = {1, 2, 1, 3, 3, 2, 3, 1, 1, 2, 1, 3}.

This way of denoting sets may be quite cumbersome and in any case will only really 
work for finite sets. For infinite sets, a popular way around this problem is to write 
a list in which some of the elements are understood as being part of a pattern — for 
example, the set of even integers may be written 

E = {..., −2, 0, 2, 4, 6, ...},

but such a definition is inherently ambiguous, so this leaves room for misinterpre- 
tation. Further, some sets are simply ‘too big’ to be listed, even in principle: for 
example (as one hopefully learns in advanced calculus) there are simply too many 
real numbers to be able to ‘list’ them as one may ‘list’ the integers. 

It is often better to adopt definitions that express the elements of a set as 
elements s of some larger (and already known) set S, satisfying some property P.
One may then write

A = {s ∈ S | s satisfies P}

(∈ means element of ...) and this is in general precise and unambiguous³.

We will occasionally encounter a variation on the notion of set, called ‘multiset’. 
A multiset is a set in which the elements are allowed to appear ‘with multiplicity’: 
that is, a notion for which {2, 2} would be distinct from {2}. The correct way to 
define a multiset is by means of functions, which we will encounter soon (see
Example 2.2).

A few famous sets are 

• ∅: the empty set, containing no elements;

• N: the set of natural numbers (that is, nonnegative integers); 

• Z: the set of integers; 

• Q: the set of rational numbers; 

• R: the set of real numbers; 

• C: the set of complex numbers. 

Also, the term singleton is used to refer to any set consisting of precisely one 
element. Thus {1}, {2}, {3} are different sets, but they are all singletons. 

¹ := is a notation often used to mean that the symbol on the left-hand side is defined by
whatever is on the right-hand side. Logically, this is just expressing the equality of the two
sides and could just as well be written =; the extra : is a psychologically convenient decoration
inherited from computer science.

² Ordered lists are denoted with round parentheses: (1, 2, 3) is not the same as (1, 3, 2).

³ But note that there exist pathologies such as Russell’s paradox, showing that even this style
of definitions can lead to nonsense. All is well so long as S is indeed known to be a set to begin
with.

Here are a few useful symbols (called quantifiers):

• ∃ means there exists ... (the existential quantifier);

• ∀ means for all ... (the universal quantifier).

Also, ∃! is used to mean there exists a unique ....

For example, the set of even integers may be written as 
E = {a ∈ Z | (∃n ∈ Z) a = 2n}:

in words, “all integers a such that there exists an integer n for which a = 2n”. In
this case we could replace ∃ by ∃! without changing the set, but that has to do
with properties of Z, not with mathematical syntax. Also, it is common to adopt
the shorthand

E = {2n | n ∈ Z},

in which the existential quantifier is understood. 

Being able to parse such strings of symbols effortlessly, and being able to write 
them out fluently, is extremely important. The reader of this book is assumed to 
have already acquired this skill. 

Note that the order in which things are written may make a big difference. For 
example, the statement 

(∀a ∈ Z) (∃b ∈ Z) b = 2a

is true: it says that the result of doubling an arbitrary integer yields an integer;
but

(∃b ∈ Z) (∀a ∈ Z) b = 2a

is false: it says that there exists a fixed integer b which is ‘simultaneously’ twice as
much as every integer; there is no such thing.

Note also that writing simply

b = 2a

by itself does not convey enough information, unless the context makes it completely
clear what quantifiers are attached to a and b: indeed, as we have just seen, different
quantifiers may make this into a true or a false statement. 
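
The effect of swapping quantifiers can be tried out on a computer, with one caveat: a program cannot range over all of Z. The following Python sketch therefore replaces Z by two finite windows A and B (both are assumptions made only for this illustration, chosen so that B contains 2a for every a in A) and evaluates the two statements above with `all` and `any`.

```python
# Finite stand-ins for Z; the statements in the text range over all integers.
A = range(-5, 6)      # values allowed for a
B = range(-10, 11)    # values allowed for b; contains 2a for every a in A

# (for all a) (there exists b) b = 2a
forall_exists = all(any(b == 2 * a for b in B) for a in A)

# (there exists b) (for all a) b = 2a
exists_forall = any(all(b == 2 * a for a in A) for b in B)

print(forall_exists)  # True
print(exists_forall)  # False
```

The nesting of `all` and `any` mirrors the order of the quantifiers exactly, which is why exchanging them changes the answer.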

1.2. Inclusion of sets. As mentioned above, two sets are equal if and only if
they contain the same elements. We say that a set S is a subset of a set T if every
element of S is an element of T, in symbols,

S ⊆ T.

By convention, S ⊂ T means the same thing: that is (unlike < vs. ≤), it does not
exclude the possibility that S and T may be equal. To avoid any confusion, we will
consistently use ⊆ in this book. One adopts S ⊊ T to mean that S is ‘properly’
contained in T: that is, S ⊆ T and S ≠ T.

We can think of ‘inclusion of sets’ in terms of logic: S ⊆ T means that

s ∈ S ⟹ s ∈ T

(the quantifier ∀s is understood); that is, ‘if s is an element of S, then s is an element
of T’; that is, all elements of S are elements of T; that is, S ⊆ T as promised.
Note that for all sets S, ∅ ⊆ S and S ⊆ S.

If S ⊆ T and T ⊆ S, then S = T.

The symbol |S| denotes the number of elements of S, if this number is finite;
otherwise, one writes |S| = ∞. If S and T are finite, then

S ⊆ T ⟹ |S| ≤ |T|.

The subsets of a set S form a set, called the power set, or the set of parts 
of S. For example, the power set of the empty set ∅ consists of one element:
{∅}. The power set of S is denoted 𝒫(S); a popular alternative is 2^S, and indeed
|𝒫(S)| = 2^|S| if S is finite (cf. Exercise 2.11).
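
For a small finite set the power set can be listed explicitly. The sketch below is one possible way to do so in Python; the helper name `power_set` is introduced here for illustration only, and subsets are stored as `frozenset` objects because Python sets can only contain hashable elements.

```python
from itertools import combinations

def power_set(s):
    """Return the set of all subsets of s, each represented as a frozenset."""
    items = list(s)
    return {
        frozenset(chosen)
        for size in range(len(items) + 1)
        for chosen in combinations(items, size)
    }

S = {1, 2, 3}
P = power_set(S)

# Every element of the power set is a subset of S, and |P(S)| = 2^|S|.
assert all(T <= S for T in P)
assert len(P) == 2 ** len(S)

# The power set of the empty set has exactly one element: the empty set.
assert power_set(set()) == {frozenset()}
```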

1.3. Operations between sets. Once we have a few sets to play with, we can 
obtain more by applying certain standard operations. Here are a few: 

• ∪: the union;

• ∩: the intersection;

• ∖: the difference;

• ∐: the disjoint union;

• ×: the (Cartesian) product;

• and the important notion of ‘quotient by an equivalence relation’. 

Most of these operations should be familiar to the reader: for example, 

{1, 2, 4} ∪ {3, 4, 5} = {1, 2, 3, 4, 5}

while

{1, 2, 4} ∖ {3, 4, 5} = {1, 2}.

In terms of Venn diagrams of infamous ‘new math’ memory: 



[Venn diagrams of S ∪ T, S ∩ T, and S ∖ T]


(the solid black contour indicates the set included in the operation). 

Several of these operations may be written out in a transparent way in terms 
of logic: thus, for example, 

s ∈ S ∩ T ⟺ (s ∈ S and s ∈ T).

Two sets S and T are disjoint if S ∩ T = ∅, that is, if no element is ‘simultaneously’
in both of them.

The complement of a subset T in a set S is the difference set S \ T consisting 
of all elements of S which are not in T. Thus, for example, the complement of the 
set of even integers in Z is the set of odd integers.
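
Readers who wish to experiment may find that Python's built-in `set` type models union, intersection, and difference directly. The sketch below reuses the sets {1, 2, 4} and {3, 4, 5} from the text; the finite `window` standing in for Z is an assumption made only so that the complement can be computed.

```python
S = {1, 2, 4}
T = {3, 4, 5}

union = S | T           # {1, 2, 3, 4, 5}
intersection = S & T    # {4}
difference = S - T      # {1, 2}

# s in S ∩ T  <=>  (s in S and s in T), checked on every candidate element.
assert all((s in intersection) == (s in S and s in T) for s in union)

# S and T are not disjoint, since they share the element 4.
assert not S.isdisjoint(T)

# Complement of the even integers inside a finite window of Z.
window = set(range(-4, 5))
evens = {n for n in window if n % 2 == 0}
odds = window - evens   # {-3, -1, 1, 3}
```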

The operations ∐, ×, and quotients by equivalence relations are slightly more
mysterious, and it is very instructive to contemplate them carefully. We will look
at them in a particularly naive way first and come back to them in a short while
when we have acquired more language and can view them from a more sophisticated
viewpoint.

1.4. Disjoint unions, products. One problem with these operations is that their 
output may not be defined as a set, but rather as a set up to isomorphisms of sets,
that is, up to bijections. To make sense out of this, we have to talk about functions, 
and we will do that in a moment. 

Roughly speaking, the disjoint union of two sets S and T is a set S ∐ T obtained
by first producing ‘copies’ S' and T' of the sets S and T, with the property that
S' ∩ T' = ∅, and then taking the (ordinary) union of S' and T'. The careful reader
will feel uneasy, since this ‘recipe’ does not define one set: whatever it means to 
produce a ‘copy’ of a set, surely there are many ways to do so. This ambiguity will 
be clarified below. 

Nevertheless, note that we can say something about S ∐ T even on these very
shaky grounds: for example, if S consists of 3 elements and T consists of 4 elements,
the reader should expect (correctly) that S ∐ T consists of 7 elements.

Products are marred by the same kind of ambiguity, but fortunately there is 
a convenient convention that allows us to write down ‘one’ set representing the 
product of two sets S and T: given S and T, we let S × T be the set whose
elements are the ordered pairs⁴ (s, t) of elements of S and T:

S × T := {(s, t) such that s ∈ S, t ∈ T}.

Thus, if S = {1, 2, 3} and T = {3, 4}, then

S × T = {(1, 3), (1, 4), (2, 3), (2, 4), (3, 3), (3, 4)}.

For a more sophisticated example, R × R is the set of pairs of real numbers, which
(as we learn in calculus) is a good way to represent a plane. The set Z × Z could
be represented by considering the points in this plane that happen to have integer
coordinates. Incidentally, it is common to denote these sets R², Z²; and similarly,
the product A × A of a set by itself is often denoted A².

If S and T are finite sets, clearly |S × T| = |S| |T|.

Also note that we can use products to obtain explicit ‘copies’ of sets as needed
for the disjoint union: for example, we could let S' = {0} × S, T' = {1} × T,
guaranteeing that S' and T' are disjoint (why?); and there is an evident way to
‘identify’ S and S', T and T'. Again, making this precise requires a little more 
vocabulary. 
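
The product and the ‘tagged copies’ construction of the disjoint union can be imitated for finite sets. The following Python sketch is only an illustration: the variable names are chosen here, and `itertools.product` is used to produce the ordered pairs. It takes S = {1, 2, 3} and T = {3, 4} as in the example above, so that S and T overlap while their copies do not.

```python
from itertools import product

S = {1, 2, 3}
T = {3, 4}

# The Cartesian product as a set of ordered pairs (s, t).
SxT = set(product(S, T))
assert len(SxT) == len(S) * len(T)

# Tagged copies S' = {0} x S and T' = {1} x T.
S_copy = set(product({0}, S))
T_copy = set(product({1}, T))

# The tags 0 and 1 differ, so no pair can belong to both copies.
assert S_copy.isdisjoint(T_copy)

# The disjoint union has |S| + |T| = 5 elements, while S ∪ T has only 4.
disjoint_union = S_copy | T_copy
assert len(disjoint_union) == len(S) + len(T)
assert len(S | T) == 4
```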

The operations ∪, ∩, ∐, × extend to operations on whole ‘families’ of sets: for
example, if S₁, ..., Sₙ are sets, we write

⋂_{i=1}^{n} Sᵢ = S₁ ∩ S₂ ∩ ⋯ ∩ Sₙ

for the set whose elements are those elements which are simultaneously elements of
all sets S₁, ..., Sₙ; and similarly for the other operations. But note that while it is
clear from the definitions that, for example,

S₁ ∪ S₂ ∪ S₃ = (S₁ ∪ S₂) ∪ S₃ = S₁ ∪ (S₂ ∪ S₃),

it is not so clear in what sense the sets

S₁ × S₂ × S₃, (S₁ × S₂) × S₃, S₁ × (S₂ × S₃)

should be ‘identified’ (where we can define the leftmost set as the set of ‘ordered
triples’ of elements of S₁, S₂, S₃, by analogy with the definition for two sets). In
fact, again, we can really make sense of such statements only after we acquire the
language of functions. However, all such statements do turn out to be true, as
the reader probably expects; by virtue of this fortunate circumstance, we can be
somewhat cavalier and gloss over such subtleties.

⁴ One can define the ordered pair (s, t) as a set by setting (s, t) = {s, {s, t}}: this carries the
information of the elements s, t, as well as conveying the fact that s is special (= the first element
of the pair).

More generally, if 𝒮 is a set of sets, we may consider sets

⋃_{S∈𝒮} S,  ⋂_{S∈𝒮} S,  ∐_{S∈𝒮} S,  ∏_{S∈𝒮} S

for the union, intersection, disjoint union, product of all sets in 𝒮. There are
important subtleties concerning these definitions: for example, if all S ∈ 𝒮 are
nonempty, does it follow that ∏_{S∈𝒮} S is nonempty? The reader probably thinks
so, but (if 𝒮 is infinite) this is a rather thorny issue, amounting to the axiom of
choice.

By and large, such subtleties do not affect the material in this course; we will
partly come to terms with them in due time⁵, when they become more relevant to
the issues at hand (cf. §V.3).

1.5. Equivalence relations, partitions, quotients. Intuitively, a relation on 
elements of a set S is some special affinity among selections of elements of S. For
example, the relation < on the set Z is a way to compare the size of two integers: 
since 2 < 5, 2 ‘is related to’ 5 in this sense, while 5 is not related to 2 in the same 
sense. 

For all practical purposes, what a relation ‘means’ is completely captured by 
which elements are related to which elements in the set. We would really know all 
there is to know about < on Z if we had a complete list of all pairs (a, b) of integers 
such that a < b. For example, (2, 5) is such a pair, while (5, 2) is not.

This leads to a completely straightforward definition of the notion of relation: 
a relation on a set S is simply a subset R of the product S × S. If (a, b) ∈ R, we
say that a and b are ‘related by R’ and write

a R b.

Often we use fancier symbols for relations, such as <, ≤, =, ~, ....


⁵ The reader will have to employ the axiom of choice in some exercises every now and then,
even before we come back to these issues, but this will probably happen below the awareness level, 
and so it should. 

The prototype of a well-behaved relation is ‘=’, which corresponds to the ‘diagonal’

{(a, b) ∈ S × S | a = b} = {(a, a) | a ∈ S} ⊆ S × S.

Three properties of this very special relation turn out to be particularly important:
if ~ denotes for a moment the relation = of equality, then ~ satisfies

• reflexivity: (∀a ∈ S) a ~ a;

• symmetry: (∀a ∈ S) (∀b ∈ S) a ~ b ⟹ b ~ a;

• transitivity: (∀a ∈ S) (∀b ∈ S) (∀c ∈ S), (a ~ b and b ~ c) ⟹ a ~ c.

That is, every a is equal to itself; if a is equal to b, then b is equal to a; etc.

Definition 1.1. An equivalence relation on a set S is any relation ~ satisfying
these three properties.

In terms of the corresponding subset R of S × S, ‘reflexivity’ says that the
diagonal is contained in R; ‘symmetry’ says that R is unchanged if flipped about
the diagonal (that is, if every (a, b) is interchanged with (b, a)); while unfortunately
‘transitivity’ does not have a similarly nice pictorial translation. 
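
Since a relation on a finite set S is nothing but a set of pairs, the three properties can be checked mechanically. The Python sketch below is a minimal illustration under that finiteness assumption; the function names `is_reflexive`, `is_symmetric`, and `is_transitive` are introduced here and are not part of the text.

```python
def is_reflexive(R, S):
    """The diagonal {(a, a) | a in S} is contained in R."""
    return all((a, a) in R for a in S)

def is_symmetric(R):
    """R is unchanged when every (a, b) is interchanged with (b, a)."""
    return all((b, a) in R for (a, b) in R)

def is_transitive(R):
    """Whenever (a, b) and (b, d) are in R, so is (a, d)."""
    return all((a, d) in R for (a, b) in R for (c, d) in R if b == c)

S = {1, 2, 3}
equality = {(a, a) for a in S}                        # the diagonal
less_than = {(a, b) for a in S for b in S if a < b}   # the relation <

# Equality satisfies all three properties.
assert is_reflexive(equality, S)
assert is_symmetric(equality)
assert is_transitive(equality)

# The relation < is transitive, but neither reflexive nor symmetric.
assert is_transitive(less_than)
assert not is_reflexive(less_than, S)
assert not is_symmetric(less_than)
```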

The datum of an equivalence relation on S turns out to be equivalent to a type 
of information which looks a little different at first, that is, a partition of S. A
partition of S is a family of disjoint nonempty subsets of S, whose union is S: for
example,

𝒫 = {{1, 4, 7}, {2, 5, 8}, {3, 6}, {9}}

is a partition of the set

{1, 2, 3, 4, 5, 6, 7, 8, 9}.

Here is how to get a partition of S from an equivalence relation ~ on S: for every element
a ∈ S, the equivalence class of a (w.r.t. ~) is the subset of S defined by

[a]_~ := {b ∈ S | b ~ a};

then the equivalence classes form a partition 𝒫_~ of S (Exercise 1.2).

Conversely (Exercise 1.3) every partition 𝒫 is the partition corresponding in
this fashion to an equivalence relation. Therefore, the notions of ‘equivalence relation
on S’ and ‘partition of S’ are really equivalent.

Now we can view 𝒫_~ as a set (whose elements are the equivalence classes with
respect to ~). This is the quotient operation mentioned in §1.3.

Definition 1.2. The quotient of the set S with respect to the equivalence relation ~
is the set

S/~ := 𝒫_~

of equivalence classes of elements of S with respect to ~.

Example 1.3. Take S = Z, and let ~ be the relation defined by

a ~ b ⟺ a − b is even.

Then Z/~ consists of two equivalence classes:

Z/~ = {[0]_~, [1]_~}.

Indeed, every integer b is either even (and hence b − 0 is even, so b ~ 0, and b ∈ [0]_~)
or odd (and hence b − 1 is even, so b ~ 1, and b ∈ [1]_~). This is of course the starting
point of modular arithmetic, which we will cover in due detail later on (§II.2.3).

One way to think about this operation is that the equivalence relation ‘becomes 
equality in the quotient’: that is, two elements of the quotient S/~ are equal if and 
only if the corresponding elements in S are related by ~. In other words, taking a 
quotient is a way to turn any equivalence relation into an equality. This observation 
will be further formalized, in ‘categorical terms’ in a short while (§5.3). 
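
The passage from an equivalence relation to its quotient can be carried out explicitly on a finite set. The Python sketch below is an illustration under stated assumptions: the helper `quotient` is a name chosen here, the relation is passed as a two-argument predicate that is assumed (not checked) to be an equivalence relation, and a finite window of Z replaces Z itself in the parity example.

```python
def quotient(S, related):
    """Return the set of equivalence classes of S.

    `related(b, a)` is assumed to define an equivalence relation on S.
    Each class [a] = {b in S | b ~ a} is stored as a frozenset.
    """
    return {frozenset(b for b in S if related(b, a)) for a in S}

def same_parity(a, b):
    """a ~ b if and only if a - b is even."""
    return (a - b) % 2 == 0

S = set(range(-4, 5))          # a finite window of Z
classes = quotient(S, same_parity)

# Exactly two classes: the even and the odd elements of the window.
assert len(classes) == 2

# The classes are nonempty, pairwise disjoint, and their union is S.
assert all(len(c) > 0 for c in classes)
assert sum(len(c) for c in classes) == len(S)
assert set().union(*classes) == S

# The relation 'becomes equality in the quotient': [a] = [b] iff a ~ b.
cls = {a: frozenset(b for b in S if same_parity(b, a)) for a in S}
assert all((cls[a] == cls[b]) == same_parity(a, b) for a in S for b in S)
```

Building the classes as a set of frozensets means that repeated classes collapse automatically, which is precisely the sense in which related elements become equal in the quotient.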


Exercises


Exercises marked with a > are referred to from the text; exercises marked with a
-> are referred to from other exercises. These referring exercises and sections are 
listed in brackets following the current exercise; see the introduction for further 
clarifications, if necessary. 

1.1. Locate a discussion of Russell’s paradox, and understand it.

1.2. > Prove that if ~ is an equivalence relation on a set S, then the corresponding family
𝒫_~ of equivalence classes defined in §1.5 is indeed a partition of S: that is, its elements
are nonempty, disjoint, and their union is S. [§1.5]

1.3. > Given a partition 𝒫 on a set S, show how to define a relation ~ on S such
that 𝒫 is the corresponding partition. [§1.5]

1.4. How many different equivalence relations may be defined on the set {1, 2, 3}?

1.5. Give an example of a relation that is reflexive and symmetric but not transitive. 
What happens if you attempt to use this relation to define a partition on the set? 
(Hint: Thinking about the second question will help you answer the first one.) 

1.6. > Define a relation ~ on the set R of real numbers by setting a ~ b ⟺ b − a ∈
Z. Prove that this is an equivalence relation, and find a ‘compelling’ description
for R/~. Do the same for the relation ≈ on the plane R × R defined by declaring
(a₁, a₂) ≈ (b₁, b₂) ⟺ b₁ − a₁ ∈ Z and b₂ − a₂ ∈ Z. [§II.8.1, II.8.10]


2. Functions between sets 

2.1. Definition. A common thread we will follow for just about every structure 
introduced in this book will be to try to understand both the type of structures 
and the ways in which different instances of a given structure may interact. 

Sets interact with each other through functions. It is tempting to think of a
function f from a set A to a set B in ‘dynamic’ terms, as a way to ‘go from A
to B’. Similarly to the business with relations, it is straightforward to formalize
this notion in ways that do not need to invoke any deep ‘meaning’ of any given f:
everything that can be known about a function f is captured by the information of
which element b of B is the image of any given element a of A. This information
is nothing but a subset of A × B:

Γ_f := {(a, b) ∈ A × B | b = f(a)} ⊆ A × B.

This set Γ_f is the graph of f; officially, a function really ‘is’ its graph⁶.

Not all subsets $\Gamma \subseteq A \times B$ correspond to (‘are’) functions: we need to put one
requirement on the graphs of functions, which can be expressed as follows:

$(\forall a \in A)\ (\exists! b \in B)\quad (a, b) \in \Gamma_f,$

or (‘in functional notation’)

$(\forall a \in A)\ (\exists! b \in B)\quad f(a) = b.$

That is, a function must send each element $a$ of $A$ to exactly one element of $B$,
depending on $a$. ‘Multivalued functions’ such as $\pm\sqrt{x}$ (which are very important
in, e.g., the study of Riemann surfaces) are not functions in this sense.
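
For finite sets, the requirement on graphs can be checked mechanically. The following
Python sketch is an illustration only: the name `is_graph_of_function` and the encoding
of a subset of $A \times B$ as a Python set of pairs are assumptions made here, not
notation used in the text.

```python
def is_graph_of_function(gamma, A, B):
    """Return True if gamma (a set of pairs) is the graph of a function A -> B."""
    # gamma must be a subset of A x B
    if any(a not in A or b not in B for (a, b) in gamma):
        return False
    # every a in A must occur as a first coordinate exactly once
    return all(sum(1 for (x, _) in gamma if x == a) == 1 for a in A)


A = {1, 2, 3}
B = {"x", "y"}

print(is_graph_of_function({(1, "x"), (2, "x"), (3, "y")}, A, B))            # True
print(is_graph_of_function({(1, "x"), (1, "y"), (2, "x"), (3, "y")}, A, B))  # False: 1 has two images
print(is_graph_of_function({(1, "x"), (2, "y")}, A, B))                      # False: 3 has no image
```

The second call fails the ‘exactly one’ requirement in the same way that a multivalued
assignment does.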

To announce that $f$ is a function from a set $A$ to a set $B$, one writes $f : A \to B$
or draws the following picture (‘diagram’):

$A \xrightarrow{\ f\ } B$

The action of a function $f : A \to B$ on an element $a \in A$ is sometimes indicated by
a ‘decorated’ arrow, as in

$a \mapsto f(a).$

The collection of all functions from a set $A$ to a set $B$ is itself a set$^{7}$, denoted
$B^A$. If we take seriously the notion that a function is really the same thing as its
graph, then we can view $B^A$ as a (special) subset of the power set of $A \times B$.

Every set $A$ comes equipped with a very special function, whose graph is the
diagonal in $A \times A$: the identity function on $A$

$\mathrm{id}_A : A \to A$

defined by $(\forall a \in A)\ \mathrm{id}_A(a) = a$. More generally, the inclusion of any subset $S$ of a
set $A$ determines a function $S \to A$, simply sending every element $s$ of $S$ to ‘itself’
in $A$.

If $S$ is a subset of $A$, we denote by $f(S)$ the subset of $B$ defined by

$f(S) := \{b \in B \mid (\exists a \in S)\ b = f(a)\}.$

That is, $f(S)$ is the subset of $B$ consisting of all elements that are images of elements
of $S$ by the function $f$. The largest such subset, that is, $f(A)$, is called the image
of $f$, denoted ‘$\mathrm{im}\, f$’.

Also, $f|_S$ denotes the ‘restriction’ of $f$ to the subset $S$: this is the function
$S \to B$ defined by

$(\forall s \in S):\quad f|_S(s) = f(s).$

That is, $f|_S$ is the composition (in the sense explained in the next subsection) $f \circ i$,
where $i : S \to A$ is the inclusion. Note that $f(S) = \mathrm{im}(f|_S)$.

6 To be precise, it is the graph $\Gamma_f$ together with the information of the source $A$ and the
target $B$ of $f$. These are part of the data of the function.

7 This is another ‘operation among sets’, not listed in §1.3. Can you see why we use $B^A$ for
this set? (Cf. Exercise 2.10.)

2.2. Examples: Multisets, indexed sets. The ‘multisets’ mentioned briefly in
§1.1 are a simple example of a notion easily formalized by means of functions. A
multiset may be defined by giving a function from a (regular) set $A$ to the set
$\mathbb{N}^*$ of positive$^{8}$ integers; if $m : A \to \mathbb{N}^*$ is such a function, the corresponding
multiset consists of the elements $a \in A$, each taken $m(a)$ times. Thus, the multiset
$\{a, a, a, b, b, b, b, b, c\}$ is really the function $m : \{a, b, c\} \to \mathbb{N}^*$ for which $m(a) = 3$,
$m(b) = 5$, $m(c) = 1$. As with ordinary sets, the order in which the elements are
listed is not part of the information carried by a multiset. Simple set-theoretic
notions such as inclusion, union, etc., extend in a straightforward way to multisets.
For another viewpoint on multisets, see Exercise 3.9.
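
As an informal illustration, the multiplicity function of the multiset above can be
stored as a Python dictionary. The helper names `multiplicity_function` and
`is_submultiset` are hypothetical, and the inclusion test shown is one natural
convention (compare multiplicities element by element), assumed here rather than
taken from the text.

```python
from collections import Counter


def multiplicity_function(listing):
    """Turn a listing with repetitions into m : A -> N*, stored as a dict."""
    return dict(Counter(listing))


def is_submultiset(m1, m2):
    """Assumed convention: m1 is included in m2 if no multiplicity in m1 exceeds the one in m2."""
    return all(count <= m2.get(x, 0) for x, count in m1.items())


m = multiplicity_function(["a", "a", "a", "b", "b", "b", "b", "b", "c"])
print(m == {"a": 3, "b": 5, "c": 1})  # True

# The order of the listing is not part of the data.
shuffled = multiplicity_function(["c", "b", "a", "b", "a", "b", "a", "b", "b"])
print(shuffled == m)  # True

print(is_submultiset({"a": 2, "c": 1}, m))  # True
print(is_submultiset({"a": 4}, m))          # False
```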

Another example is given by the use of ‘indices’. If we write ‘let $a_1, \dots, a_n$
be integers…’, we really mean ‘consider a function $a : \{1, \dots, n\} \to \mathbb{Z}$…’, with
the understanding that $a_i$ is shorthand for the value $a(i)$ (for $i = 1, \dots, n$). It is
tempting to think of an indexed set $\{a_i\}_{i \in I}$ simply as a set whose elements happen
to be denoted $a_i$, for $i$ ranging over some ‘set of indices’ $I$; but such an indexed set
is more properly a function $I \to A$, where $A$ is some set from which we draw the
elements $a_i$. For example, this allows us to consider $a_0$ and $a_1$ as distinct elements
of $\{a_i\}_{i \in \mathbb{N}}$, even if by coincidence $a_0 = a_1$ as elements of the target set $A$.

It is easy to miss such subtleties, and some abuse of notation is common and 
usually harmless. These distinctions play a role in (for example) discussions of 
linear independence of sets of vectors; cf. §VI.1.2. 
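
The distinction can be seen in a two-line experiment. In this sketch (an illustration
only; the dictionary `a` is a made-up example) an indexed family is stored as a
function from the index set $\{0, 1\}$ to the integers.

```python
# An indexed family a : I -> A, with I = {0, 1} and A the integers
a = {0: 5, 1: 5}

print(len(a))                # 2: the family has two members, a_0 and a_1
print(len(set(a.values())))  # 1: as elements of the target set they coincide
```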

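2.3. Composition of functions. Functions may be composed: if $f : A \to B$ and
$g : B \to C$ are functions, then so is the operation $g \circ f$ defined by

(*) $(\forall a \in A)\quad (g \circ f)(a) := g(f(a))$:

that is, we use $f$ to go from $A$ to $B$, then apply $g$ to reach $C$. Graphically we may
draw pictures such as

[Diagram: $A \xrightarrow{\ f\ } B \xrightarrow{\ g\ } C$, with the composite $g \circ f$ drawn as a third arrow from $A$ to $C$.]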

Such graphical representations of collections of (for example) sets connected by
functions are called diagrams. We will draw many diagrams, and in contexts
substantially more general than the one at hand right now.

We say that the diagrams drawn above ‘commute’, or ‘are commutative’, meaning
that if we start from $A$ and travel to $C$ in either of the two possible ways
prescribed by the diagram, the result of applying the functions one encounters is the
same. This is precisely the content of the statement (*).

8 Some references allow 0 as a possible multiplicity.

Composition is associative: that is, if $f : A \to B$, $g : B \to C$, and $h : C \to D$
are functions, then $h \circ (g \circ f) = (h \circ g) \circ f$. Graphically, the diagram

[Diagram: $A \xrightarrow{\ f\ } B \xrightarrow{\ g\ } C \xrightarrow{\ h\ } D$, with additional arrows $g \circ f : A \to C$ and $h \circ g : B \to D$.]

commutes. This important observation should be completely evident from the
definition of composition.
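
For functions between finite sets, composition and its associativity can be tried out
directly. The sketch below assumes, purely for illustration, that a function is stored
as a Python dictionary whose keys form its source; the name `compose` is ours.

```python
def compose(g, f):
    """Return g o f as a dict: first apply f, then g."""
    return {a: g[f[a]] for a in f}


f = {1: "x", 2: "y", 3: "x"}   # f : A -> B
g = {"x": 10, "y": 20}         # g : B -> C
h = {10: True, 20: False}      # h : C -> D

print(compose(g, f))  # {1: 10, 2: 20, 3: 10}

# h o (g o f) and (h o g) o f agree on every element of A.
print(compose(h, compose(g, f)) == compose(compose(h, g), f))  # True
```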

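The identity function is very special with respect to compositions: if $f : A \to B$
is any function, then $\mathrm{id}_B \circ f = f$ and $f \circ \mathrm{id}_A = f$. Graphically, the diagrams

[Diagrams: the triangle formed by $f : A \to B$, $\mathrm{id}_B : B \to B$, and $f : A \to B$; and the triangle formed by $\mathrm{id}_A : A \to A$, $f : A \to B$, and $f : A \to B$.]

commute.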

2.4. Injections, surjections, bijections. Special kinds of functions deserve
highlighting:

• A function $f : A \to B$ is injective (or an injection or one-to-one) if

$a' \neq a'' \implies f(a') \neq f(a'')$:

that is, if $f$ sends different elements to different elements$^{9}$.

• A function $f : A \to B$ is surjective (or a surjection or onto) if

$(\forall b \in B)\ (\exists a \in A)\quad b = f(a)$:

that is, if $f$ ‘covers the whole of $B$’; more precisely, if $\mathrm{im}\, f = B$.

Injections are often drawn $\hookrightarrow$; surjections are often drawn $\twoheadrightarrow$.

If $f$ is both injective and surjective, we say it is bijective (or a bijection or a
one-to-one correspondence or an isomorphism of sets). In this case we often write
$f : A \xrightarrow{\ \sim\ } B$, or

$A \cong B,$

and we say that $A$ and $B$ are ‘isomorphic’ sets.

Of course the identity function $\mathrm{id}_A : A \to A$ is a bijection.

If $A \cong B$, that is, if there is a bijection $f : A \to B$, then the sets $A$ and $B$ may
be ‘identified’ through $f$, in the sense that we can match precisely the elements $a$
of $A$ with the corresponding elements $f(a)$ of $B$. For example, if $A$ is a finite set
and $A \cong B$, then $B$ is necessarily also a finite set and $|A| = |B|$.
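
For finite sets these three properties are easy to test. The following sketch again
stores a function as a dictionary (an assumption made for illustration) and passes the
target $B$ separately, since the target is part of the data of a function; the function
names are hypothetical.

```python
def is_injective(f):
    """Distinct elements of the source (the keys) must have distinct images."""
    return len(set(f.values())) == len(f)


def is_surjective(f, B):
    """Every element of the target B must be the image of some element."""
    return set(f.values()) == set(B)


def is_bijective(f, B):
    return is_injective(f) and is_surjective(f, B)


B = {"x", "y", "z"}
print(is_injective({1: "x", 2: "y"}), is_surjective({1: "x", 2: "y"}, B))  # True False
print(is_bijective({1: "x", 2: "y", 3: "z"}, B))                           # True
print(is_injective({1: "x", 2: "x", 3: "y", 4: "z"}))                      # False
```

Note that the same dictionary may be surjective for one choice of target and not for
another, which is one reason the target has to be recorded.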

This terminology allows us to make better sense of the considerations on ‘disjoint
union’ given in §1.3: the ‘copies’ $A'$, $B'$ of the given sets $A$, $B$ should simply
be isomorphic sets to $A$, $B$, respectively. The proposal given at the end of §1.4 to
produce such disjoint ‘copies’ works, because (for example) the function

$f : A \to \{0\} \times A$

defined by

$(\forall a \in A)\quad f(a) = (0, a)$

is manifestly a bijection.

9 Often one checks this definition in the contrapositive (hence equivalent) formulation, that is,
$(\forall a' \in A)\ (\forall a'' \in A)\quad f(a') = f(a'') \implies a' = a''$.

2.5. Injections, surjections, bijections: Second viewpoint. There is an
alternative and instructive way to think about these notions.

If $f : A \to B$ is a bijection, then we can ‘flip its graph’ and define a function

$g : B \to A$;

that is, we can let $a = g(b)$ precisely when $b = f(a)$. (The fact that $f$ is both
injective and surjective guarantees that the flip of $\Gamma_f$ is the graph of a function
according to the definition given in §2.1. Check this!)

This function $g$ has a very interesting property: graphically,

[Diagrams: the triangle formed by $f : A \to B$, $g : B \to A$, and $\mathrm{id}_A : A \to A$; and the triangle formed by $g : B \to A$, $f : A \to B$, and $\mathrm{id}_B : B \to B$.]

commute; that is, $g \circ f = \mathrm{id}_A$ and $f \circ g = \mathrm{id}_B$. The first identity tells us that $g$ is
a ‘left-inverse’$^{10}$ of $f$; the second tells us that $g$ is a ‘right-inverse’ of $f$. We simply
say that it is the inverse of $f$, denoted $f^{-1}$. Thus, ‘bijections have inverses’.

What about the converse? If a function has an inverse, is it a bijection? This 
is true, but in fact we can be much more precise. 

Proposition 2.1. Assume $A \neq \emptyset$, and let $f : A \to B$ be a function. Then

(1) $f$ has a left-inverse if and only if it is injective.

(2) $f$ has a right-inverse if and only if it is surjective.


Proof. Let’s prove (1).

($\Longrightarrow$) If $f : A \to B$ has a left-inverse, then there exists a $g : B \to A$ such that
$g \circ f = \mathrm{id}_A$. Now assume that $a' \neq a''$ are arbitrary different elements in $A$; then

$g(f(a')) = \mathrm{id}_A(a') = a' \neq a'' = \mathrm{id}_A(a'') = g(f(a''))$;

that is, $g$ sends $f(a')$ and $f(a'')$ to different elements. This forces $f(a')$ and $f(a'')$
to be different, showing that $f$ is injective.


($\Longleftarrow$) Now assume $f : A \to B$ is injective. In order to construct a function
$g : B \to A$, we have to assign a unique value $g(b) \in A$ for each element $b \in B$. For
this, choose any fixed element $s \in A$ (which we can do because $A \neq \emptyset$); then set

$g(b) := \begin{cases} a & \text{if } b = f(a) \text{ for some } a \in A, \\ s & \text{if } b \notin \mathrm{im}\, f. \end{cases}$

In words, if $b$ is the image of an element $a$ of $A$, send it back to $a$; otherwise, send
it to your fixed element $s$.

10 Never mind that $g$ is drawn to the right of $f$ in the diagram; we say that $g$ is a left-inverse
of $f$ because it is written to the left of $f$: $g \circ f = \mathrm{id}_A$.

The given assignment defines a function, precisely because $f$ is injective: indeed,
this guarantees that every $b$ that is the image of some $a \in A$ by $f$ is the image of a
unique $a$ (two distinct elements of $A$ cannot be simultaneously sent to $b$ by $f$, since
$f$ is injective). Thus every $b \in B$ is sent to a unique well-defined element of $A$, as
is required of functions.

Finally, the function $g : B \to A$ is a left-inverse of $f$. Indeed, if $a \in A$, then
$b = f(a)$ is of the first type, so it is sent back to $a$ by $g$; that is, $g \circ f(a) = a = \mathrm{id}_A(a)$
for all $a \in A$, as needed.

The proof of (2) is left as an exercise (Exercise 2.2). □ 

Corollary 2.2. A function $f : A \to B$ is a bijection if and only if it has a
(two-sided) inverse.

If a function is injective but not surjective, then it will not have a right-inverse, 
and it will necessarily have more than one left-inverse (this should be clear from 
the argument given in the proof of Proposition 2.1). Similarly, a surjective function 
will in general have many right-inverses; they are often called sections. 
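
The construction in the proof of Proposition 2.1 (1) can be carried out literally for
finite sets. The sketch below is an illustration under the assumption that functions
are stored as dictionaries; `left_inverse` is a hypothetical name, and `s` plays the
role of the fixed element chosen in the proof.

```python
def left_inverse(f, B, s):
    """Build g : B -> A with g o f = id_A, following the proof of Proposition 2.1 (1).

    f is an injective function stored as a dict, B is its target, and s is any
    fixed element of the source A.
    """
    if len(set(f.values())) != len(f):
        raise ValueError("f is not injective, so it has no left-inverse")
    g = {b: s for b in B}   # elements outside im f are sent to s
    for a, b in f.items():
        g[b] = a            # b = f(a) is sent back to a
    return g


f = {1: "x", 2: "y"}        # injective, not surjective onto B
B = {"x", "y", "z"}

g1 = left_inverse(f, B, s=1)
g2 = left_inverse(f, B, s=2)

print(all(g1[f[a]] == a for a in f))  # True
print(all(g2[f[a]] == a for a in f))  # True
print(g1 == g2)                       # False: two different left-inverses
```

The two left-inverses differ only on the element `"z"`, which lies outside the image
of `f`; this is where the freedom in choosing `s` shows up.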

Proposition 2.1 hints that something deep is going on here. The definition 
of injective and surjective maps given in §2.4 relied crucially on working directly 
with the elements of our sets; Proposition 2.1 shows that in fact these properties 
are detected by the way functions are ‘organized’ among sets. Even if we did not 
know what ‘elements’ means, still we could make sense of the notions of injectivity 
and surjectivity (and hence of isomorphisms of sets) by exclusively referring to 
properties of functions. 

This is a more ‘mature’ point of view and one that will be championed when we 
talk about categories. To some extent, it should cure the reader from the discomfort 
of talking about ‘elements’, as we did in our informal introduction to sets, without 
defining what these mysterious entities are supposed to be. 

The standard notation for the inverse of a bijection $f$ is $f^{-1}$. This symbol is
also used for functions that are not bijections, but in a slightly different context: if
$f : A \to B$ is any function and $T \subseteq B$ is a subset of $B$, then $f^{-1}(T)$ denotes the
subset of $A$ of ‘all elements that map to $T$’; that is,

$f^{-1}(T) = \{a \in A \mid f(a) \in T\}.$

If $T = \{q\}$ consists of a single element of $B$, $f^{-1}(T)$ (abbreviated $f^{-1}(q)$) is called
the fiber of $f$ over $q$. Thus a function $f : A \to B$ is a bijection if it has nonempty
fibers over all elements of $B$ (that is, $f$ is surjective), and these fibers are in fact
singletons (that is, $f$ is injective). In this case, this notation $f^{-1}$ matches nicely
with the notation of ‘inverse’ mentioned above.
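
Fibers are straightforward to compute for finite sets. In this sketch (dictionaries as
functions, with hypothetical helper names) the bijectivity test is phrased exactly as
in the paragraph above: every fiber must be a singleton.

```python
def preimage(f, T):
    """All elements of the source that map into the subset T of the target."""
    return {a for a in f if f[a] in T}


def fiber(f, q):
    """The fiber of f over the single element q."""
    return preimage(f, {q})


def is_bijection_via_fibers(f, B):
    """f : A -> B is a bijection exactly when every fiber has one element."""
    return all(len(fiber(f, q)) == 1 for q in B)


f = {1: "x", 2: "y", 3: "x"}
B = {"x", "y", "z"}

print(fiber(f, "x") == {1, 3})   # True: two elements, so f is not injective
print(fiber(f, "z") == set())    # True: empty fiber, so f is not surjective
print(is_bijection_via_fibers({1: "x", 2: "y", 3: "z"}, B))  # True
```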

2.6. Monomorphisms and epimorphisms. There is yet another way to express
injectivity and surjectivity, which appears at first more complicated than what we
have seen so far but which is in fact even more basic.

A function $f : A \to B$ is a monomorphism (or monic) if the following holds:
for all sets $Z$ and all functions $\alpha', \alpha'' : Z \to A$,

$f \circ \alpha' = f \circ \alpha'' \implies \alpha' = \alpha''.$

Proposition 2.3. A function is injective if and only if it is a monomorphism. 


Proof. ($\Longrightarrow$) By Proposition 2.1, if a function $f : A \to B$ is injective, then it
has a left-inverse $g : B \to A$. Now assume that $\alpha'$, $\alpha''$ are arbitrary functions from
another set $Z$ to $A$ and that

$f \circ \alpha' = f \circ \alpha''$;

compose on the left by $g$, and use associativity of composition:

$(g \circ f) \circ \alpha' = g \circ (f \circ \alpha') = g \circ (f \circ \alpha'') = (g \circ f) \circ \alpha''$;

since $g$ is a left-inverse of $f$, this says

$\mathrm{id}_A \circ \alpha' = \mathrm{id}_A \circ \alpha''$,

and therefore

$\alpha' = \alpha''$,

as needed to conclude that $f$ is a monomorphism.

($\Longleftarrow$) Now assume that $f$ is a monomorphism. This says something about
arbitrary sets $Z$ and arbitrary functions $Z \to A$; we are going to use a microscopic
portion of this information, choosing $Z$ to be any singleton $\{p\}$. Then assigning
functions $\alpha', \alpha'' : Z \to A$ amounts to choosing to which elements $a' = \alpha'(p)$,
$a'' = \alpha''(p)$ we should send the single element $p$ of $Z$. For this particular choice of
$Z$, the property defining monomorphisms,

$f \circ \alpha' = f \circ \alpha'' \implies \alpha' = \alpha''$

becomes

$f \circ \alpha'(p) = f \circ \alpha''(p) \implies \alpha' = \alpha''$;

that is

$f(a') = f(a'') \implies \alpha' = \alpha''$.

Now two functions from $Z = \{p\}$ to $A$ are equal if and only if they send $p$ to the
same element, so this says

$f(a') = f(a'') \implies a' = a''$.

This has to be true for all $\alpha'$, $\alpha''$, that is, for all choices of distinct $a'$, $a''$ in $A$. In
other words, $f$ has to be injective, as was to be shown. □
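
The cancellation property can be explored by brute force when all sets involved are
small. The sketch below is only an experiment, not a proof: it tests the defining
property of monomorphisms against a single test set $Z$ supplied by the caller, with
functions stored as dictionaries and all names (`compose`, `all_functions`,
`cancels_on_the_left`) chosen here for illustration.

```python
from itertools import product


def compose(g, f):
    """Return g o f as a dict: first apply f, then g."""
    return {z: g[f[z]] for z in f}


def all_functions(Z, A):
    """Yield every function Z -> A as a dict (there are |A|**|Z| of them)."""
    Z, A = list(Z), list(A)
    for values in product(A, repeat=len(Z)):
        yield dict(zip(Z, values))


def cancels_on_the_left(f, Z):
    """Check that f o a1 == f o a2 implies a1 == a2 for all a1, a2 : Z -> A."""
    A = list(f)  # the source of f is its set of keys
    candidates = list(all_functions(Z, A))
    return all(a1 == a2
               for a1 in candidates
               for a2 in candidates
               if compose(f, a1) == compose(f, a2))


injective = {1: "x", 2: "y"}
not_injective = {1: "x", 2: "x"}

# A singleton Z already detects the failure of injectivity, as in the proof.
print(cancels_on_the_left(injective, {"p"}))      # True
print(cancels_on_the_left(not_injective, {"p"}))  # False
```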


The reader should now expect that there be a definition in the style of the one
given for monomorphisms and which will turn out to be equivalent to ‘surjective’.
This is the case: such a notion is called epimorphism. Finding it, and proving
the equivalence with the ordinary definition of ‘surjective’, is left to the reader$^{11}$
(Exercise 2.5).

11 This is a particularly important exercise, and we recommend that the reader write out all
the gory details carefully.

2.7. Basic examples. The basic operations on sets provided us with several
important examples of injective and surjective functions.

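Example 2.4. Let $A$, $B$ be sets. Then there are natural projections $\pi_A$, $\pi_B$:

[Diagram: $A \xleftarrow{\ \pi_A\ } A \times B \xrightarrow{\ \pi_B\ } B$]

defined by

$\pi_A((a, b)) := a, \quad \pi_B((a, b)) := b$

for all $(a, b) \in A \times B$. Both of these maps are (clearly) surjective. ⌟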




Example 2.5. Similarly, there are natural injections from $A$ and $B$ to the disjoint
union:

[Diagram: $A \hookrightarrow A \amalg B \hookleftarrow B$]

obtained by sending $a \in A$ (resp., $b \in B$) to the corresponding element in the
isomorphic copy $A'$ of $A$ (resp., $B'$ of $B$) in $A \amalg B$. ⌟


Example 2.6. If $\sim$ is an equivalence relation on a set $A$, there is a (clearly
surjective) canonical projection

$A \twoheadrightarrow A/{\sim}$

obtained by sending every $a \in A$ to its equivalence class $[a]_\sim$.


2.8. Canonical decomposition. The reason why we focus our attention on
injective and surjective maps is that they provide the basic ‘bricks’ out of which any
function may be constructed.

To see this, we observe that every function $f : A \to B$ determines an equivalence
relation $\sim$ on $A$ as follows: for all $a', a'' \in A$,

$a' \sim a'' \iff f(a') = f(a'')$.

(The reader should check that this is indeed an equivalence relation.)


Theorem 2.7. Let $f : A \to B$ be any function, and define $\sim$ as above. Then $f$
decomposes as follows:

$A \twoheadrightarrow A/{\sim} \xrightarrow[\ \tilde{f}\ ]{\ \sim\ } \mathrm{im}\, f \hookrightarrow B$, with composite $f : A \to B$,

where the first function is the canonical projection $A \to A/{\sim}$ (as in Example 2.6),
the third function is the inclusion $\mathrm{im}\, f \subseteq B$, and the bijection $\tilde{f}$ in the middle is
defined by

$\tilde{f}([a]_\sim) := f(a)$

for all $a \in A$.

The formula defining $\tilde{f}$ shows immediately that the diagram commutes; so all
we have to verify in order to prove this theorem is that

• that formula does define a function;

• that function is in fact a bijection.

The first item is an instance of a class of verifications of the utmost importance.
The formula given for $\tilde{f}$ has a colossal built-in ambiguity: the same element in $A/{\sim}$
may be the equivalence class of many elements of $A$; applying the formula for $\tilde{f}$
requires choosing one of these elements and applying $f$ to it. We have to prove
that the result of this operation is independent from this choice: that is, that all
possible choices of representatives for that equivalence class lead to the same result.

We encode this type of situation by saying that we have to verify that $\tilde{f}$ is
well-defined. We will often have to check that the operations we consider are
well-defined, in contexts very similar to the one epitomized here.


Proof. Spelling out the first item discussed above, we have to verify that, for all
$a'$, $a''$ in $A$,

$[a']_\sim = [a'']_\sim \implies f(a') = f(a'')$.

Now $[a']_\sim = [a'']_\sim$ means that $a' \sim a''$, and the definition of $\sim$ has been engineered
precisely so that this would mean $f(a') = f(a'')$ as required here. So $\tilde{f}$ is indeed
well-defined.

To verify the second item, that is, that $\tilde{f} : A/{\sim} \to \mathrm{im}\, f$ is a bijection, we check
explicitly that $\tilde{f}$ is injective and surjective.

Injective: If $\tilde{f}([a']_\sim) = \tilde{f}([a'']_\sim)$, then $f(a') = f(a'')$ by definition of $\tilde{f}$; hence
$a' \sim a''$ by definition of $\sim$, and then $[a']_\sim = [a'']_\sim$. Therefore

$\tilde{f}([a']_\sim) = \tilde{f}([a'']_\sim) \implies [a']_\sim = [a'']_\sim$,

proving injectivity.

Surjective: Given any $b \in \mathrm{im}\, f$, there is an element $a \in A$ such that $f(a) = b$.
Then

$\tilde{f}([a]_\sim) = f(a) = b$

by definition of $\tilde{f}$. Since $b$ was arbitrary in $\mathrm{im}\, f$, this shows that $\tilde{f}$ is surjective, as
needed. □
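
The three pieces of the canonical decomposition can be computed explicitly for a
function between finite sets. The following sketch assumes the dictionary encoding of
functions used informally above; `canonical_decomposition` is a hypothetical name, and
equivalence classes are stored as `frozenset` objects so that they can serve as
dictionary keys.

```python
def canonical_decomposition(f):
    """Return (projection, f_tilde, inclusion) for a function f stored as a dict."""
    # [a]_~ is the set of all elements with the same image as a
    projection = {a: frozenset(x for x in f if f[x] == f[a]) for a in f}  # A -> A/~
    # f_tilde([a]_~) := f(a); any representative gives the same value
    f_tilde = {cls: f[next(iter(cls))] for cls in set(projection.values())}  # A/~ -> im f
    inclusion = {b: b for b in set(f.values())}  # im f -> B
    return projection, f_tilde, inclusion


f = {1: "x", 2: "y", 3: "x", 4: "x"}
projection, f_tilde, inclusion = canonical_decomposition(f)

# The diagram commutes: projecting, applying f_tilde, then including recovers f.
print(all(inclusion[f_tilde[projection[a]]] == f[a] for a in f))  # True

# f_tilde is a bijection between the two classes and the two elements of im f.
print(len(f_tilde), len(set(f_tilde.values())))  # 2 2
```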


Theorem 2.7 shows that every function is the composition of a surjection, fol- 
lowed by an isomorphism, followed by an injection. While its proof is trivial, this is 
a result of some importance, since it is the prototype of a situation that will occur 
several times in this book. It will resurface every now and then, with names such 
as ‘the first isomorphism theorem’. 

2.9. Clarification. Finally, we can begin to clarify one comment about disjoint
unions, products, and quotients, made in §1.4. Our definition of $A \amalg B$ was the
(conventional) union of two disjoint sets $A'$, $B'$ isomorphic to $A$, $B$, respectively.
It is easy to provide a way to effectively produce such isomorphic copies (as we did
in §1.4); but it is in fact a little too easy: many other choices are possible, and one
does not look any better than any other. It is in fact more sensible not to make a
fixed choice once and for all and simply accept the fact that all of them produce
acceptable candidates for $A \amalg B$. From this egalitarian standpoint, the result of the
operation $A \amalg B$ is not ‘well-defined’ as a set in the sense specified above. However,
it is easy to see (Exercise 2.9) that $A \amalg B$ is well-defined up to isomorphism: that
is, that any two choices for the copies $A'$, $B'$ lead to isomorphic candidates for
$A \amalg B$. The same considerations apply to products and quotients.

The main feature of sets obtained by taking disjoint unions, products, or
quotients is not really ‘what elements they contain’ but rather ‘their relationship with
all other sets’. This will be (even) clearer when we revisit these operations and
others$^{12}$ in the context of categories.


Exercises 

2.1. ▷ How many different bijections are there between a set $S$ with $n$ elements
and itself? [§II.2.1]

2.2. ▷ Prove statement (2) in Proposition 2.1. You may assume that given a family
of disjoint subsets of a set, there is a way to choose one element in each member of
the family$^{13}$. [§2.5, V.3.3]

2.3. Prove that the inverse of a bijection is a bijection and that the composition
of two bijections is a bijection.

2.4. ▷ Prove that ‘isomorphism’ is an equivalence relation (on any set of sets). [§4.1]

2.5. ▷ Formulate a notion of epimorphism, in the style of the notion of
monomorphism seen in §2.6, and prove a result analogous to Proposition 2.3, for
epimorphisms and surjections. [§2.6, §4.2]

2.6. With notation as in Example 2.4, explain how any function $f : A \to B$
determines a section of $\pi_A$.

2.7. Let $f : A \to B$ be any function. Prove that the graph $\Gamma_f$ of $f$ is isomorphic
to $A$.

2.8. Describe as explicitly as you can all terms in the canonical decomposition
(cf. §2.8) of the function $\mathbb{R} \to \mathbb{C}$ defined by $r \mapsto e^{2\pi i r}$. (This exercise matches one
assigned previously. Which one?)

2.9. ▷ Show that if $A' \cong A''$ and $B' \cong B''$, and further $A' \cap B' = \emptyset$ and $A'' \cap B'' = \emptyset$,
then $A' \cup B' \cong A'' \cup B''$. Conclude that the operation $A \amalg B$ (as described in §1.4)
is well-defined up to isomorphism (cf. §2.9). [§2.9, 5.7]

2.10. ▷ Show that if $A$ and $B$ are finite sets, then $|B^A| = |B|^{|A|}$. [§2.1, 2.11, §II.4.1]

2.11. ▷ In view of Exercise 2.10, it is not unreasonable to use $2^A$ to denote the set
of functions from an arbitrary set $A$ to a set with 2 elements (say $\{0, 1\}$). Prove
that there is a bijection between $2^A$ and the power set of $A$ (cf. §1.2). [§1.2, III.2.3]

12 The reader should also be aware that there are important variations on the operations we
have seen so far; particularly important are the fibered flavors of products and disjoint unions.

13 This (reasonable) statement is the axiom of choice; cf. §V.3.


3. Categories 

The language of categories is affectionately known as abstract nonsense, so named
by Norman Steenrod. This term is essentially accurate and not necessarily
derogatory: categories refer to nonsense in the sense that they are all about the ‘structure’,
and not about the ‘meaning’, of what they represent. The emphasis is less on how
you run into a specific set you are looking at and more on how that set may sit
in relationship with all other sets. Worse (or better) still, the emphasis is less on
studying sets, and functions between sets, than on studying ‘things, and things
that go from things to things’ without necessarily being explicit about what these
things are: they may be sets, or groups, or rings, or vector spaces, or modules, or
other objects that are so exotic that the reader has no right whatsoever to know
about them (yet).

‘Categories’ will intuitively look like sets at first, and in multiple ways.
Categories may make you think of sets, in that they are ‘collections of objects’, and
further there will be notions of ‘functions from categories to categories’ (called
functors$^{14}$). At the same time, every category may make you think of the collection of
all sets, since there will be analogs of ‘functions’ among the things it contains.

3.1. Definition. The definition of a category looks complicated at first, but the 
gist of it may be summarized quickly: a category consists of a collection of ‘objects’, 
and of ‘morphisms’ between these objects, satisfying a list of natural conditions. 

The reader will note that I refrained from writing a set of objects, opting for
the more generic ‘collection’. This is an annoying, but unavoidable, difficulty: for
example, we want to have a ‘category of sets’, in which the ‘objects’ are sets and
the ‘morphisms’ are functions between sets, and the problem is that there simply
is not a set of all sets$^{15}$. In a sense, the collection of all sets is ‘too big’ to be a set.
There are however ways to deal with such ‘collections’, and the technical name for
them is class. There is a ‘class’ of all sets (and there will be classes taking care of
groups, rings, etc.).

An alternative would be to define a large enough set (called a universe) and 
then agree that all objects of all categories will be chosen from this gigantic entity. 

In any case, all the reader needs to know about this is that there is a way to
make it work. We will use the term ‘class’ in the definition, but this will not affect
any proof or any other definition in this book. Further, in some of the examples
considered below the class in question is a set (we say that the category is small
in this case), so the reader will feel perfectly at home when contemplating these
examples.

14 However, we will not consider functors until later chapters: our first formal encounter with
functors will be in Chapter VIII.

15 That is one thing we learn from Russell’s paradox.

Definition 3.1. A category $\mathsf{C}$ consists of

• a class $\mathrm{Obj}(\mathsf{C})$ of objects of the category; and

• for every two objects $A$, $B$ of $\mathsf{C}$, a set $\mathrm{Hom}_{\mathsf{C}}(A, B)$ of morphisms, with the
properties listed below. ⌟

As a prototype to keep in mind, think of the objects as ‘sets’ and of morphisms
as ‘functions’. This one example should make the defining properties of morphisms
look natural and easy to remember:

• For every object $A$ of $\mathsf{C}$, there exists (at least) one morphism $1_A \in \mathrm{Hom}_{\mathsf{C}}(A, A)$,
the ‘identity’ on $A$.

• One can compose morphisms: two morphisms $f \in \mathrm{Hom}_{\mathsf{C}}(A, B)$ and
$g \in \mathrm{Hom}_{\mathsf{C}}(B, C)$ determine a morphism $gf \in \mathrm{Hom}_{\mathsf{C}}(A, C)$. That is, for every
triple of objects $A$, $B$, $C$ of $\mathsf{C}$ there is a function (of sets)

$\mathrm{Hom}_{\mathsf{C}}(A, B) \times \mathrm{Hom}_{\mathsf{C}}(B, C) \to \mathrm{Hom}_{\mathsf{C}}(A, C)$,

and the image of the pair $(f, g)$ is denoted $gf$.

• This ‘composition law’ is associative: if $f \in \mathrm{Hom}_{\mathsf{C}}(A, B)$, $g \in \mathrm{Hom}_{\mathsf{C}}(B, C)$,
and $h \in \mathrm{Hom}_{\mathsf{C}}(C, D)$, then

$(hg)f = h(gf)$.

• The identity morphisms are identities with respect to composition: that is,
for all $f \in \mathrm{Hom}_{\mathsf{C}}(A, B)$ we have

$f 1_A = f, \quad 1_B f = f$.

This is really a mouthful, but again, to remember all this, just think of functions
of sets. One further requirement is that the sets

$\mathrm{Hom}_{\mathsf{C}}(A, B), \quad \mathrm{Hom}_{\mathsf{C}}(C, D)$

be disjoint unless $A = C$, $B = D$; this is something you do not usually think about,
but again it holds for ordinary set-functions$^{16}$. That is, if two functions are one and
the same, then necessarily they have the same source and the same target: source
and target are part of the datum of a set-function.

A morphism of an object $A$ of a category $\mathsf{C}$ to itself is called an endomorphism;
$\mathrm{Hom}_{\mathsf{C}}(A, A)$ is denoted $\mathrm{End}_{\mathsf{C}}(A)$. One of the axioms of a category tells us that
this is a ‘pointed’ set, as $1_A \in \mathrm{End}_{\mathsf{C}}(A)$. The reader should note that composition
defines an ‘operation’ on $\mathrm{End}_{\mathsf{C}}(A)$: if $f$, $g$ are elements of $\mathrm{End}_{\mathsf{C}}(A)$, so is their
composition $gf$.

Writing ‘$f \in \mathrm{Hom}_{\mathsf{C}}(A, B)$’ gets tiresome in the long run. If the category is
understood, one may safely drop the index $\mathsf{C}$, or even use arrows as we do with
set-functions: $f : A \to B$. This also allows us to draw diagrams of morphisms in
any category; a diagram is said to ‘commute’ (or to be a ‘commutative’ diagram) if
all ways to traverse it lead to the same results of composing morphisms along the
way, just as explained for diagrams of functions of sets in §2.3.

16 We will often use the term ‘set-function’ to emphasize that we are dealing with a function
in the context of sets.

In fact, we will now feel free to use diagrams as possible objects of categories.
The official definition of a diagram in this context would be a set of objects of a
category $\mathsf{C}$, along with prescribed morphisms between these objects; the diagram
commutes if it does in the sense specified above. The specifics of the visual
representation of a diagram are of course irrelevant.

3.2. Examples. The reader should note that 90% of the definition of the notion 
of category goes into explaining the properties of its morphisms; it is fair to say 
that the morphisms are the important constituents of a category. Nevertheless, 
it is psychologically irresistible to think of a category in terms of its objects: for 
example, one talks about the ‘category of sets’. The point is that usually the kind 
of ‘morphisms’ one may consider are (psychologically at least) determined by the 
objects: if one is talking about sets, what can one possibly mean for ‘morphism’ 
other than a function of sets? In other situations (cf. Example 3.5 below or Exer- 
cise 3.9) it is a little less clear what the morphisms should be, and looking for the 
‘right’ notion may be an interesting project. 

Example 3.2. It is hopefully crystal clear by now that sets (as objects), together
with set-functions (as morphisms), form a category; if not, the reader must stop
here and go no further until this assertion sheds any residual mystery$^{17}$.

There is no universally accepted, official notation for this important category.
It is customary to write the word ‘Set’ or ‘Sets’, with some fancy decoration for
emphasis. For example, in the literature one may encounter Set, $\mathcal{S}\mathit{et}$, $\mathfrak{Set}$, (Sets),
and many amusing variations on these themes. We will use ‘sans-serif’ fonts to
denote categories; thus, $\mathsf{Set}$ will denote the category of sets. Thus

• $\mathrm{Obj}(\mathsf{Set})$ = the class of all sets;

• for $A$, $B$ in $\mathrm{Obj}(\mathsf{Set})$ (that is, for $A$, $B$ sets) $\mathrm{Hom}_{\mathsf{Set}}(A, B) = B^A$.

Note that the presence of the operations recalled in §§1.3-1.5 is not part of the
definition of category: these operations highlight interesting features of $\mathsf{Set}$, which
may or may not be shared by other categories. We will soon come back to some of
these operations and understand more precisely what they say about $\mathsf{Set}$. ⌟

Example 3.3. Here is a completely different example.

Suppose $S$ is a set and $\sim$ is a relation on $S$ satisfying the reflexive and transitive
properties. Then we can encode this data into a category:

• objects: the elements of $S$;

• morphisms: if $a$, $b$ are objects (that is, if $a, b \in S$), then let $\mathrm{Hom}(a, b)$ be the
set consisting of the element $(a, b) \in S \times S$ if $a \sim b$, and let $\mathrm{Hom}(a, b) = \emptyset$
otherwise.

Note that (unlike in $\mathsf{Set}$) there are very few morphisms: at most one for any pair
of objects, and no morphisms at all between ‘unrelated’ objects.

17 We will give the reader such prompts every now and then: at key times, it is more useful
to take stock of what one knows than blindly march forward hoping for the best. A difficulty at
this time signals the need to reread the previous material carefully. If the mystery persists, that’s
what office hours are there for. But typically you should be able to find your way out on your
own, based on the information we have given you, and you will most likely learn more this way.
You should give it your best try before seeking professional help.

We have to define ‘composition of morphisms’ and verify that the conditions
specified in §3.1 are satisfied. First of all, do we have ‘identities’? If $a$ is an object
(that is, if $a \in S$), we need to find an element

$1_a \in \mathrm{Hom}(a, a)$.

This is precisely why we are assuming that $\sim$ is reflexive: this tells us that $\forall a$,
$a \sim a$; that is, $\mathrm{Hom}(a, a)$ consists of the single element $(a, a)$. So we have no choice:
we must let

$1_a = (a, a) \in \mathrm{Hom}(a, a)$.

As for composition, let $a$, $b$, $c$ be objects (that is, elements of $S$) and

$f \in \mathrm{Hom}(a, b), \quad g \in \mathrm{Hom}(b, c)$;

we have to define a corresponding morphism $gf \in \mathrm{Hom}(a, c)$. Now,

$f \in \mathrm{Hom}(a, b)$

tells us that $\mathrm{Hom}(a, b)$ is nonempty, and according to the definition of morphisms
in this category that means that $a \sim b$, and $f$ is in fact the element $(a, b)$ of $S \times S$.
Similarly, $g \in \mathrm{Hom}(b, c)$ tells us $b \sim c$ and $g = (b, c)$. Now

$a \sim b$ and $b \sim c \implies a \sim c$

since we are assuming that $\sim$ is transitive. This tells us that $\mathrm{Hom}(a, c)$ consists of
the single element $(a, c)$. Thus we again have no choice: we must let

$gf := (a, c) \in \mathrm{Hom}(a, c)$.

Is this operation associative? If $f \in \mathrm{Hom}(a, b)$, $g \in \mathrm{Hom}(b, c)$, and $h \in \mathrm{Hom}(c, d)$,
then necessarily

$f = (a, b), \quad g = (b, c), \quad h = (c, d)$

and

$gf = (a, c), \quad hg = (b, d)$

and hence

$h(gf) = (a, d) = (hg)f$,

proving associativity.

The reader will have no difficulties checking that $1_a$ is an identity with respect
to this composition, as needed (Exercise 3.3).
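
For a small concrete instance, the axioms can also be confirmed by exhaustive search.
The sketch below is an experiment on one example and not a substitute for the general
argument: it assumes $S = \{1, 2, 3, 6\}$ with $a \sim b$ meaning ‘$a$ divides $b$’
(a reflexive and transitive relation chosen here for illustration), and all function
names are hypothetical.

```python
from itertools import product

S = [1, 2, 3, 6]


def related(a, b):
    """a ~ b iff a divides b; this relation is reflexive and transitive."""
    return b % a == 0


def hom(a, b):
    """Hom(a, b) is {(a, b)} if a ~ b and empty otherwise."""
    return {(a, b)} if related(a, b) else set()


def identity(a):
    return (a, a)


def compose(g, f):
    """The composite gf of f = (a, b) and g = (b, c) is (a, c)."""
    assert f[1] == g[0], "morphisms are not composable"
    return (f[0], g[1])


def check_axioms():
    """Verify identities, closure under composition, and associativity on S."""
    for a, b, c, d in product(S, repeat=4):
        for f in hom(a, b):
            assert compose(f, identity(a)) == f
            assert compose(identity(b), f) == f
            for g in hom(b, c):
                assert compose(g, f) in hom(a, c)
                for h in hom(c, d):
                    assert compose(h, compose(g, f)) == compose(compose(h, g), f)
    return True


print(check_axioms())  # True
print(hom(2, 3))       # set(): 2 does not divide 3, so there is no morphism
```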

The most trivial instance of this construction is the category obtained from a 
set S taken with the equivalence relation ‘=’; that is, the only morphisms are the 
identity morphisms. These categories are called discrete. 

As another example, consider the category corresponding to endowing Z with
the relation $\le$. For example,

[Diagram: integers joined by arrows, including a vertical arrow $3 \to 3$; the layout is not recoverable from the source.]

is a (randomly chosen) commutative diagram in this category. It would still be a
(commutative) diagram in this category if we reversed the vertical arrow $3 \to 3$ or
if we added an arrow from 3 to 4, while we are not allowed to draw an arrow from
4 to 3, since $4 \not\le 3$.

These categories are very special— for example, every diagram drawn in them 
is necessarily commutative, and this is very far from being the case in, e.g., Set. 
Also note that these categories are all small. ⌟

Example 3.4. Here is another¹⁸ example of a small category.

Let S again be a set. Define a category $\hat{S}$ by setting

• $\mathrm{Obj}(\hat{S}) = \mathscr{P}(S)$, the power set of S (cf. §1.2 and Exercise 2.11);

• for A, B objects of $\hat{S}$ (that is, $A \subseteq S$ and $B \subseteq S$) let $\mathrm{Hom}_{\hat{S}}(A, B)$ be the pair
(A, B) if $A \subseteq B$, and let $\mathrm{Hom}_{\hat{S}}(A, B) = \emptyset$ otherwise.

The identity $1_A$ consists of the pair (A, A) (which is one, and in fact the only
one, morphism from A to A, since $A \subseteq A$). Composition is obtained by stringing
inclusions: if there are morphisms

\[ A \to B, \quad B \to C \]

in $\hat{S}$, then $A \subseteq B$ and $B \subseteq C$; hence $A \subseteq C$ and there is a morphism $A \to C$.
Checking the axioms specified in §3.1 should be routine (make sure this is the 
case!). 

Examples in this style (but employing more sophisticated structures, such as 
the family of open subsets of a topological space) are hugely important in well- 
established fields such as algebraic geometry. ⌟

Example 3.5. The next example is very abstract, but thinking about it will make
you rather comfortable with everything we have seen so far; and it is a very common
construction, variations of which will abound in the course.

Let C be a category, and let A be an object of C. We are going to define a
category $C_A$ whose objects are certain morphisms in C and whose morphisms are
certain diagrams of C (surprise!).

• $\mathrm{Obj}(C_A)$ = all morphisms from any object of C to A; thus, an object of $C_A$ is
a morphism $f \in \mathrm{Hom}_C(Z, A)$ for some object Z of C. Pictorially, an object of
$C_A$ is an arrow $Z \xrightarrow{\ f\ } A$ in C; these are often drawn ‘top-down’, as in

\[
\begin{array}{c}
Z \\
\downarrow{\scriptstyle f} \\
A
\end{array}
\]

What are morphisms in $C_A$ going to be? There really is only one sensible way to
assign morphisms to a category with objects as above. The brave reader will want
to stop reading here and continue only after having come up with the definition
independently. There will be many similar examples lurking behind constructions
we will encounter in this book, and ideally speaking, they should appear completely
natural when the time comes. A bit of effort devoted now to understanding this
prototype situation will have ample reward in the future. Spoiler follows, so put
these notes away now and jot down the definition of morphism in $C_A$.

¹⁸ Actually, this is again an instance of the categories considered in Example 3.3. Do you see
why? (Exercise 3.5.)


Welcome back. 


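• Let $f_1$, $f_2$ be objects of $C_A$, that is, two arrows

\[
\begin{array}{c}
Z_1 \\
\downarrow{\scriptstyle f_1} \\
A
\end{array}
\qquad
\begin{array}{c}
Z_2 \\
\downarrow{\scriptstyle f_2} \\
A
\end{array}
\]

in C. Morphisms $f_1 \to f_2$ are defined to be commutative diagrams

\[
\begin{array}{ccc}
Z_1 & \xrightarrow{\ \sigma\ } & Z_2 \\
 & {\scriptstyle f_1}\searrow \quad \swarrow{\scriptstyle f_2} & \\
 & A &
\end{array}
\]

in the ‘ambient’ category C.

That is, morphisms $f_1 \to f_2$ correspond precisely to those morphisms $\sigma : Z_1 \to Z_2$
in C such that $f_1 = f_2 \sigma$.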

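Once you understand what morphisms have to be, checking that they satisfy
the axioms spelled out in §3.1 is straightforward. The identities are inherited from
the identities in C: for $f : Z \to A$ in $C_A$, the identity $1_f$ corresponds to the diagram

\[
\begin{array}{ccc}
Z & \xrightarrow{\ 1_Z\ } & Z \\
 & {\scriptstyle f}\searrow \quad \swarrow{\scriptstyle f} & \\
 & A &
\end{array}
\]

which commutes by virtue of the fact that C is a category. Composition is also a
subproduct of composition in C. Two morphisms $f_1 \to f_2 \to f_3$ in $C_A$ correspond
to putting two commutative diagrams side-by-side:

\[
\begin{array}{ccccc}
Z_1 & \xrightarrow{\ \sigma\ } & Z_2 & \xrightarrow{\ \tau\ } & Z_3 \\
 & {\scriptstyle f_1}\searrow & \downarrow{\scriptstyle f_2} & \swarrow{\scriptstyle f_3} & \\
 & & A & &
\end{array}
\]

and then it follows (again because C is a category!) that the diagram obtained by
removing the central arrow, i.e.,

\[
\begin{array}{ccc}
Z_1 & \xrightarrow{\ \tau\sigma\ } & Z_3 \\
 & {\scriptstyle f_1}\searrow \quad \swarrow{\scriptstyle f_3} & \\
 & A &
\end{array}
\]

also commutes. Check all this(!), and verify that composition in $C_A$ is associative
(again, this follows immediately from the fact that composition is associative in C).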

Categories constructed in this fashion are called slice categories in the literature; 
they are particular cases of comma categories. ⌟

Example 3.6. For the sake of concreteness, let’s apply the construction given in
Example 3.5 to the category constructed in Example 3.3, say for S = Z and ~ the
relation $\le$. Call C this category, and choose an object A of C, that is, an integer,
for example, A = 3. Then the objects of $C_A$ are morphisms in C with target 3, that
is, pairs $(n, 3) \in Z \times Z$ with $n \le 3$. There is a morphism

\[ (m, 3) \to (n, 3) \]

if and only if $m \le n$. In this case $C_A$ may be harmlessly identified with the
‘subcategory’ of integers $\le 3$, with ‘the same’ morphisms as in C. ⌟
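
The identification made in Example 3.6 can be checked mechanically on a finite window of integers. The following Python sketch is only an illustration: the window `range(-5, 8)` and the helper names `hom_C` and `hom_slice` are assumptions introduced here, and a finite window can of course only sample the infinite category.

```python
# A finite window of the category C = (Z, <=); a morphism a -> b is the pair (a, b), a <= b.
WINDOW = range(-5, 8)
A = 3

def hom_C(a, b):
    """Hom_C(a, b): the single pair (a, b) if a <= b, empty otherwise."""
    return {(a, b)} if a <= b else set()

# Objects of the slice category C_A: morphisms of C with target A.
slice_objects = [f for n in WINDOW for f in hom_C(n, A)]

def hom_slice(f1, f2):
    """Morphisms f1 -> f2 in C_A: those sigma : Z1 -> Z2 in C with f1 = f2 sigma."""
    (z1, _), (z2, _) = f1, f2
    result = set()
    for sigma in hom_C(z1, z2):
        composite = (sigma[0], f2[1])  # f2 sigma, composed as in Example 3.3
        if composite == f1:
            result.add(sigma)
    return result
```

With these definitions, `hom_slice((m, 3), (n, 3))` is nonempty exactly when `m <= n`, which is the identification with the integers at most 3 described above.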

Example 3.7. An entirely similar example to the one explored in Example 3.5 may
be obtained by considering morphisms in a category C from a fixed object A to all
objects in C, again with morphisms defined by suitable commutative diagrams. This
leads to coslice categories. The reader should provide details of this construction
(Exercise 3.7). ⌟

Example 3.8. As a ‘concrete’ instance of a category as in Example 3.7, let C = Set 
and A = a fixed singleton {*}. Call the resulting category Set*.

An object in Set* is then a morphism $f : \{*\} \to S$ in Set, where S is any set.
The information of an object in Set* consists therefore of the choice of a nonempty
set S and of an element $s \in S$, that is, the element f(*): this element determines,
and is determined by, f.

Thus, we may denote objects of Set* as pairs (S, s), where S is any set and
$s \in S$ is any element of S.

A morphism between two such objects, $(S, s) \to (T, t)$, corresponds then (check
this!) to a set-function $\sigma : S \to T$ such that $\sigma(s) = t$.

Objects of Set* are called ‘pointed sets’. Many of the structures we will study
in this book will be pointed sets. For example (as we will see) a ‘group’ is a set G
with, among other requirements, a distinguished element $e_G$ (its ‘identity’); ‘group
homomorphisms’ will be functions which, among other properties, send identities to
identities; thus, they are morphisms of pointed sets in the sense considered above. ⌟
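
As a small sanity check on this description of Set*, the following Python sketch encodes pointed sets as pairs (S, s) and set-functions as dictionaries. The sample data and the names `is_pointed_morphism` and `compose` are hypothetical and serve only as an illustration.

```python
def is_pointed_morphism(sigma, source, target):
    """Check that sigma is a function S -> T with sigma(s) = t, for (S, s) and (T, t)."""
    (S, s), (T, t) = source, target
    is_function = set(sigma) == set(S) and all(sigma[x] in T for x in S)
    return is_function and sigma[s] == t

def compose(tau, sigma):
    """The composite tau sigma, encoded as a dict."""
    return {x: tau[sigma[x]] for x in sigma}

# Three pointed sets (S, s), chosen arbitrarily for the example.
X = ({0, 1, 2}, 0)
Y = ({"a", "b"}, "a")
W = ({"p", "q"}, "p")

sigma = {0: "a", 1: "b", 2: "b"}   # sends the distinguished point 0 to "a"
tau = {"a": "p", "b": "q"}         # sends the distinguished point "a" to "p"
not_pointed = {0: "b", 1: "a", 2: "a"}  # a function, but it moves the base point
```

Here `compose(tau, sigma)` is again a morphism of pointed sets, as it must be for Set* to be a category, while `not_pointed` is a perfectly good set-function that is not a morphism in Set*.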

Example 3.9. It is useful to contemplate a few more ‘abstract’ examples in the
style of Examples 3.5 and 3.7. These will be essential ingredients in the promised
revisitation of some of the operations mentioned in §1.3. Their definition will appear
disappointingly simple-minded to the reader who has mastered Examples 3.5 and 3.7.

This time we start from a given category C and two objects A, B of C. We
can define a new category $C_{A,B}$ by essentially the same procedure that we used in
order to define $C_A$:
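• $\mathrm{Obj}(C_{A,B})$ = diagrams

\[ A \xleftarrow{\ f\ } Z \xrightarrow{\ g\ } B \]

in C; and
• morphisms

\[ \bigl( A \xleftarrow{\ f_1\ } Z_1 \xrightarrow{\ g_1\ } B \bigr) \longrightarrow \bigl( A \xleftarrow{\ f_2\ } Z_2 \xrightarrow{\ g_2\ } B \bigr) \]

are commutative diagrams

\[
\begin{array}{ccccc}
 & & A & & \\
 & {\scriptstyle f_1}\nearrow & & \nwarrow{\scriptstyle f_2} & \\
Z_1 & & \xrightarrow{\ \sigma\ } & & Z_2 \\
 & {\scriptstyle g_1}\searrow & & \swarrow{\scriptstyle g_2} & \\
 & & B & &
\end{array}
\]

We will leave to the reader the task of formalizing this rough description. This
example is really nothing more than a mixture of $C_A$ and $C_B$, where the two
structures interact because of the stringent requirement that the same $\sigma$ must make both
sides of the diagram commute:

\[ f_1 = f_2 \sigma \quad \text{and} \quad g_1 = g_2 \sigma \]

‘simultaneously’.

Flipping most of the arrows gives an analogous variation of Example 3.7,
producing a category which we may¹⁹ denote $C^{A,B}$; details are left to the reader. ⌟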

¹⁹ There does not seem to be an established notation for these commonplace categories.

Example 3.10. As a final variation on these examples, we conclude by considering
the fibered version of $C_{A,B}$ (and $C^{A,B}$). Take this as a test to see if you have really
understood $C_{A,B}$; experts would tell you that this looks fairly sophisticated for
students just learning categories, so don’t get disheartened if it does not flow too
well at first (but pat yourself on the shoulder if it does!). Start with a given
category C, and this time choose two fixed morphisms $\alpha : A \to C$, $\beta : B \to C$ in C,
with the same target C. We can then consider a category $C_{\alpha,\beta}$ as follows:

• $\mathrm{Obj}(C_{\alpha,\beta})$ = commutative diagrams

\[
\begin{array}{ccc}
Z & \xrightarrow{\ f\ } & A \\
{\scriptstyle g}\downarrow & & \downarrow{\scriptstyle \alpha} \\
B & \xrightarrow{\ \beta\ } & C
\end{array}
\]

in C, and

• morphisms correspond to commutative diagrams: a morphism $\sigma : Z_1 \to Z_2$ in C
placed between two such objects, with $f_1 = f_2 \sigma$ and $g_1 = g_2 \sigma$.

A solid understanding of Example 3.9 will make this example look just as tame; at
this point the reader should have no difficulties formalizing it (that is, explaining
how composition works, what identities are, etc.).

Also left to the reader is the construction of the ‘mirror’ example $C^{\alpha,\beta}$, starting
from two morphisms $\alpha : C \to A$, $\beta : C \to B$ with common source. ⌟


Exercises 


3.1. ▷ Let C be a category. Consider a structure $C^{op}$ with

• $\mathrm{Obj}(C^{op}) := \mathrm{Obj}(C)$;

• for A, B objects of $C^{op}$ (hence objects of C), $\mathrm{Hom}_{C^{op}}(A, B) := \mathrm{Hom}_C(B, A)$.

Show how to make this into a category (that is, define composition of morphisms
in $C^{op}$ and verify the properties listed in §3.1).

Intuitively, the ‘opposite’ category $C^{op}$ is simply obtained by ‘reversing all the
arrows’ in C. [5.1, §VIII.1.1, §IX.1.2, IX.1.10]

3.2. If A is a finite set, how large is $\mathrm{End}_{\mathrm{Set}}(A)$?

3.3. ▷ Formulate precisely what it means to say that $1_a$ is an identity with respect
to composition in Example 3.3, and prove this assertion. [§3.2] 

3.4. Can we define a category in the style of Example 3.3 using the relation < on 
the set Z? 

3.5. ▷ Explain in what sense Example 3.4 is an instance of the categories considered
in Example 3.3. [§3.2] 

3.6. ▷ (Assuming some familiarity with linear algebra.) Define a category V by
taking Obj(V) = N and letting $\mathrm{Hom}_V(n, m)$ = the set of $m \times n$ matrices with real
entries, for all $n, m \in N$. (We will leave the reader the task of making sense of a
matrix with 0 rows or columns.) Use product of matrices to define composition. 
Does this category ‘feel’ familiar? [§VI.2.1, §VIII.1.3] 

3.7. ▷ Define carefully objects and morphisms in Example 3.7, and draw the
diagram corresponding to composition. [§3.2]

3.8. ▷ A subcategory C' of a category C consists of a collection of objects of C, with
morphisms $\mathrm{Hom}_{C'}(A, B) \subseteq \mathrm{Hom}_C(A, B)$ for all objects A, B in Obj(C'), such that
identities and compositions in C make C' into a category. A subcategory C' is full
if $\mathrm{Hom}_{C'}(A, B) = \mathrm{Hom}_C(A, B)$ for all A, B in Obj(C'). Construct a category of
infinite sets and explain how it may be viewed as a full subcategory of Set. [4.4,
§VI.1.1, §VIII.1.3]

3.9. ▷ An alternative to the notion of multiset introduced in §2.2 is obtained by
considering sets endowed with equivalence relations; equivalent elements are taken 
to be multiple instances of elements ‘of the same kind’. Define a notion of morphism 
between such enhanced sets, obtaining a category MSet containing (a ‘copy’ of) Set 
as a full subcategory. (There may be more than one reasonable way to do this! 
This is intentionally an open-ended exercise.) Which objects in MSet determine 
ordinary multisets as defined in §2.2 and how? Spell out what a morphism of 
multisets would be from this point of view. (There are several natural notions 
of morphisms of multisets. Try to define morphisms in MSet so that the notion
you obtain for ordinary multisets captures your intuitive understanding of these 
objects.) [§2.2, §3.2, 4.5] 

3.10. Since the objects of a category C are not (necessarily) sets, it is not clear 
how to make sense of a notion of ‘subobject’ in general. In some situations it does 
make sense to talk about subobjects, and the subobjects of any given object A in 
C are in one-to-one correspondence with the morphisms $A \to \Omega$ for a fixed, special
object $\Omega$ of C, called a subobject classifier. Show that Set has a subobject classifier.

3.11. ▷ Draw the relevant diagrams and define composition and identities for the
category $C^{A,B}$ mentioned in Example 3.9. Do the same for the category $C^{\alpha,\beta}$
mentioned in Example 3.10. [§5.5, 5.12]


4. Morphisms 

Just as in Set we highlight certain types of functions (injective, surjective, bijective), 
it is useful to try to do the same for morphisms in an arbitrary category. The reader 
should note that defining qualities of morphisms by their actions on ‘elements’ is 
not an option in the general setting, because objects of an arbitrary category do
not (in general) have ‘elements’. 

This is why we spent some time analyzing injectivity, etc., from different view- 
points in §§2.4-2.6. It turns out that the other viewpoints on these notions do 
transfer nicely into the categorical setting. 

4.1. Isomorphisms. Let C be a category.

Definition 4.1. A morphism $f \in \mathrm{Hom}_C(A, B)$ is an isomorphism if it has a
(two-sided) inverse under composition: that is, if $\exists g \in \mathrm{Hom}_C(B, A)$ such that

\[ gf = 1_A, \quad fg = 1_B. \] ⌟

Recall that in §2.5 the inverse of a bijection of sets f was defined ‘elementwise’;
in particular, there was no ambiguity in its definition, and we introduced the
notation $f^{-1}$ for this function. By contrast, the ‘inverse’ g produced in Definition 4.1
does not appear to have this uniqueness explicitly built into its definition. Luckily, 
its defining property does guarantee its uniqueness, but this requires a verification: 







Proposition 4.2. The inverse of an isomorphism is unique. 

Proof. We have to verify that if both $g_1$ and $g_2 : B \to A$ act as inverses of a
given isomorphism $f : A \to B$, then $g_1 = g_2$. The standard trick for this kind of
verification is to compose f on the left by one of the morphisms, and on the right
by the other one; then apply associativity. The whole argument can be compressed
into one line:

\[ g_1 = g_1 1_B = g_1 (f g_2) = (g_1 f) g_2 = 1_A g_2 = g_2 \]
as needed. □

Note that the argument really proves that if f is a morphism with a
left-inverse $g_1$ and a right-inverse $g_2$, then necessarily f is an isomorphism, $g_1 = g_2$,
and this morphism is the (unique) inverse of f.

Since the inverse of f is uniquely determined by f, there is no ambiguity in
denoting it by $f^{-1}$.
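
In the category of finite sets this uniqueness can be observed by brute force. The Python sketch below is an illustration only (the bijection `f` and the helper names are arbitrary choices made here): it enumerates every function from B to A and keeps those that are left-inverses or right-inverses of f.

```python
from itertools import product

def all_functions(domain, codomain):
    """Every function domain -> codomain between finite sets, encoded as a dict."""
    domain = list(domain)
    return [dict(zip(domain, values)) for values in product(codomain, repeat=len(domain))]

def compose(g, f):
    """The composite gf, encoded as a dict."""
    return {x: g[f[x]] for x in f}

def identity(X):
    return {x: x for x in X}

A, B = [0, 1, 2], ["x", "y", "z"]
f = {0: "y", 1: "z", 2: "x"}  # a bijection A -> B, chosen arbitrarily

# All g : B -> A with gf = 1_A, and all g : B -> A with fg = 1_B.
left_inverses = [g for g in all_functions(B, A) if compose(g, f) == identity(A)]
right_inverses = [g for g in all_functions(B, A) if compose(f, g) == identity(B)]
```

Among the 27 functions from B to A, exactly one survives each filter, and it is the same function in both cases, in agreement with Proposition 4.2 and the remark following it.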

Proposition 4.3. With notation as above: 

• Each identity $1_A$ is an isomorphism and is its own inverse.

• If f is an isomorphism, then $f^{-1}$ is an isomorphism and further $(f^{-1})^{-1} = f$.

• If $f \in \mathrm{Hom}_C(A, B)$, $g \in \mathrm{Hom}_C(B, C)$ are isomorphisms, then the composition
gf is an isomorphism and $(gf)^{-1} = f^{-1} g^{-1}$.

Proof. These all ‘prove themselves’. For example, it is immediate to verify that
$f^{-1} g^{-1}$ is a left-inverse of gf: indeed²⁰,

\[ (f^{-1} g^{-1})(gf) = f^{-1}((g^{-1} g) f) = f^{-1}(1_B f) = f^{-1} f = 1_A. \]

The verification that $f^{-1} g^{-1}$ is also a right-inverse of gf is analogous. □

Note that taking the inverse reverses the order of composition: $(gf)^{-1} = f^{-1} g^{-1}$.

Two objects A, B of a category are isomorphic if there is an isomorphism 
$f : A \to B$. An immediate corollary of Proposition 4.3 is that ‘isomorphism’ is an
equivalence relation²¹. If two objects A, B are isomorphic, one writes $A \cong B$.

Example 4.4. Of course, the isomorphisms in the category Set are precisely the
bijections; this was observed at the beginning of §2.5. ⌟

Example 4.5. As noted in Proposition 4.3, identities are isomorphisms. They may 
be the only isomorphisms in a category: for example, this is the case in the category
C obtained from the relation $\le$ on Z, as in Example 3.3. Indeed, for a, b objects of
C (that is, $a, b \in Z$), there is a morphism $f : a \to b$ and a morphism $g : b \to a$ only
if $a \le b$ and $b \le a$, that is, if a = b. So an isomorphism in C necessarily acts from
an object a to itself; but in C there is only one such morphism, that is, $1_a$. ⌟

²⁰ Associativity of composition implies that parentheses may be shuffled at will in longer
expressions, as done here (cf. Exercise 4.1).

²¹ The reader should have checked this in Exercise 2.4, for Set; the same proof will work in
any category.

Example 4.6. On the other hand, there are categories in which every morphism is
an isomorphism; such categories are called groupoids. The reader ‘already knows’
many examples of groupoids; cf. Exercise 4.2. ⌟

An automorphism of an object A of a category C is an isomorphism from A to
itself. The set of automorphisms of A is denoted $\mathrm{Aut}_C(A)$; it is a subset of $\mathrm{End}_C(A)$.
By Proposition 4.3, composition confers on $\mathrm{Aut}_C(A)$ a remarkable structure:

• the composition of two elements $f, g \in \mathrm{Aut}_C(A)$ is an element $gf \in \mathrm{Aut}_C(A)$;

• composition is associative;

• $\mathrm{Aut}_C(A)$ contains the element $1_A$, which is an identity for composition (that
is, $f 1_A = 1_A f = f$);

• every element $f \in \mathrm{Aut}_C(A)$ has an inverse $f^{-1} \in \mathrm{Aut}_C(A)$.

In other words, $\mathrm{Aut}_C(A)$ is a group, for all objects A of all categories C.

We will soon devote all our attention to groups! 
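
The four properties listed above can be verified exhaustively in a very small case. The following Python sketch takes C = Set and A = {0, 1, 2}, so that automorphisms are permutations; encoding a permutation as a tuple `p` with `p[a]` the image of `a` is a convention adopted only for this illustration.

```python
from itertools import permutations

A = (0, 1, 2)

# Automorphisms of A in Set are the bijections A -> A; p[a] is the image of a.
aut = set(permutations(A))

def compose(g, f):
    """The composite gf: first apply f, then g."""
    return tuple(g[f[a]] for a in A)

identity = A  # the tuple (0, 1, 2) sends each a to itself

def inverse(f):
    """The inverse permutation: it sends f[a] back to a."""
    inv = [None] * len(A)
    for a in A:
        inv[f[a]] = a
    return tuple(inv)

closed = all(compose(g, f) in aut for f in aut for g in aut)
associative = all(
    compose(h, compose(g, f)) == compose(compose(h, g), f)
    for f in aut for g in aut for h in aut
)
has_identity = all(compose(f, identity) == f == compose(identity, f) for f in aut)
has_inverses = all(
    compose(f, inverse(f)) == identity == compose(inverse(f), f) for f in aut
)
```

All four flags evaluate to `True`, and `aut` has 6 elements. Note that composition here is not commutative, a feature that will matter once groups take center stage.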

4.2. Monomorphisms and epimorphisms. As pointed out above, we do not 
have the option of defining for morphisms of an arbitrary category a notion such 
as ‘injective’ in the same way as we do for set-functions in §2.4: that definition 
requires a notion of ‘element’, and in general no such notion is available for objects 
of a category. But nothing prevents us from defining monomorphisms as we did in 
§2.6, in an arbitrary category: 

Definition 4.7. Let C be a category. A morphism $f \in \mathrm{Hom}_C(A, B)$ is a
monomorphism if the following holds:

for all objects Z of C and all morphisms $\alpha', \alpha'' \in \mathrm{Hom}_C(Z, A)$,

\[ f \circ \alpha' = f \circ \alpha'' \implies \alpha' = \alpha''. \] ⌟

Similarly, epimorphisms are defined as follows:

Definition 4.8. Let C be a category. A morphism $f \in \mathrm{Hom}_C(A, B)$ is an
epimorphism if the following holds:

for all objects Z of C and all morphisms $\beta', \beta'' \in \mathrm{Hom}_C(B, Z)$,

\[ \beta' \circ f = \beta'' \circ f \implies \beta' = \beta''. \] ⌟

Example 4.9. As proven in Proposition 2.3, in the category Set the
monomorphisms are precisely the injective functions. The reader should have by now checked
that, likewise, in Set the epimorphisms are precisely the surjective functions
(cf. Exercise 2.5). Thus, while the definitions given in §2.6 may have looked
counterintuitive at first, they work as natural ‘categorical counterparts’ of the ordinary notions
of injective/surjective functions. ⌟
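
The first statement of Example 4.9 can be spot-checked on small finite sets. The Python sketch below tests the cancellation condition of Definition 4.7 against a few test objects Z only, so it is a sanity check and not a proof; the sets and helper names are hypothetical choices made for the illustration.

```python
from itertools import product

def all_functions(domain, codomain):
    """Every function domain -> codomain between finite sets, encoded as a dict."""
    domain = list(domain)
    return [dict(zip(domain, values)) for values in product(codomain, repeat=len(domain))]

def compose(g, f):
    """The composite gf, encoded as a dict."""
    return {x: g[f[x]] for x in f}

def is_injective(f):
    return len(set(f.values())) == len(f)

def is_mono(f, A, test_objects):
    """Definition 4.7, with Z ranging only over the given finite test objects."""
    for Z in test_objects:
        for alpha1, alpha2 in product(all_functions(Z, A), repeat=2):
            if compose(f, alpha1) == compose(f, alpha2) and alpha1 != alpha2:
                return False
    return True

A, B = [0, 1], ["x", "y", "z"]
test_objects = [[], ["*"], ["p", "q"]]

agreement = all(
    is_mono(f, A, test_objects) == is_injective(f) for f in all_functions(A, B)
)
```

The flag `agreement` is `True` for all nine functions from A to B. A singleton test object already detects every failure of injectivity, which is the idea behind the proof of Proposition 2.3.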

Example 4.10. In the categories of Example 3.3, every morphism is both a 
monomorphism and an epimorphism. Indeed, recall that there is at most one mor- 
phism between any two objects in these categories; hence the conditions defining 
monomorphisms and epimorphisms are vacuous. ⌟
Contemplating Example 4.10 reveals a few unexpected twists in these
definitions, which defy our intuition as set-theorists. For instance, in Set, a function is
an isomorphism if and only if it is both injective and surjective, hence if and only
if it is both a monomorphism and an epimorphism. But in the category defined
by $\le$ on Z, every morphism is both a monomorphism and an epimorphism, while
the only isomorphisms are the identities (Example 4.5). Thus this property is a
special feature of Set, and we should not expect it to hold automatically in every
category; it will not hold in the category Ring of rings (cf. §III.2.3). It will hold in
every abelian category (of which Set is not an example!), but that is a story for a
very distant future (Lemma IX.1.9).

Similarly, in Set a function is an epimorphism, that is, surjective, if and only if 
it has a right-inverse (Proposition 2.1); this may fail in general, even in respectable 
categories such as the category Grp of groups (cf. Exercise II.8.24). 


Exercises 

4.1. ▷ Composition is defined for two morphisms. If more than two morphisms are
given, e.g.,

\[ A \xrightarrow{\ f\ } B \xrightarrow{\ g\ } C \xrightarrow{\ h\ } D \xrightarrow{\ i\ } E, \]

then one may compose them in several ways, for example:

\[ (ih)(gf), \quad (i(hg))f, \quad i((hg)f), \quad \text{etc.} \]

so that at every step one is only composing two morphisms. Prove that the result
of any such nested composition is independent of the placement of the parentheses.
(Hint: Use induction on n to show that any such choice for $f_n f_{n-1} \cdots f_1$ equals

\[ ((\cdots((f_n f_{n-1}) f_{n-2}) \cdots) f_1). \]

Carefully working out the case n = 5 is helpful.) [§4.1, §II.1.3]

4.2. ▷ In Example 3.3 we have seen how to construct a category from a set endowed
with a relation, provided this latter is reflexive and transitive. For what types of
relations is the corresponding category a groupoid (cf. Example 4.6)? [§4.1]

4.3. Let A, B be objects of a category C, and let $f \in \mathrm{Hom}_C(A, B)$ be a morphism.

• Prove that if f has a right-inverse, then f is an epimorphism.

• Show that the converse does not hold, by giving an explicit example of a category 
and an epimorphism without a right-inverse. 

4.4. Prove that the composition of two monomorphisms is a monomorphism.
Deduce that one can define a subcategory $C_{mono}$ of a category C by taking the same
objects as in C and defining $\mathrm{Hom}_{C_{mono}}(A, B)$ to be the subset of $\mathrm{Hom}_C(A, B)$
consisting of monomorphisms, for all objects A, B. (Cf. Exercise 3.8; of course, in
general $C_{mono}$ is not full in C.) Do the same for epimorphisms. Can you define
a subcategory $C_{nonmono}$ of C by restricting to morphisms that are not
monomorphisms?

4.5. Give a concrete description of monomorphisms and epimorphisms in the
category MSet you constructed in Exercise 3.9. (Your answer will depend on the notion
gory MSet you constructed in Exercise 3.9. (Your answer will depend on the notion 
of morphism you defined in that exercise!) 


5. Universal properties

The ‘abstract’ examples in §3 may have left the reader with the impression that 
one can produce at will a large number of minute variations of the same basic ideas, 
without really breaking any new ground. This may be fun in itself, but why do we 
really want to explore this territory?

Categories offer a rich unifying language, giving us a bird’s eye view of many 
constructions in algebra (and other fields). In this course, this will be most apparent 
in the steady appearance of constructions satisfying suitable universal properties. 
For instance, we will see in a moment that products and disjoint unions (as reviewed 
in §1.3 and following) are characterized by certain universal properties having to 
do with the categories $C_{A,B}$ and $C^{A,B}$ considered in Example 3.9.

Many of the concepts introduced in this course will have an explicit description 
(such as the definition of product of sets given in §1.4) and an accompanying de- 
scription in terms of a universal property (such as the one we will see in §5.4). The 
‘explicit’ description may be very useful in concrete computations or arguments, 
but as a rule it is the universal property that clarifies the true nature of the con- 
struction. In some cases (such as for the disjoint union) the explicit description may 
turn out to depend on a seemingly arbitrary choice, while the universal property 
will have no element of arbitrariness. In fact, viewing the construction in terms 
of its corresponding universal property clarifies why one can only expect it to be 
defined ‘up to isomorphism’. 

Also, deeper relationships become apparent when the constructions are viewed 
in terms of their universal properties. For example, we will see that products of 
sets and disjoint unions of sets are really ‘mirror’ constructions (in the sense that 
reversing arrows transforms the universal property for one into that for the other). 
This is not so clear (to this writer, anyway) from the explicit descriptions in §1.4. 

5.1. Initial and final objects. 

Definition 5.1. Let C be a category. We say that an object I of C is initial in C
if for every object A of C there exists exactly one morphism $I \to A$ in C:

\[ \forall A \in \mathrm{Obj}(C) : \mathrm{Hom}_C(I, A) \text{ is a singleton.} \]

We say that an object F of C is final in C if for every object A of C there exists
exactly one morphism $A \to F$ in C:

\[ \forall A \in \mathrm{Obj}(C) : \mathrm{Hom}_C(A, F) \text{ is a singleton.} \] ⌟

One may use terminal to denote either possibility, but in general we would 
advise the reader to be explicit about which ‘end’ of C one is considering. 

A category need not have initial or final objects, as the following example shows. 
Example 5.2. The category obtained by endowing Z with the relation $\le$ (see
Example 3.3) has no initial or final object. Indeed, an initial object in this category
would be an integer i such that $i \le a$ for all integers a; there is no such integer.
Similarly, a final object would be an integer f larger than every integer, and there
is no such thing.

By contrast, the category considered in Example 3.6 does have a final object,
namely the pair (3, 3); it still has no initial object. ⌟

Also, initial and final objects, when they exist, may or may not be unique: 

Example 5.3. In Set, the empty set $\emptyset$ is initial (the ‘empty graph’ defines the
unique function from $\emptyset$ to any given object!), and clearly it is the unique set that
fits this requirement (Exercise 5.2).

Set also has final objects: for every set A, there is a unique function from A
to a singleton {p} (that is, the ‘constant’ function). Every singleton is final in Set;
thus, final objects are not unique in this category. ⌟
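
Both claims of Example 5.3 amount to counting functions, which is easy to do by enumeration for small finite sets. The Python sketch below is an illustration only; the sample sets and the helper `all_functions` are assumptions introduced here.

```python
from itertools import product

def all_functions(domain, codomain):
    """Every function domain -> codomain between finite sets, encoded as a dict."""
    domain = list(domain)
    return [dict(zip(domain, values)) for values in product(codomain, repeat=len(domain))]

samples = [[], ["a"], [0, 1], ["x", "y", "z"]]  # a few test sets, chosen arbitrarily

# Exactly one function from the empty set to each A: the one with empty graph.
from_empty = [len(all_functions([], A)) for A in samples]

# Exactly one function from each A to a singleton: the constant function.
to_singleton = [len(all_functions(A, ["p"])) for A in samples]

# For contrast, a two-element set is neither initial nor final.
from_pair = [len(all_functions([0, 1], A)) for A in samples]
to_pair = [len(all_functions(A, [0, 1])) for A in samples]
```

The first two lists consist entirely of 1s, as the definitions of initial and final object require, while the counts for the two-element set vary with A.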

However, we claim that if initial/final objects exist, then they are unique up 
to a unique isomorphism. We will invoke this fact frequently, so here is its official 
statement and its (immediate) proof: 

Proposition 5.4. Let C be a category. 

• If $I_1$, $I_2$ are both initial objects in C, then $I_1 \cong I_2$.

• If $F_1$, $F_2$ are both final objects in C, then $F_1 \cong F_2$.

Further, these isomorphisms are uniquely determined. 

Proof. Recall that (by definition of category!) for every object A of C there is at
least one element in $\mathrm{Hom}_C(A, A)$, namely the identity $1_A$. If I is initial, then there
is a unique morphism $I \to I$, which therefore must be the identity $1_I$.

Now assume $I_1$ and $I_2$ are both initial in C. Since $I_1$ is initial, there is a unique
morphism $f : I_1 \to I_2$ in C; we have to show that f is an isomorphism. Since $I_2$ is
initial, there is a unique morphism $g : I_2 \to I_1$ in C. Consider $gf : I_1 \to I_1$; as
observed, necessarily

\[ gf = 1_{I_1} \]

since $I_1$ is initial. By the same token

\[ fg = 1_{I_2} \]

since $I_2$ is initial. This proves that $f : I_1 \to I_2$ is an isomorphism, as needed.

The proof for final objects is entirely analogous (Exercise 5.3). □ 

Proposition 5.4 “explains” why, while not unique, the final objects in Set are
all isomorphic: no singleton is more ‘special’ than any other singleton; this is the
typical situation. There may be psychological reasons why one initial or final object
looks more compelling than others (for example, the singleton $\{\emptyset\} = 2^{\emptyset}$ may look
to some like the most ‘natural’ choice among all singletons), but this plays no role 
in how these objects sit in their category. 
5.2. Universal properties. The most natural context in which to introduce
universal properties requires a good familiarity with the language of functors, which we
will only introduce at a later stage (cf. §VIII.1.1). For the purpose of the examples
we will run across in (most of) this book, the following ‘working definition’ should 
suffice. 

We say that a construction satisfies a universal property (or ‘is the solution to 
a universal problem’) when it may be viewed as a terminal object of a category. 
The category depends on the context and is usually explained ‘in words’ (and often 
without even mentioning the word category). 

In particularly simple cases this may take the form of a statement such as $\emptyset$ is
universal with respect to the property of mapping to sets; this is synonymous with
the assertion that $\emptyset$ is initial in the category Set.

More often, the situation is more complex. Since being initial/final amounts to 
the existence and uniqueness of certain morphisms, the ‘explanation’ of a universal 
property may follow the pattern, “object X is universal with respect to the following 
property: for any Y such that. . . , there exists a unique morphism $Y \to X$ such
that. . . .”

The not-so-naive reader will recognize that this explanation hides the definition
of an accessory category and the statement that X is terminal (probably final in 
this case) in this new category. It is useful to learn how to translate such wordy 
explanations into what they really mean. Also, the reader should keep in mind that 
it is not uncommon to sweep under the rug part of the essential information about 
the solution to a universal problem (usually some key morphism): this information 
is presumably implicit in any given set-up. This will be apparent from the examples 
that follow. 

5.3. Quotients. Let ~ be an equivalence relation defined on a set A. Let’s parse 
the assertion: 

“The quotient A/~ is universal with respect to the property of mapping A to a 
set in such a way that equivalent elements have the same image. ” 

What can this possibly mean, and is it true? 

The assertion is talking about functions

\[ A \xrightarrow{\ \varphi\ } Z \]

with Z any set, satisfying the property

\[ a' \sim a'' \implies \varphi(a') = \varphi(a''). \]

These morphisms are objects of a category (very similar to the category defined in
Example 3.7); for convenience, let’s denote such an object by $(\varphi, Z)$. The only
reasonable way to define morphisms $(\varphi_1, Z_1) \to (\varphi_2, Z_2)$ is as commutative diagrams

\[
\begin{array}{ccc}
Z_1 & \xrightarrow{\ \sigma\ } & Z_2 \\
 & {\scriptstyle \varphi_1}\nwarrow \quad \nearrow{\scriptstyle \varphi_2} & \\
 & A &
\end{array}
\]

This is the same definition considered in Example 3.7.

Does this category have initial objects? 

Claim 5.5. Denoting by $\pi$ the ‘canonical projection’ defined in Example 2.6, the
pair $(\pi, A/{\sim})$ is an initial object of this category.

This is what our writer meant by the mysterious assertion copied above. Once 
this is understood, it is very easy to prove that the assertion is indeed correct. 

Proof. Consider any $(\varphi, Z)$ as above. We have to prove that there exists a unique
morphism $(\pi, A/{\sim}) \to (\varphi, Z)$, that is, a unique commutative diagram

\[
\begin{array}{ccc}
A/{\sim} & \xrightarrow{\ \overline{\varphi}\ } & Z \\
 & {\scriptstyle \pi}\nwarrow \quad \nearrow{\scriptstyle \varphi} & \\
 & A &
\end{array}
\]

that is, a unique function $\overline{\varphi}$ making this diagram commute.

Let $[a]_\sim$ be an arbitrary element of A/~. If the diagram is indeed going to
commute, then necessarily

\[ \overline{\varphi}([a]_\sim) = \varphi(a); \]

this tells us that $\overline{\varphi}$ is indeed unique, if it exists at all, that is, if this prescription
does define a function $A/{\sim} \to Z$.

Hence, all we have to check is that $\overline{\varphi}$ is well-defined, that is, that if $[a_1]_\sim =
[a_2]_\sim$, then $\varphi(a_1) = \varphi(a_2)$; and indeed

\[ [a_1]_\sim = [a_2]_\sim \implies a_1 \sim a_2 \implies \varphi(a_1) = \varphi(a_2). \]
This is precisely the condition that morphisms in our category satisfy. □
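
The proof is constructive enough to be carried out on a finite example. In the Python sketch below, the relation (equal parity on {0, ..., 5}), the function `phi`, and the helper names `quotient` and `induced_map` are all hypothetical choices made for the illustration; `induced_map` refuses to build the induced function when `phi` fails the well-definedness check.

```python
def quotient(A, equivalent):
    """Return the set of equivalence classes A/~ and the canonical projection pi."""
    classes = []
    for a in A:
        for cls in classes:
            if equivalent(a, next(iter(cls))):
                cls.add(a)
                break
        else:
            classes.append({a})
    frozen = [frozenset(c) for c in classes]
    pi = {a: next(c for c in frozen if a in c) for a in A}
    return set(frozen), pi

def induced_map(phi, classes):
    """Define phi_bar([a]) = phi(a); raise if phi is not constant on some class."""
    phi_bar = {}
    for cls in classes:
        values = {phi[a] for a in cls}
        if len(values) != 1:
            raise ValueError("phi does not respect the equivalence relation")
        phi_bar[cls] = values.pop()
    return phi_bar

A = range(6)

def same_parity(a, b):
    return (a - b) % 2 == 0

classes, pi = quotient(A, same_parity)
phi = {a: "even" if a % 2 == 0 else "odd" for a in A}  # equivalent elements share an image
phi_bar = induced_map(phi, classes)

# The triangle commutes: phi_bar composed with pi equals phi.
commutes = all(phi_bar[pi[a]] == phi[a] for a in A)
```

Uniqueness is visible in the code as well: the value of `phi_bar` on each class is forced by the value of `phi` on any representative, so no other function could make the triangle commute.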

Note the several levels of sloppiness in the assertion considered above: it does 
not tell us very explicitly what category to consider; it does not tell us that we
should especially pay attention to initial objects in this category. Worst of all,
the solution to the universal problem is not really A/~, but rather the morphism
$\pi : A \to A/{\sim}$.

The reader should practice the skill of translating loose assertions such as the
one given above into precise statements; it is not at all uncommon to run into
examples at the same level of ‘abuse of language’ as this one.

The reason why we get away with writing such assertions is that the context 
really allows the experienced reader to parse them effectively, and they are substan- 
tially more concise than their spelled-out version. After all, there is in general no 
conceivable choice for a morphism $A \to A/{\sim}$ other than the canonical projection;
hence, neglecting to mention it is forgivable. Also, the final object in the category 
considered above is supremely uninteresting (what is it? cf. Exercise 5.5), so surely 
we must have meant the initial one. 

What do we learn by viewing quotients in terms of their universal property? 
For example, suppose ~ is the equivalence relation defined starting from a function 
$f : A \to B$, as in §2.8. Then the reader will realize easily that im f also satisfies
the universal property given above for A/~; therefore (by Proposition 5.4) im f
and A/~ must be isomorphic. This is precisely the content of Theorem 2.7; thus,
the universal property sheds some light on the ‘canonical decomposition’ studied 
in §2.8. 

5.4. Products. It is also a very good exercise to stare at a familiar construction 
and try to see the universal property which may be behind it. We will now encourage 
you, dear reader, to contemplate the notion of the product of two sets given in §1.4 
and to see if the universal property it satisfies jumps out at you. Spoiler follows, 
so this is a good time to stop reading these notes and to try on your own. 

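Here is the universal property. Let $A$, $B$ be sets, and consider the product
$A \times B$, with the two natural projections:

\[ A \xleftarrow{\ \pi_A\ } A \times B \xrightarrow{\ \pi_B\ } B \]

(see Example 2.4). Then for every set $Z$ and morphisms

\[ f_A : Z \to A, \qquad f_B : Z \to B \]

there exists a unique morphism $\sigma : Z \to A \times B$ such that the diagram

\[
\begin{array}{ccccc}
 & & Z & & \\
 & {\scriptstyle f_A}\swarrow & \big\downarrow{\scriptstyle \sigma} & \searrow{\scriptstyle f_B} & \\
A & \xleftarrow{\ \pi_A\ } & A \times B & \xrightarrow{\ \pi_B\ } & B
\end{array}
\]

commutes.

In this situation, $\sigma$ is usually denoted $f_A \times f_B$.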
Proof. Define $\forall z \in Z$

\[ \sigma(z) = (f_A(z), f_B(z)). \]

This function$^{22}$ manifestly makes the diagram commute: $\forall z \in Z$

\[ \pi_A \sigma(z) = \pi_A(f_A(z), f_B(z)) = f_A(z), \]

showing that $\pi_A \sigma = f_A$ and similarly $\pi_B \sigma = f_B$.

Further, the definition is forced by the commutativity of the diagram; so $\sigma$ is
unique, as claimed. □
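
The construction in the proof translates almost verbatim into code. This is an illustrative sketch only: `pairing` is a hypothetical name, pairs are modelled as Python tuples, and the sample maps are arbitrary choices.

```python
def pairing(f_A, f_B):
    """Return sigma : Z -> A x B with sigma(z) = (f_A(z), f_B(z))."""
    return lambda z: (f_A(z), f_B(z))


# The two natural projections from A x B.
pi_A = lambda pair: pair[0]
pi_B = lambda pair: pair[1]

# Illustrative data: Z = {0, ..., 5}, A = {0, 1}, B = {0, 1, 2}.
Z = range(6)
f_A = lambda z: z % 2
f_B = lambda z: z % 3
sigma = pairing(f_A, f_B)
```

Composing `sigma` with either projection recovers the corresponding map, and any function with that property must send `z` to the same pair, which is the uniqueness statement.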


$^{22}$ Note that there is no ‘well-definedness’ issue this time.

In other words, products of sets (or, more precisely, products of sets together
with the information of their natural projections to the factors) are final objects in
the category $C_{A,B}$ considered in Example 3.9, for C = Set.

What is the advantage of viewing products this way? The main advantage
is that the universal property may be stated in any category, while the definition
of products given in §1.4 only makes sense in Set (and possibly in other categories
where one has a notion of ‘elements’). We say that a category C has (finite) products,
or is a category ‘with (finite) products’, if for all objects $A$, $B$ in C the category
$C_{A,B}$ considered in Example 3.9 has final objects. Such a final object consists of the
data of an object of C, usually denoted $A \times B$, and of two morphisms $A \times B \to A$,
$A \times B \to B$.

Note that a ‘product’ from this perspective does not need to ‘look like a product’. Consider our recurring example of the category obtained from $\le$ on $\mathbb{Z}$, as
in Example 3.3. Does this category have products? Objects of this category are
simply integers $a, b \in \mathbb{Z}$; call $a \times b$ for a moment the ‘categorical’ product of $a$ and
$b$. The universal property written out above becomes, in this case, for all $z \in \mathbb{Z}$
such that $z \le a$ and $z \le b$, we have $z \le a \times b$.

This universal problem does have a solution $\forall a, b$: it is conventionally not
called $a \times b$, but rather $\min(a, b)$. It is immediate to see that $\min(a, b)$ satisfies
the property. Thus this category has products, and in fact we see that the product
in this category amounts to the familiar operation of taking the minimum of two
integers.
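
A brute-force check on a finite window of integers makes the claim concrete. This is only a sketch under stated assumptions: `is_product` is a hypothetical helper, and testing a finite range illustrates the property without proving it. Uniqueness of the induced morphism is automatic here, since this category has at most one morphism between any two objects.

```python
def is_product(p, a, b, candidates):
    """Check that p is a product of a and b in the category of (Z, <=),
    restricted to the finite collection `candidates`.

    p must map to both a and b (p <= a and p <= b), and every z that
    maps to both a and b must map to p (z <= p).
    """
    return p <= a and p <= b and all(
        z <= p for z in candidates if z <= a and z <= b
    )


window = range(-5, 6)
checks = [is_product(min(a, b), a, b, window) for a in window for b in window]
```

Every entry of `checks` is true, whereas a candidate such as 2 fails to be a product of 3 and 5, because 3 maps to both factors but not to 2.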

Thus there is an unexpected connection between ‘the Cartesian product of two 
sets’ and ‘the minimum of two integers’: both are examples of products, taken 
in different categories; they both satisfy ‘the same’ universal property, in different
contexts. 

5.5. Coproducts. The prefix co- usually indicates that one is ‘reversing all arrows’. Just as products are final objects in the categories $C_{A,B}$ obtained by considering morphisms in C with common source, whose targets are $A$ and $B$, coproducts
will be initial objects in the categories$^{23}$ $C^{A,B}$ of morphisms with common target, whose sources are $A$ and $B$. Dear reader, look away and spell this universal
property out before we do.

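Here it is. Let $A$, $B$ be objects of a category C. A coproduct $A \amalg B$ of $A$
and $B$ will be an object of C, endowed with two morphisms $i_A : A \to A \amalg B$,
$i_B : B \to A \amalg B$ and satisfying the following universal property: for all objects $Z$
and morphisms

\[ f_A : A \to Z, \qquad f_B : B \to Z \]

there exists a unique morphism $\sigma : A \amalg B \to Z$ such that the diagram

\[
\begin{array}{ccccc}
A & \xrightarrow{\ i_A\ } & A \amalg B & \xleftarrow{\ i_B\ } & B \\
 & {\scriptstyle f_A}\searrow & \big\downarrow{\scriptstyle \sigma} & \swarrow{\scriptstyle f_B} & \\
 & & Z & &
\end{array}
\]

commutes.

$^{23}$ These categories were also considered in Example 3.9; cf. Exercise 3.11.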

The symmetry with the universal property of products is hopefully completely 
apparent. We say that a category C has coproducts if this universal problem has a 
solution for all pairs of objects A and B. 

Is the reader familiar with any coproduct? Yes! 

Proposition 5.6. The disjoint union is a coproduct in Set. 


Proof. Recall (§1.4) that the disjoint union $A \amalg B$ is defined as the union of two
disjoint isomorphic copies $A'$, $B'$ of $A$, $B$, respectively; for example, we may let
$A' = \{0\} \times A$, $B' = \{1\} \times B$. The functions $i_A$, $i_B$ are defined by

\[ i_A(a) = (0, a), \qquad i_B(b) = (1, b), \]

where we view these elements as elements of $(\{0\} \times A) \cup (\{1\} \times B)$.

Now let $f_A : A \to Z$, $f_B : B \to Z$ be arbitrary morphisms to a common target.
Define

\[ \sigma : A \amalg B = (\{0\} \times A) \cup (\{1\} \times B) \to Z \]

by

\[
\sigma(c) = \begin{cases} f_A(a) & \text{if } c = (0, a) \in \{0\} \times A, \\ f_B(b) & \text{if } c = (1, b) \in \{1\} \times B. \end{cases}
\]

This definition makes the relevant diagram commute and is in fact forced upon us
by this commutativity, proving that $\sigma$ exists and is unique. □
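
The tagged-copy construction used in the proof can be sketched as follows. The name `copairing` and the sample sets and maps are assumptions made for illustration; the tags 0 and 1 are the ones used in the proof.

```python
def i_A(a):
    """Inclusion of A into the disjoint union, tagging with 0."""
    return (0, a)


def i_B(b):
    """Inclusion of B into the disjoint union, tagging with 1."""
    return (1, b)


def copairing(f_A, f_B):
    """Return sigma on the disjoint union: f_A on tag 0, f_B on tag 1."""
    def sigma(c):
        tag, value = c
        return f_A(value) if tag == 0 else f_B(value)
    return sigma


# Illustrative data: A and B overlap as sets, yet their tagged copies do not.
A = {1, 2, 3}
B = {3, 4}
f_A = lambda a: a * 10
f_B = lambda b: -b
sigma = copairing(f_A, f_B)
disjoint_union = {i_A(a) for a in A} | {i_B(b) for b in B}
```

The element 3 belongs to both `A` and `B`, but its two copies `(0, 3)` and `(1, 3)` are distinct and are sent to `f_A(3)` and `f_B(3)` respectively.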


This observation tells us that the category Set has coproducts and further sheds 
considerable light on the mysteries of disjoint unions. For example, there was an 
element of arbitrariness in our choice of ‘a’ disjoint union, although different choices
led to isomorphic notions. Now we see why: terminal objects of a category are not 
unique in general, although they are unique up to isomorphism (Proposition 5.4); 
there is not a ‘most beautiful’ disjoint union of two sets just as there is not a ‘most 
beautiful’ singleton in Set (cf. §5.1). 

Also, an unexpected ‘symmetry’ between products and disjoint unions becomes 
suddenly apparent from the point of view of universal properties. 

The reader is invited to contemplate the notion of coproduct in the other cat- 
egories we have encountered. For example (and probably not surprisingly at this 
point) the category obtained from $\le$ on $\mathbb{Z}$ does have coproducts: the coproduct of
two objects (i.e., integers) $a$, $b$ is simply the maximum of $a$ and $b$.

Exercises


5.1. Prove that a final object in a category C is initial in the opposite category $C^{op}$
(cf. Exercise 3.1).

5.2. ▷ Prove that $\emptyset$ is the unique initial object in Set. [§5.1]

5.3. ▷ Prove that final objects are unique up to isomorphism. [§5.1]

5.4. What are initial and final objects in the category of ‘pointed sets’ (Exam- 
ple 3.8)? Are they unique? 

5.5. ▷ What are the final objects in the category considered in §5.3? [§5.3]

5.6. ▷ Consider the category corresponding to endowing (as in Example 3.3) the
set $\mathbb{Z}^+$ of positive integers with the divisibility relation. Thus there is exactly one
morphism $d \to m$ in this category if and only if $d$ divides $m$ without remainder;
there is no morphism between $d$ and $m$ otherwise. Show that this category has
products and coproducts. What are their ‘conventional’ names? [§VII.5.1]

5.7. Redo Exercise 2.9, this time using Proposition 5.4. 

5.8. Show that in every category C the products $A \times B$ and $B \times A$ are isomorphic,
if they exist. (Hint: Observe that they both satisfy the universal property for the
product of $A$ and $B$; then use Proposition 5.4.)

5.9. Let C be a category with products. Find a reasonable candidate for the
universal property that the product $A \times B \times C$ of three objects of C ought to
satisfy, and prove that both $(A \times B) \times C$ and $A \times (B \times C)$ satisfy this universal
property. Deduce that $(A \times B) \times C$ and $A \times (B \times C)$ are necessarily isomorphic.

5.10. Push the envelope a little further still, and define products and coproducts 
for families (i.e., indexed sets) of objects of a category. 

Do these exist in Set? 

It is common to denote the product $\underbrace{A \times \cdots \times A}_{n \text{ times}}$ by $A^n$.

5.11. Let $A$, resp. $B$, be a set, endowed with an equivalence relation $\sim_A$, resp. $\sim_B$.
Define a relation $\sim$ on $A \times B$ by setting

\[ (a_1, b_1) \sim (a_2, b_2) \iff a_1 \sim_A a_2 \text{ and } b_1 \sim_B b_2. \]

(This is immediately seen to be an equivalence relation.)

• Use the universal property for quotients (§5.3) to establish that there are functions $(A \times B)/{\sim} \to A/{\sim_A}$, $(A \times B)/{\sim} \to B/{\sim_B}$.

• Prove that $(A \times B)/{\sim}$, with these two functions, satisfies the universal property
for the product of $A/{\sim_A}$ and $B/{\sim_B}$.

• Conclude (without further work) that $(A \times B)/{\sim} \cong (A/{\sim_A}) \times (B/{\sim_B})$.

5.12. ¬ Define the notions of fibered products and fibered coproducts, as terminal objects of the categories $C_{\alpha,\beta}$, $C^{\alpha,\beta}$ considered in Example 3.10 (cf. also Exercise 3.11),
by stating carefully the corresponding universal properties.

As it happens, Set has both fibered products and coproducts. Define these
objects ‘concretely’, in terms of naive set theory. [II.3.9, III.6.10, III.6.11]





Chapter II 


Groups, first encounter 


In this chapter we introduce groups, we observe they form a category (called Grp), 
and we study ‘general’ features of this category: what are the monomorphisms and 
the epimorphisms in this category? what is the appropriate notion of ‘equivalence 
relation’ and ‘quotients’ for a group? does a ‘decomposition theorem’ hold in Grp? 
and other analogous questions. 

In Chapter III we will acquire a similar degree of familiarity with rings and
modules. A more object-oriented analysis of Grp (for example, a treatment of the
famous Sylow theorems, ‘composition series’, or the classification of finite abelian
groups) is deferred to Chapter IV.


1. Definition of group 
1.1. Groups and groupoids. 

Joke 1.1. Definition: A group is a groupoid with a single object. ⌟

This is actually a perfectly viable definition, since groupoids have been defined
already (in Example I.4.6); but most mathematicians would find it ludicrous to
introduce groups in this fashion, or they will at the very least politely express
doubts on the pedagogical effectiveness of doing so. In order to redeem himself, the
author will parse this definition right away to show what it really says. If $*$ is the
lone object of such a groupoid $\mathsf{G}$,

\[ \mathrm{Hom}_{\mathsf{G}}(*, *) = \mathrm{Aut}_{\mathsf{G}}(*) \]

(because $\mathsf{G}$ is a groupoid!), and this set carries all the information about $\mathsf{G}$. Call
this set $G$. Then (by definition of category) there is an associative operation on $G$,
with an identity $1_*$, and (by definition of groupoid, which says that every morphism
in $\mathsf{G}$ is an isomorphism) every $g \in G$ has an inverse $g^{-1} \in G$.

That is what a group is$^{1}$: a set $G$ with a composition law satisfying a few key
axioms, i.e., associativity, existence of identity, and existence of inverses.

1.2. Definition. Now for the official definition. Let $G$ be a nonempty set, endowed
with a binary operation, that is, a ‘multiplication’ map

\[ \bullet : G \times G \to G. \]

Our notation will be

\[ \bullet(g, h) =: g \bullet h \]

or simply $gh$ if the name of the operation can be understood. The careful reader
may have expected that we should write $h \bullet g$, in the style of what we have done
for categories, but this is what common conventions dictate$^{2}$.

Definition 1.2. The set $G$, endowed with the binary operation $\bullet$ (briefly, $(G, \bullet)$,
or simply $G$ if the operation can be understood) is a group if

(i) the operation $\bullet$ is associative, that is,

\[ (\forall g, h, k \in G) : \quad (g \bullet h) \bullet k = g \bullet (h \bullet k); \]

(ii) there exists an identity element $e_G$ for $\bullet$, that is,

\[ (\exists e_G \in G)\ (\forall g \in G) : \quad g \bullet e_G = g = e_G \bullet g; \]

(iii) every element in $G$ has an inverse with respect to $\bullet$, that is,

\[ (\forall g \in G)\ (\exists h \in G) : \quad g \bullet h = e_G = h \bullet g. \] ⌟
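
For a finite set, the three axioms can be tested exhaustively. The sketch below is an illustration, not part of the original text: `is_group` is a hypothetical helper, and it also checks that the operation lands back in the set, which the definition builds into the phrase ‘binary operation’.

```python
from itertools import product


def is_group(elements, op):
    """Check Definition 1.2 by brute force on a finite set.

    `elements` is a finite collection and `op` a two-argument function.
    """
    G = list(elements)
    if not G:
        return False  # G is required to be nonempty
    if any(op(g, h) not in G for g, h in product(G, G)):
        return False  # op is not a binary operation on G
    # (i) associativity
    if any(op(op(g, h), k) != op(g, op(h, k)) for g, h, k in product(G, G, G)):
        return False
    # (ii) existence of a two-sided identity
    identities = [e for e in G if all(op(e, g) == g == op(g, e) for g in G)]
    if not identities:
        return False
    e = identities[0]
    # (iii) existence of a two-sided inverse for every element
    return all(any(op(g, h) == e == op(h, g) for h in G) for g in G)
```

For instance, `is_group(range(5), lambda a, b: (a + b) % 5)` holds, while the same set under multiplication modulo 5 fails axiom (iii) because 0 has no inverse.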

Example 1.3. Since we explicitly require $G$ to be nonempty, the most economical
way to concoct a group is by letting $G = \{e\}$ be a singleton. There is only one
function $G \times G \to G$ in this case, so there is only one possible binary operation on
$G$, defined by

\[ e \bullet e := e. \]

The three axioms trivially hold for this example, so $\{e\}$ is equipped with a unique
group structure.

This is usually called the trivial group; purists should call any such group a
trivial group, since every singleton gives rise to one. ⌟

Example 1.4. The reader should check carefully (cf. Exercise 1.2) that $(\mathbb{Z}, +)$,
$(\mathbb{Q}, +)$, $(\mathbb{R}, +)$, $(\mathbb{C}, +)$, and several variations using $\cdot$ (for example, the subset
$\{+1, -1\}$ of $\mathbb{Z}$, with ordinary multiplication) all give examples of groups. While
very interesting in themselves, these examples do not really capture at any intuitive level ‘what’ a group really is, because they are too special. For example, all
these examples are commutative (see §1.5). ⌟


$^{1}$ From this perspective, Joke 1.1 is a little imprecise: the group is not the groupoid $\mathsf{G}$, but
rather the set of isomorphisms in $\mathsf{G}$, endowed with the operation of composition of morphisms.
$^{2}$ Not without exceptions; see for example permutation groups, discussed in §2.

Example 1.5. My readers are likely familiar with an extremely important noncommutative example, namely the group of invertible $n \times n$ matrices with (say) real
entries, $n \ge 2$. We will generally shy away from this class of examples in these early
chapters, since we will have ample opportunities to think about matrices when we
approach linear algebra (starting in Chapter VI). But we may occasionally borrow
a matrix or two before then. The reader should now check that $2 \times 2$ matrices

\[ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \]

with real entries, and such that $ad - bc \neq 0$, form a group under the ordinary matrix
multiplication:

\[
\begin{pmatrix} a_1 & b_1 \\ c_1 & d_1 \end{pmatrix} \begin{pmatrix} a_2 & b_2 \\ c_2 & d_2 \end{pmatrix} = \begin{pmatrix} a_1 a_2 + b_1 c_2 & a_1 b_2 + b_1 d_2 \\ c_1 a_2 + d_1 c_2 & c_1 b_2 + d_1 d_2 \end{pmatrix}.
\]

(The condition $ad - bc \neq 0$ guarantees that the matrix is invertible. What is its
inverse?) Since, for example,

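\[
\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix} \neq \begin{pmatrix} 1 & 1 \\ 1 & 2 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix},
\]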

this group is indeed not commutative. 

The group of invertible $n \times n$ matrices with real entries is denoted $\mathrm{GL}_n(\mathbb{R})$. ⌟
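
The multiplication rule and the failure of commutativity are easy to confirm by machine. The following is a small sketch with hypothetical helper names (`matmul2`, `det2`); the two sample matrices are an illustrative choice with determinant 1, stored as nested tuples.

```python
def matmul2(m, n):
    """Multiply 2x2 matrices given as ((a, b), (c, d))."""
    (a1, b1), (c1, d1) = m
    (a2, b2), (c2, d2) = n
    return ((a1 * a2 + b1 * c2, a1 * b2 + b1 * d2),
            (c1 * a2 + d1 * c2, c1 * b2 + d1 * d2))


def det2(m):
    """Determinant ad - bc of a 2x2 matrix."""
    (a, b), (c, d) = m
    return a * d - b * c


# Two illustrative invertible matrices.
p = ((1, 1), (0, 1))
q = ((1, 0), (1, 1))
pq = matmul2(p, q)
qp = matmul2(q, p)
```

The two products `pq` and `qp` differ, while both still have nonzero determinant, so both remain in the group.
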
We will encounter more representative examples in §2.

1.3. Basic properties. From the ‘groupoid’ point of view, the identity $e_G$ would
be denoted $1_*$. It is not uncommon to omit $G$ from the notation (when the group
is understood) and to use different symbols rather than $e$ to denote this element:
$1$ and $0$ are popular alternatives, depending on the context. In any case, for any
given group $G$ this element is unique. That is, no other element of $G$ can work as
an identity:

Proposition 1.6. If $h \in G$ is an identity of $G$, then $h = e_G$.

Incidentally, this makes groups pointed sets in the sense of Example I.3.8: every
group has a well-defined distinguished element.

Proof. Using first that $e_G$ is an identity and then that $h$ is an identity, one gets$^{3}$

\[ h = e_G h = e_G. \]

(Amusingly, this argument only uses that $e_G$ is a ‘left’ identity and $h$ is a ‘right’
identity.) □

Proposition 1.7. The inverse is also unique: if $h_1$, $h_2$ are both inverses of $g$ in $G$,
then $h_1 = h_2$.

$^{3}$ As previously announced, we may omit the symbol $\bullet$ for the operation, since for the time
being we are only considering one operation. This will be done without warning in the future.

Proof. This actually follows from Proposition I.4.2 (by viewing $G$ as the set of
isomorphisms of a groupoid with a single object). The reader should construct
a stand-alone proof, using the same trick, but carefully hiding any reference to
morphisms. □

Proposition 1.7 authorizes us to give a name to the inverse of $g$: this is usually$^{4}$
denoted $g^{-1}$.

One more notational item is in order. The definition of a group only contemplates the ‘product’ of two elements; in multiplying a string of elements, one may in
principle have a choice as to the order in which products are executed. For example,

\[ (g_1 \bullet g_2) \bullet g_3 \]

stands for: apply the operation $\bullet$ to $g_1$ and $g_2$, and then apply $\bullet$ again to the result
of this operation and $g_3$; while

\[ g_1 \bullet (g_2 \bullet g_3) \]

stands for: apply $\bullet$ to $g_2$ and $g_3$, and then apply it again to $g_1$ and to the result of
this operation.

Associativity tells us precisely that the result of the operation on three elements
does not depend on the way in which we perform it. With this in mind, we are
authorized to write

\[ g_1 \bullet g_2 \bullet g_3 : \]

this expression is not ambiguous, by associativity. What about four or more elements? The reader should have checked in Exercise I.4.1 that all ways to associate
any number of elements lead to the same result. So we are also authorized to write
things like

\[ g_1 \bullet g_2 \bullet g_3 \bullet \cdots \bullet g_{17}; \]

this is also unambiguous. The reader should keep in mind, however, that of course
the order in which the elements are listed is important: in general,

\[ g_1 \bullet g_2 \bullet g_3 \bullet \cdots \bullet g_{17} \neq g_2 \bullet g_1 \bullet g_3 \bullet \cdots \bullet g_{17}. \]


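Of course no such care is necessary if all $g_i$ coincide; the conventional ‘power’
notation can then be used$^{5}$: $g^0 = e_G$, and for a positive integer $n$

\[ g^n = \underbrace{g \cdots g}_{n \text{ times}}, \qquad g^{-n} = \underbrace{g^{-1} \cdots g^{-1}}_{n \text{ times}}. \]

It is easy to check that then $\forall g \in G$ and $\forall m, n \in \mathbb{Z}$

\[ g^{m+n} = g^m g^n. \]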
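
The power notation, including negative exponents, can be spelled out as a short routine. This is a sketch under stated assumptions: `power` is a hypothetical helper, and the sample group (nonzero residues modulo 7 under multiplication) is chosen only for illustration. The three-argument `pow` with exponent −1 computes a modular inverse and requires Python 3.8 or later.

```python
def power(g, n, op, inverse, identity):
    """Return g^n for any integer n, by repeated multiplication.

    For n < 0 this multiplies |n| copies of the inverse of g.
    """
    base = g if n >= 0 else inverse(g)
    result = identity
    for _ in range(abs(n)):
        result = op(result, base)
    return result


# Illustrative group: nonzero residues modulo 7 under multiplication.
op = lambda a, b: (a * b) % 7
inverse = lambda a: pow(a, -1, 7)  # modular inverse (Python 3.8+)
pw = lambda g, n: power(g, n, op, inverse, 1)
```

With these definitions, `pw(g, m + n)` agrees with `op(pw(g, m), pw(g, n))` for all exponents of either sign, which is the law of exponents stated above.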


$^{4}$ But note the ‘abelian’ case, discussed in §1.5.

$^{5}$ In the abelian case one uses ‘multiples’; cf. §1.5.

1.4. Cancellation. ‘Cancellation’ holds in groups. That is,

Proposition 1.8. Let $G$ be a group. Then $\forall a, g, h \in G$

\[ ga = ha \implies g = h, \qquad ag = ah \implies g = h. \]

Proof. Both statements are proven by multiplying (on the appropriate side) by $a^{-1}$
and applying associativity. For example,

\[ ga = ha \implies (ga)a^{-1} = (ha)a^{-1} \implies g(aa^{-1}) = h(aa^{-1}) \implies g e_G = h e_G \implies g = h. \]

The proof for the other implication follows the same pattern. □

Examples of operations which do not satisfy cancellation, and hence do not
define groups, abound. For instance, the operation of ordinary multiplication does
not make the set $\mathbb{R}$ of real numbers into a group: indeed, 0 ‘cannot be cancelled’
since $1 \cdot 0 = 2 \cdot 0$ even if $1 \neq 2$. Of course, the problem here is that 0 does not
have an inverse in $\mathbb{R}$, with respect to multiplication. As it happens, this is the only
problem with this example: ordinary multiplication does make

\[ \mathbb{R}^* := \mathbb{R} \setminus \{0\} \]

into a group, as the reader should check immediately.

1.5. Commutative groups. One axiom not appearing in the definition of group 
is commutativity: we say that the operation • is ‘commutative’ if 

(iv) $(\forall g, h \in G) : \quad g \bullet h = h \bullet g$.

We say that two elements g, h ‘commute’ if gh = hg. Thus, in a commutative 
group any two elements commute. 

‘Commutative groups’ are important objects: they arise naturally in several 
contexts, especially as ‘modules over the ring Z’ (about which we will have a lot to 
say in Chapter III and beyond). When they do so, they are usually called abelian 
groups. 

The notation used in treating abelian groups differs somewhat from the stan- 
dard notation for groups. This is to emphasize the ‘Z-module structure’, and it is 
helpful when an abelian group coexists with other operations — a situation which 
we will encounter frequently. 

Thus, the operation in an abelian group $A$ is, as a rule, denoted by $+$ and is
called ‘addition’; the identity is then called $0_A$; and the inverse of an element $a \in A$
is denoted $-a$ (and maybe should be called the ‘opposite’?). The ‘power’ notation
is of course replaced by ‘multiple’: $0a = 0_A$, and for a positive integer $n$

\[ na = \underbrace{a + \cdots + a}_{n \text{ times}}, \qquad (-n)a = \underbrace{(-a) + \cdots + (-a)}_{n \text{ times}}. \]

The reader should keep in mind that at this stage ‘$na$’ is a notation, not the
result of applying a binary operation to two elements $n$, $a$ of $A$. Indeed, $n \in \mathbb{Z}$ may
very well not be an element of $A$ in any reasonable sense. Moreover, it may very
well be that $na = ma$ even if $n \neq m$, in spite of the fact that ‘cancellation’ works
in groups.

The qualifier ‘abelian’ and the notation $0_A$, $-a$, etc., are mostly used for commutative groups arising in certain standard situations: for example, the notions
of rings, modules, vector spaces are defined by suitably enriching a commutative 
group, which is then promoted to ‘abelian’ for notational convenience. 

There are other situations in which commutative groups arise naturally, without 
triggering the ‘abelian’ notation. For example, the group $(\mathbb{R}^*, \cdot)$ mentioned at the
end of §1.4 is commutative, but its operation is indicated by $\cdot$ (if at all), and its
identity element is written 1. History, rather than logic, is often the main factor 
determining notation. 

1.6. Order. 

Definition 1.9. An element $g$ of a group $G$ has finite order if$^{6}$ $g^n = e$ for some
positive integer $n$. In this case, the order $|g|$ is the smallest positive $n$ such that
$g^n = e$. One writes $|g| = \infty$ if $g$ does not have finite order. ⌟

By definition, if $g^n = e$ for some positive integer $n$, then $|g| \le n$. One can be
more precise:

Lemma 1.10. If $g^n = e$ for some positive integer $n$, then $|g|$ is a divisor of $n$.

Proof. As observed, $n \ge |g|$ by definition of order, that is, $n - |g| \ge 0$. There must
then exist$^{7}$ a positive integer $m$ such that

\[ r = n - |g| \cdot m \ge 0 \quad \text{and} \quad n - |g| \cdot (m + 1) < 0, \]

that is, $r < |g|$. Note that

\[ g^r = g^{n - |g| \cdot m} = g^n \cdot (g^{|g|})^{-m} = e \cdot e^{-m} = e. \]

By definition of order, $|g|$ is the smallest positive integer such that $g^{|g|} = e$. Since
$r$ is smaller than $|g|$ and $g^r = e$, $r$ cannot be positive; hence $r = 0$ necessarily. This
says

\[ 0 = n - |g| \cdot m, \]

that is, $n = |g| \cdot m$, proving that $n$ is indeed an integer multiple of $|g|$ as claimed. □

This lemma has the following immediate and useful consequence, which we 
encourage the reader to keep firmly in mind: 

Corollary 1.11. Let $g$ be an element of finite order, and let $N \in \mathbb{Z}$. Then

\[ g^N = e \iff N \text{ is a multiple of } |g|. \]
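
Definition 1.9 and Corollary 1.11 can be explored numerically. In the sketch below, `order` is a hypothetical helper that searches up to an arbitrary `bound` (a stand-in for ‘infinite order’, not a proof of it), and the sample group is $\mathbb{Z}/12\mathbb{Z}$ under addition, written additively so that $g^N$ becomes $Ng$.

```python
def order(g, op, identity, bound=10_000):
    """Return the least n >= 1 with g^n = identity, or None if no such
    n is found up to `bound`."""
    x = g
    for n in range(1, bound + 1):
        if x == identity:
            return n
        x = op(x, g)
    return None


# Illustrative group: Z/12Z under addition, with identity 0.
add12 = lambda a, b: (a + b) % 12


def multiple(g, N):
    """N*g in Z/12Z, the additive analogue of g^N."""
    return (N * g) % 12
```

For example, `order(8, add12, 0)` is 3, and `multiple(8, N)` vanishes exactly when `N` is a multiple of 3, for negative `N` as well.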

$^{6}$ Of course in an abelian group we would write the following prescription as $ng = 0$.

$^{7}$ Purists may object that here we are surreptitiously using fairly sophisticated information
about $\mathbb{Z}$, namely the ‘division algorithm’, hence essentially the fact that $\mathbb{Z}$ is a Euclidean domain!
This is material that will have to wait until Chapter V to be given some justice. We may as well
be open about it and admit that yes, we are assuming that the readers have already acquired a
thorough familiarity with the operations of addition and multiplication among integers. Shame
on us!

Definition 1.12. If $G$ is finite as a set, its order $|G|$ is the number of its elements;
we write $|G| = \infty$ if $G$ is infinite. ⌟

Cancellation implies that $|g| \le |G|$ for all $g \in G$. Indeed, this is vacuously true
if $|G| = \infty$; if $G$ is finite, consider the $|G| + 1$ powers

\[ g^0 = e,\ g,\ g^2,\ g^3,\ \dots,\ g^{|G|} \]

of $g$. These cannot all be distinct; hence

\[ (\exists i, j) \quad 0 \le i < j \le |G| \quad \text{such that } g^i = g^j. \]

By cancellation (that is, multiplying on the right by $g^{-i}$)

\[ g^{j-i} = e, \]

showing $|g| \le (j - i) \le |G|$.

We will soon be able to formulate a much more precise statement concerning 
the relation between the order of a group and the order of its elements: if g € G 
and |G| is finite, then the order of g divides the order of G. This will be an 
immediate consequence of Lagrange’s theorem; cf. Example 8.15. 

Another general remark concerning orders is that their behavior with respect
to the operation of the group is not always predictable: it may very well happen
that $g$, $h$ have finite order in a group $G$, and yet $|gh| = \infty$, or $|gh| =$ your favorite
positive integer: work out Exercise 1.12 and Exercise 2.6 if you don’t believe it.

On the other hand, the situation is more constrained if g and h commute. In 
the extreme case in which g = h, it is easy to obtain a very precise statement: 

Proposition 1.13. Let $g \in G$ be an element of finite order. Then $g^m$ has finite
order $\forall m > 0$, and in fact$^{8}$

\[ |g^m| = \frac{\operatorname{lcm}(m, |g|)}{m} = \frac{|g|}{\gcd(m, |g|)}. \]

Proof. The equality of the two numbers $\frac{\operatorname{lcm}(m, |g|)}{m}$ and $\frac{|g|}{\gcd(m, |g|)}$ follows from elementary properties of gcd and lcm: $\operatorname{lcm}(a, b) = ab/\gcd(a, b)$ for all $a$ and $b$. So we
only need to prove that $|g^m| = \frac{\operatorname{lcm}(m, |g|)}{m}$.

The order of $g^m$ is the least positive $d$ for which

\[ g^{md} = e, \]

that is (by Corollary 1.11) for which $md$ is a multiple of $|g|$. In other words, $m|g^m|$
is the smallest multiple of $m$ which is also a multiple of $|g|$:

\[ m|g^m| = \operatorname{lcm}(m, |g|). \]

The stated formula follows immediately from this. □
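
The two expressions in Proposition 1.13 can be compared against a brute-force order computation. This sketch assumes the additive group $\mathbb{Z}/12\mathbb{Z}$ as a test case, so that $g^m$ is written $mg$; `order_mod` and `lcm` are hypothetical helper names, and a finite check of this kind illustrates the formula rather than proving it.

```python
from math import gcd


def order_mod(g, n):
    """Order of g in the additive group Z/nZ, found by brute force."""
    k, x = 1, g % n
    while x != 0:
        x = (x + g) % n
        k += 1
    return k


def lcm(a, b):
    """Least common multiple of two positive integers."""
    return a * b // gcd(a, b)


n = 12
rows = []
for g in range(n):
    for m in range(1, 25):
        lhs = order_mod(m * g, n)            # |g^m|, written additively
        mid = lcm(m, order_mod(g, n)) // m   # lcm(m, |g|) / m
        rhs = order_mod(g, n) // gcd(m, order_mod(g, n))
        rows.append((lhs, mid, rhs))
```

Each triple in `rows` has three equal entries; for instance $g = 2$ has order 6, and $4g = 8$ has order $3 = \operatorname{lcm}(4, 6)/4 = 6/\gcd(4, 6)$.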

In general, for commuting elements, 

Proposition 1.14. If $gh = hg$, then $|gh|$ divides $\operatorname{lcm}(|g|, |h|)$.

$^{8}$ The notation lcm stands for ‘least common multiple’. We are also assuming that the reader
is familiar with simple properties of gcd and lcm.

Proof. Let $|g| = m$, $|h| = n$. If $N$ is any common multiple of $m$ and $n$, then
$g^N = h^N = e$ by Corollary 1.11. Since $g$ and $h$ commute,

\[ (gh)^N = \underbrace{(gh)(gh) \cdots (gh)}_{N \text{ times}} = \underbrace{gg \cdots g}_{N \text{ times}}\ \underbrace{hh \cdots h}_{N \text{ times}} = g^N h^N = e. \]

As this holds for every common multiple $N$ of $m$ and $n$, in particular

\[ (gh)^{\operatorname{lcm}(m, n)} = e. \]

The statement then follows from Lemma 1.10. □

One cannot say more about |gh| in general, even if g and h commute (Exercise 1.13). 
But see Exercise 1.14 for an important special case. 


Exercises 


1.1. ▷ Write a careful proof that every group is the group of isomorphisms of a
groupoid. In particular, every group is the group of automorphisms of some object
in some category. [§2.1]

1.2. ▷ Consider the ‘sets of numbers’ listed in §I.1.1, and decide which are made into
groups by conventional operations such as $+$ and $\cdot$. Even if the answer is negative
(for example, $(\mathbb{R}, \cdot)$ is not a group), see if variations on the definition of these sets
lead to groups (for example, $(\mathbb{R}^*, \cdot)$ is a group; cf. §1.4). [§1.2]

1.3. Prove that $(gh)^{-1} = h^{-1} g^{-1}$ for all elements $g$, $h$ of a group $G$.

1.4. Suppose that $g^2 = e$ for all elements $g$ of a group $G$; prove that $G$ is commutative.

1.5. The ‘multiplication table’ of a group is an array compiling the results of all
multiplications $g \bullet h$:

\[
\begin{array}{c|cccc}
\bullet & e & \cdots & h & \cdots \\ \hline
e & e & \cdots & h & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots \\
g & g & \cdots & g \bullet h & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{array}
\]

(Here $e$ is the identity element. Of course the table depends on the order in which
the elements are listed in the top row and leftmost column.) Prove that every row
and every column of the multiplication table of a group contains all elements of the
group exactly once (like Sudoku diagrams!).

1.6. ¬ Prove that there is only one possible multiplication table for $G$ if $G$ has
exactly 1, 2, or 3 elements. Analyze the possible multiplication tables for groups
with exactly 4 elements, and show that there are two distinct tables, up to reordering the elements of $G$. Use these tables to prove that all groups with $\le 4$
elements are commutative.

(You are welcome to analyze groups with 5 elements using the same technique,
but you will soon know enough about groups to be able to avoid such brute-force
approaches.) [2.19]

1.7. Prove Corollary 1.11.

1.8. ¬ Let $G$ be a finite group, with exactly one element $f$ of order 2. Prove that
$\prod_{g \in G} g = f$. [4.16]

1.9. Let $G$ be a finite group, of order $n$, and let $m$ be the number of elements
$g \in G$ of order exactly 2. Prove that $n - m$ is odd. Deduce that if $n$ is even, then
$G$ necessarily contains elements of order 2.

1.10. Suppose the order of $g$ is odd. What can you say about the order of $g^2$?

1.11. Prove that for all $g$, $h$ in a group $G$, $|gh| = |hg|$. (Hint: Prove that $|aga^{-1}| = |g|$ for all $a$, $g$ in $G$.)

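1.12. ▷ In the group of invertible $2 \times 2$ matrices, consider

\[ g = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \qquad h = \begin{pmatrix} 0 & 1 \\ -1 & -1 \end{pmatrix}. \]

Verify that $|g| = 4$, $|h| = 3$, and $|gh| = \infty$. [§1.6]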

1.13. ▷ Give an example showing that $|gh|$ is not necessarily equal to $\operatorname{lcm}(|g|, |h|)$,
even if $g$ and $h$ commute. [§1.6, 1.14]

1.14. ▷ As a counterpoint to Exercise 1.13, prove that if $g$ and $h$ commute and
$\gcd(|g|, |h|) = 1$, then $|gh| = |g|\,|h|$. (Hint: Let $N = |gh|$; then $g^N = (h^{-1})^N$. What
can you say about this element?) [§1.6, 1.15, §IV.2.5]

1.15. ¬ Let $G$ be a commutative group, and let $g \in G$ be an element of maximal
finite order, that is, such that if $h \in G$ has finite order, then $|h| \le |g|$. Prove that
in fact if $h$ has finite order in $G$, then $|h|$ divides $|g|$. (Hint: Argue by contradiction.
If $|h|$ is finite but does not divide $|g|$, then there is a prime integer $p$ such that $|g| = p^m r$, $|h| = p^n s$, with $r$ and $s$ relatively prime to $p$ and $m < n$. Use Exercise 1.14 to
compute the order of $g^{p^m} h^s$.) [§2.1, 4.11, IV.6.15]


2. Examples of groups 

2.1. Symmetric groups. In §I.4.1 we have already observed that every object $A$
of every category C determines a group, called $\mathrm{Aut}_C(A)$, namely the group of automorphisms of $A$. In a somewhat artificial sense it is clear that every group arises in
this fashion (cf. Exercise 1.1); this fact is true in more ‘meaningful’ ways, which will
become apparent when we discuss group actions (§9): cf. especially Theorem 9.5
and Exercise 9.17.

In any case, this observation provides the reader with an infinite class of very
important examples:

Definition 2.1. Let $A$ be a set. The symmetric group, or group of permutations
of $A$, denoted $S_A$, is the group $\mathrm{Aut}_{\mathrm{Set}}(A)$. The group of permutations of the set
$\{1, \dots, n\}$ is denoted by $S_n$. ⌟


The terminology is easily justified: the automorphisms of a set $A$ are the set-isomorphisms, that is, the bijections, from $A$ to itself; applying such a bijection
amounts precisely to permuting (‘scrambling’) the elements of $A$. This operation
may be viewed as a transformation of $A$ which does not change it (as a set), hence
a ‘symmetry’.

The groups $S_A$ are famously large: as the reader checked in Exercise I.2.1,
$|S_n| = n!$. For example, $|S_{70}| > 10^{100}$, which is substantially larger than the
estimated number of elementary particles in the observable universe.
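
The size comparison is easy to confirm with exact integer arithmetic. A minimal sketch, using only the standard library; the variable names are illustrative.

```python
from math import factorial

order_S70 = factorial(70)       # |S_70| = 70!
digits = len(str(order_S70))    # number of decimal digits of 70!
```

Since `factorial(69)` is still below `10**100`, the value 70 is the first $n$ for which $|S_n|$ exceeds $10^{100}$.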

Potentially confusing point: The various conventions clash in the way the operation in $S_A$ should be written. From the ‘automorphism’ point of view, elements of
$S_A$ are functions and should be composed as such; thus, if $f, g \in S_A = \mathrm{Aut}_{\mathrm{Set}}(A)$,
then the ‘product’ of $f$ and $g$ should be written $g \circ f$ and should act as follows:

\[ (\forall p \in A) : \quad g \circ f(p) = g(f(p)). \]


But the prevailing style of notation in group theory would write this element 
as fg, apparently reversing the order in which the operation is performed. 

Everything would fall back into agreement if we adopted the convention of writing functions after the elements on which they act rather than before: $(p)f$ rather
than $f(p)$. But one cannot change century-old habits, so we have no alternative
but to live with both conventions and to state carefully which one we are using at 
any given time. 

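Contemplating the groups $S_n$ for small values of $n$ is an exercise of inestimable
value. Of course $S_1$ is a trivial group; $S_2$ consists of the two possible permutations:

\[ \begin{cases} 1 \mapsto 1 \\ 2 \mapsto 2 \end{cases} \qquad \text{and} \qquad \begin{cases} 1 \mapsto 2 \\ 2 \mapsto 1 \end{cases} \]

which we could call $e$ (identity) and $f$ (flip), with operation

\[ ee = ff = e, \qquad ef = fe = f. \]

In practice we cannot give a new name to every different element of every permutation group, so we have to develop a more flexible notation. There are in fact several
possible choices for this; for the time being, we will indicate an element $\sigma \in S_n$ by
listing the effect of applying $\sigma$ underneath the list $1, \dots, n$, as a matrix$^{9}$. Thus the
elements $e$, $f$ in $S_2$ may be denoted by

\[ e = \begin{pmatrix} 1 & 2 \\ 1 & 2 \end{pmatrix}, \qquad f = \begin{pmatrix} 1 & 2 \\ 2 & 1 \end{pmatrix}. \]

$^{9}$ This is only a notational device; these matrices should not be confused with the matrices
appearing in linear algebra.

In the same notational style, $S_3$ consists of

\[
\begin{pmatrix} 1 & 2 & 3 \\ 1 & 2 & 3 \end{pmatrix},\ 
\begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix},\ 
\begin{pmatrix} 1 & 2 & 3 \\ 3 & 2 & 1 \end{pmatrix},\ 
\begin{pmatrix} 1 & 2 & 3 \\ 1 & 3 & 2 \end{pmatrix},\ 
\begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix},\ 
\begin{pmatrix} 1 & 2 & 3 \\ 2 & 3 & 1 \end{pmatrix}.
\]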

For the multiplication, we will adopt the sensible (but not very standard) conven- 
tion mentioned above and have permutations act ‘on the right’: thus, for example, 


(1 2 3U1 2 3\ = (1 2 3\ 
\2 1 3A3 1 V A3 1 V 


and similarly 


2 


(1 2 
12 1 


That is, 


3) (3 1 2)-®’ ® (2 1 a) (3 1 i )- 2 ' 

(1 2 3\/l 2 3\ _/l 2 3\ 

\2 1 3^3 1 2/ — \1 3 2) 


since the permutations on both sides of the equal sign act in the same way on 
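1, 2, 3. The reader should now check that

\[
\begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix}\begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix}
= \begin{pmatrix} 1 & 2 & 3 \\ 3 & 2 & 1 \end{pmatrix}.
\]

That is, letting

\[
x = \begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix}, \qquad
y = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix},
\]

then

\[ yx \neq xy, \]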

showing that the operation in S3 does not satisfy the commutative axiom. Thus, 
$S_3$ is a noncommutative group; the reader will immediately realize that in fact $S_n$ is
noncommutative for all $n \ge 3$.

While the commutation relation does not hold, other interesting relations do 
hold in $S_3$. For example,

\[ x^2 = e, \qquad y^3 = e, \]

showing that $S_3$ contains elements of order 1 (the identity $e$), 2 (the element $x$),
and 3 (the element y); cf. Exercise 2.2. (Incidentally, this shows that the result 
of Exercise 1.15 does require the commutativity hypothesis.) Also, 


\[ yx = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 2 & 1 \end{pmatrix} = xy^2 \]

as the reader may check. Using these relations, we see that every product of any
assortment of $x$ and $y$, $x^{i_1} y^{i_2} x^{i_3} y^{i_4} \cdots$, may be reduced to a product $x^i y^j$ with
$0 \le i \le 1$, $0 \le j \le 2$, that is, to one of the six elements

\[ e,\ y,\ y^2,\ x,\ xy,\ xy^2 : \]

for example,

\[ y^7 x^{13} y^5 = (y^3)^2 y\, (x^2)^6 x\, y^3 y^2 = (yx) y^2 = (xy^2) y^2 = x y^3 y = xy. \]
On the other hand, these six elements are all distinct; this may be checked by
cancellation and order considerations¹⁰. For example, if we had $xy^2 = y$, then we
would get $x = y^{-1}$ by cancellation, and this cannot be since the relations tell us
that $x$ has order 2 and $y^{-1}$ has order 3.

The conclusion is that the six products displayed above must be all six elements
of $S_3$:

\[ S_3 = \{e, x, y, xy, y^2, xy^2\}. \]

In the process we have verified that $S_3$ may also be described as the group ‘generated’
by two elements $x$ and $y$, with the ‘relations’ $x^2 = e$, $y^3 = e$, $yx = xy^2$.

More generally, a subset A of a group G ‘generates’ G if every element of G 
may be written as a product of elements of A and of inverses of elements of A. We 
will deal with this notion more formally in §6.3 and with descriptions of groups in 
terms of generators and relations in §8.2. 
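The relations found above for $S_3$ can also be checked mechanically. The following is a minimal Python sketch, not part of the argument itself: it stores a permutation as the tuple of its bottom row and multiplies with the ‘act on the right’ convention adopted above. The helper names `mul` and `power` are chosen here purely for illustration.

```python
def mul(s, t):
    """Product st for permutations acting on the right: apply s first, then t.

    A permutation of {1, ..., n} is stored as the tuple of its images,
    i.e. the bottom row of the two-row matrix notation.
    """
    return tuple(t[s[i] - 1] for i in range(len(s)))


def power(s, k):
    """k-fold product of s with itself (k >= 0); power(s, 0) is the identity."""
    result = tuple(range(1, len(s) + 1))
    for _ in range(k):
        result = mul(result, s)
    return result


e = (1, 2, 3)
x = (2, 1, 3)   # swaps 1 and 2
y = (3, 1, 2)   # sends 1 to 3, 2 to 1, 3 to 2

xy = mul(x, y)
yx = mul(y, x)

# The six products x^i y^j with 0 <= i <= 1, 0 <= j <= 2.
six = {mul(power(x, i), power(y, j)) for i in range(2) for j in range(3)}

print("xy =", xy, " yx =", yx)
print("x^2 = e:", power(x, 2) == e, " y^3 = e:", power(y, 3) == e)
print("yx = xy^2:", yx == mul(x, power(y, 2)))
print("number of distinct products x^i y^j:", len(six))
```

Storing only the bottom row keeps the sketch short; swapping the roles of `s` and `t` inside `mul` would give the ‘act on the left’ convention instead.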

2.2. Dihedral groups. A ‘symmetry’ is a transformation which preserves a structure.
This is of course just a loose way to talk about automorphisms, when we may
be too lazy to define rigorously the relevant category. As automorphisms of objects
of a category, symmetries will naturally form groups.

One context in which this notion may be visualized vividly is that of ‘geometric 
figures’ such as polygons in the plane or polyhedra in space. The relevant category 
could be defined as follows: let the objects be subsets of an ordinary plane $\mathbb{R}^2$ and
let morphisms between two subsets A, B consist of the ‘rigid motions’ of the plane 
(such as translations, rotations, or reflections about a line) which map A to a subset 
of B. A rigorous treatment of these notions would be too distracting at this point, 
so we will appeal to the intuition of the reader, as we do every now and then. 

From this perspective, the ‘symmetries’ of a subset of the plane are the rigid
motions which map it onto itself; they clearly form a group. 

The dihedral groups may be defined as these groups of symmetries for the
regular polygons. Placing the polygon so that it is centered at the origin (thereby
excluding translations as possible symmetries), we see that the dihedral group for
a regular $n$-sided polygon consists of the $n$ rotations by multiples of $2\pi/n$ radians about the
origin and the $n$ reflections about lines through the origin and a vertex or the midpoint of a side.

Thus, the dihedral group for a regular $n$-sided polygon consists of $2n$ elements;
we will denote¹¹ this group by the symbol $D_{2n}$.

Again, contemplating these groups, at least for small values of $n$, is a wonderful
exercise. There is a simple way to relate the dihedral groups to the symmetric
groups of §2.1, capturing the fact that a symmetry of a regular polygon $P$ is determined
by the fate of the vertices of $P$. For example, label the vertices of an
equilateral triangle clockwise by 1, 2, 3; then a counterclockwise rotation by an
angle of $2\pi/3$ sends vertex 1 to vertex 3, 3 to 2, and 2 to 1, and no other symmetry
of the triangle does the same.

10 It may of course also be checked by explicit computation of the corresponding permutations,
but we are trying to illustrate the fact that the relations are ‘all we need to know’.

11 Unfortunately there does not seem to be universal agreement on this notation: some references
use the symbol $D_n$ for what we call $D_{2n}$ here.
Visually, this looks like

[Figure: the labeled triangle and its counterclockwise rotation.]

In other words, we can associate with the counterclockwise rotation of an equilateral
triangle by $2\pi/3$ the permutation

\[ \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix} \in S_3. \]

Such a labeling defines a function

\[ D_6 \to S_3; \]

further, this function is injective (since a symmetry is determined by the permutation
of vertices it induces). In fancier language, which we will master in due time,
we say that $D_6$ acts (faithfully) on the set $\{1, 2, 3\}$.

It is clear that this can be done in several ways (for example, we could label the
vertices in different ways). However, any such assignment will have the property
that composition of symmetries in $D_{2n}$ corresponds to composition of permutations
in $S_n$; the reader should carefully work this out for several examples involving
$D_6 \to S_3$ and $D_8 \to S_4$.

A concise way to describe the situation is that these functions are (group)
homomorphisms (cf. §4). Since both $D_6$ and $S_3$ have 6 elements and the function
$D_6 \to S_3$ given above is injective, it must also be surjective. Thus there are bijective
homomorphisms between $D_6$ and $S_3$; we say that these groups are isomorphic
(cf. §4.3). We will study these concepts very carefully in the next several sections.
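For readers who like to experiment, the counting argument above can be imitated on a computer. The sketch below is only an illustration under stated assumptions: vertices are labeled $0, \dots, n-1$ rather than $1, \dots, n$, a symmetry is recorded by the permutation of vertex labels it induces, and the function names `dihedral` and `compose` are made up for this example.

```python
from itertools import permutations


def dihedral(n):
    """Vertex permutations induced by the symmetries of a regular n-gon.

    Vertices are labeled 0..n-1 in cyclic order (an assumption of this sketch);
    perm[i] is the label of the position that vertex i is moved to.
    """
    rotations = [tuple((i + k) % n for i in range(n)) for k in range(n)]
    reflections = [tuple((k - i) % n for i in range(n)) for k in range(n)]
    return set(rotations) | set(reflections)


def compose(s, t):
    """Apply s first, then t."""
    return tuple(t[s[i]] for i in range(len(s)))


d6 = dihedral(3)
d8 = dihedral(4)

# For the triangle, the induced permutations exhaust all of S_3.
all_of_s3 = set(permutations(range(3)))
print(len(d6), d6 == all_of_s3)

# For the square we only get 8 of the 24 permutations in S_4,
# and this subset is closed under composition.
closed = all(compose(s, t) in d8 for s in d8 for t in d8)
print(len(d8), closed)
```

The same two functions can be used to experiment with larger polygons, for instance `len(dihedral(5))`.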

As an alternative (and more abstract) way to draw the same conclusion, denote
by $y$ the counterclockwise rotation considered above and by $x$ the reflection about
the line through the center and vertex 3 of our equilateral triangle:

[Figure: the triangle with the rotation $y$ and the reflection $x$ indicated.]

Reflecting twice gives the identity, as does rotating three times; thus

\[ x^2 = e, \qquad y^3 = e. \]

Further, $yx$ (rotating counterclockwise by $2\pi/3$, then flipping about the slanted
line) is the same symmetry as $xy^2$ (flipping first, then rotating clockwise by $2\pi/3$).
That is,

\[ yx = xy^2. \]
In other words, the group $D_6$ is also generated by two elements $x$, $y$, subject to the
relations $x^2 = e$, $y^3 = e$, $yx = xy^2$, precisely as we found for $S_3$. Since $D_6$ and $S_3$
admit matching descriptions in terms of generators and relations (that is, matching
presentations; cf. §8.2), ‘of course’ they are isomorphic.

2.3. Cyclic groups and modular arithmetic. Let $n$ be a positive integer. Consider
the equivalence relation on Z defined by¹²

\[ (\forall a, b \in \mathbb{Z}) : \quad a \equiv b \bmod n \iff n \mid (b - a). \]

This is called congruence modulo $n$. We have encountered this relation already, for
$n = 2$, in Example I.1.3. It is very easy to check that it is an equivalence relation,
for all $n$; the set of equivalence classes is often denoted by $\mathbb{Z}_n$, $\mathbb{Z}/(n)$, $\mathbb{Z}/n\mathbb{Z}$, or $\mathbb{F}_n$.
We will opt for $\mathbb{Z}/n\mathbb{Z}$, which is not preempted by other notions¹³. We will denote
by $[a]_n$ the equivalence class of the integer $a$ modulo $n$, or simply $[a]$ if no ambiguity
arises.

The reader should check carefully that Z/nZ consists of exactly $n$ elements,
namely

\[ [0]_n,\ [1]_n,\ \dots,\ [n-1]_n. \]

We can use the group structure on Z to induce an (abelian) group structure on 
Z/nZ. In order to do this, we define an operation + on Z/nZ, by setting $\forall a, b \in \mathbb{Z}$

\[ [a] + [b] := [a + b]. \]

Of course we have to check that this prescription is well-defined; luckily, this is 
very easy: the following small lemma does the job, as it shows that the result of 
the operation does not depend on the representatives chosen for the classes.

Lemma 2.2. If $a \equiv a' \bmod n$ and $b \equiv b' \bmod n$, then

\[ (a + b) \equiv (a' + b') \bmod n. \]

Proof. By hypothesis $n \mid (a' - a)$ and $n \mid (b' - b)$; therefore $\exists k, \ell \in \mathbb{Z}$ such that

\[ (a' - a) = kn, \qquad (b' - b) = \ell n. \]

Then

\[ (a' + b') - (a + b) = (a' - a) + (b' - b) = kn + \ell n = (k + \ell) n, \]

proving that $n$ divides $(a' + b') - (a + b)$, as needed. □

Therefore, we have a binary operation + on Z/nZ. It is immediately checked 
that the resulting structure is a group. Associativity is inherited from Z: 

\[ ([a] + [b]) + [c] = [a + b] + [c] = [(a + b) + c] = [a + (b + c)] = [a] + [b + c] = [a] + ([b] + [c]); \]

and so are the identity $[0]$ and ‘inverse’ $-[a] = [-a]$.

It is also immediately checked that the resulting groups Z/nZ are commutative,
as the abelian-style notation suggests:

\[ [a] + [b] = [a + b] = [b + a] = [b] + [a]. \]
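A quick numerical sanity check of Lemma 2.2 may be reassuring. The sketch below is an illustration only (a finite check, not a proof): it represents the class $[a]_n$ by its representative in $0, \dots, n-1$ and confirms, for one sample modulus, that changing representatives by multiples of $n$ does not change the sum. The name `add_classes` is hypothetical.

```python
def add_classes(a, b, n):
    """[a]_n + [b]_n := [a + b]_n, returned as the representative in 0..n-1."""
    return (a + b) % n


n = 12  # sample modulus, chosen arbitrarily for this sketch

# Replace a by a + k*n and b by b + l*n: the resulting class must not change.
well_defined = all(
    add_classes(a, b, n) == add_classes(a + k * n, b + l * n, n)
    for a in range(n)
    for b in range(n)
    for k in range(-2, 3)
    for l in range(-2, 3)
)

# Commutativity and associativity on representatives.
commutative = all(
    add_classes(a, b, n) == add_classes(b, a, n)
    for a in range(n) for b in range(n)
)
associative = all(
    add_classes(add_classes(a, b, n), c, n) == add_classes(a, add_classes(b, c, n), n)
    for a in range(n) for b in range(n) for c in range(n)
)

print(well_defined, commutative, associative)
```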

12 The notation $n \mid m$ stands for ‘$n$ is a divisor of $m$’; that is, $m = nk$ for some integer $k$.
13 When $n = p$ is prime, $\mathbb{Z}_p$ is the official notation for ‘p-adic integers’, which are a completely
different concept; see Exercise VIII.1.19.
We trust that this material is not new to the reader, who should in any case
check all these assertions carefully. 

The abelian groups thus obtained, together with Z, are called cyclic groups; 
a popular alternative notation for the group (Z/nZ, +) is $C_n$. This is adopted
especially when one wants to use the ‘multiplicative’ rather than ‘additive’ notation;
thus we can say that $C_n$ is generated by one element $x$, with the relation $x^n = e$.

Cyclic groups are tremendously important, and we will come back to them in 
later sections. For the time being we record the fact that the element

\[ [1]_n \in \mathbb{Z}/n\mathbb{Z} \]

generates the group, in the sense that every other element may be obtained as a
multiple of this element. For example, if $m > 0$ is an integer, then

\[ [m]_n = [\underbrace{1 + \cdots + 1}_{m \text{ times}}]_n = \underbrace{[1]_n + \cdots + [1]_n}_{m \text{ times}} = m \cdot [1]_n. \]

Equivalently, we may phrase this fact by observing that the order of $[1]_n$ in Z/nZ
is $n$: this implies that the $n$ multiples $0 \cdot [1]_n$, $1 \cdot [1]_n$, …, $(n - 1) \cdot [1]_n$ must all be
distinct, and hence they must fill up Z/nZ.

Proposition 2.3. The order of $[m]_n$ in Z/nZ is 1 if $n \mid m$, and more generally

\[ |[m]_n| = \frac{n}{\gcd(m, n)}. \]

Proof. If $n \mid m$, then $[m]_n = [0]_n$. If $n$ does not divide $m$, observe again that
$[m]_n = m\,[1]_n$ and apply Proposition 1.13. □
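The formula of Proposition 2.3 is easy to test by brute force. The following sketch is an illustration, not a replacement for the proof: the function `order` (a name chosen here) finds the order of $[m]_n$ directly from the definition, and the result is compared with $n/\gcd(m, n)$ for a range of small values.

```python
from math import gcd


def order(m, n):
    """Order of [m]_n in Z/nZ: the smallest k >= 1 with k*m divisible by n."""
    k = 1
    while (k * m) % n != 0:
        k += 1
    return k


# Compare the brute-force order with n / gcd(m, n) for small n and m.
formula_holds = all(
    order(m, n) == n // gcd(m, n)
    for n in range(1, 40)
    for m in range(0, 3 * n)
)

print(formula_holds)
print(order(8, 12))   # 12 / gcd(8, 12) = 12 / 4 = 3
```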

Remark 2.4. As a consequence, the order of every element of Z/nZ divides $n =
|\mathbb{Z}/n\mathbb{Z}|$, the order of the group. We will see later (Example 8.15) that this is a
general feature of the order of elements in any finite group. ⌟

Corollary 2.5. The class $[m]_n$ generates Z/nZ if and only if $\gcd(m, n) = 1$.

This simple result is quite important. For example, if $n = p$ is a prime integer,
it shows that every nonzero class in the group Z/pZ generates it. In any case,
it allows us to construct more examples of interesting groups. The reader should
check (or recall; cf. Exercise 2.14) that there also is a well-defined multiplication on
Z/nZ, given by

\[ [a]_n \cdot [b]_n := [ab]_n. \]

This operation does not define a group structure on Z/nZ: indeed, the class $[0]_n$
does not have a multiplicative inverse. On the other hand, for any positive $n$ denote
by (Z/nZ)* the subset of Z/nZ consisting of classes $[m]_n$ such that $\gcd(m, n) = 1$:

\[ (\mathbb{Z}/n\mathbb{Z})^* := \{ [m]_n \in \mathbb{Z}/n\mathbb{Z} \mid \gcd(m, n) = 1 \}. \]

This subset is clearly well-defined: if $m \equiv m' \bmod n$, then $\gcd(m, n) = 1 \iff
\gcd(m', n) = 1$ (Exercise 2.17), so the defining property of classes in (Z/nZ)* is
independent of representatives.

Proposition 2.6. Multiplication makes (Z/nZ)* into a group.
Proof. Simple properties of gcd’s show that if $\gcd(m_1, n) = \gcd(m_2, n) = 1$, then
$\gcd(m_1 m_2, n) = 1$. (For example, if a prime integer divided both $n$ and $m_1 m_2$, then
it would necessarily divide $m_1$ or $m_2$, and one of the two gcd’s would not be 1.)
Therefore, the product of two elements in (Z/nZ)* is an element of (Z/nZ)*, and
· does define a binary operation

\[ (\mathbb{Z}/n\mathbb{Z})^* \times (\mathbb{Z}/n\mathbb{Z})^* \to (\mathbb{Z}/n\mathbb{Z})^*. \]

It is clear that this operation is associative (because multiplication is associative
in Z); $[1]_n$ is an element of (Z/nZ)* and is an identity with respect to multiplication;
so all we have to check is that elements of (Z/nZ)* have multiplicative inverses in
(Z/nZ)*.

This follows from Corollary 2.5. If $\gcd(m, n) = 1$, then $[m]_n$ generates the
additive group Z/nZ, and hence some multiple of $[m]_n$ must equal $[1]_n$:

\[ (\exists a \in \mathbb{Z}) : \quad a \cdot [m]_n = [1]_n; \]

this implies

\[ [a]_n [m]_n = [1]_n. \]

Therefore $[m]_n$ does have a multiplicative inverse in Z/nZ, namely $[a]_n$. The reader
will verify that $\gcd(a, n) = 1$, completing the proof. □

For instance, $[8]_{15}$ has a multiplicative inverse in (Z/15Z)*. Tracing the argument
given in the proof,

\[ 2 \cdot 8 + (-1) \cdot 15 = 1, \]

and hence $[2]_{15} \cdot [8]_{15} = [1]_{15}$: the multiplicative inverse of $[8]_{15}$ is $[2]_{15}$.
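An identity such as $2 \cdot 8 + (-1) \cdot 15 = 1$ can be produced systematically. The sketch below uses the extended Euclidean algorithm, which is a standard technique but is not the argument used in the proof above (the proof goes through Corollary 2.5). The function names `extended_gcd` and `inverse_mod` are chosen for this illustration.

```python
from math import gcd


def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and s*a + t*b = g (for a, b >= 0)."""
    if b == 0:
        return a, 1, 0
    g, s, t = extended_gcd(b, a % b)
    # g = s*b + t*(a % b) = t*a + (s - (a // b)*t)*b
    return g, t, s - (a // b) * t


def inverse_mod(m, n):
    """Representative in 0..n-1 of the multiplicative inverse of [m]_n."""
    g, s, _ = extended_gcd(m % n, n)
    if g != 1:
        raise ValueError("the class is not in (Z/nZ)*: gcd(m, n) != 1")
    return s % n


print(extended_gcd(8, 15))   # coefficients s, t with s*8 + t*15 = 1
print(inverse_mod(8, 15))

# Every class in (Z/15Z)* has an inverse, and that inverse is again a unit.
units_15 = [m for m in range(15) if gcd(m, 15) == 1]
inverses_ok = all(
    (m * inverse_mod(m, 15)) % 15 == 1 and inverse_mod(m, 15) in units_15
    for m in units_15
)
print(units_15, inverses_ok)
```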

For $n = p$ a positive prime integer, the group $((\mathbb{Z}/p\mathbb{Z})^*, \cdot)$ has order $(p - 1)$.
We will have more to say about these groups in later sections (cf. Example 4.6). 


Exercises 


2.1. ¬ One can associate an $n \times n$ matrix $M_\sigma$ with a permutation $\sigma \in S_n$ by letting
the entry at $(i, \sigma(i))$ be 1 and letting all other entries be 0. For example, the matrix
corresponding to the permutation 


σ =



∈ S_3


would be 



Prove that, with this notation,

\[ M_{\sigma\tau} = M_\sigma M_\tau \]

for all $\sigma, \tau \in S_n$, where the product on the right is the ordinary product of matrices.
[IV.4.13]

2.2. ▷ Prove that if $d \le n$, then $S_n$ contains elements of order $d$. [§2.1]

2.3. For every positive integer $n$ find an element of order $n$ in $S_n$.

2.4. Define a homomorphism $D_8 \to S_4$ by labeling vertices of a square, as we did
for a triangle in §2.2. List the 8 permutations in the image of this homomorphism.

2.5. ▷ Describe generators and relations for all dihedral groups $D_{2n}$. (Hint: Let $x$
be the reflection about a line through the center of a regular $n$-gon and a vertex, and
let $y$ be the counterclockwise rotation by $2\pi/n$. The group $D_{2n}$ will be generated
by $x$ and $y$, subject to three relations¹⁴. To see that these relations really determine
$D_{2n}$, use them to show that any product $x^{i_1} y^{i_2} x^{i_3} y^{i_4} \cdots$ equals $x^i y^j$ for some $i$, $j$
with $0 \le i \le 1$, $0 \le j < n$.) [8.4, §IV.2.5]

2.6. ▷ For every positive integer $n$ construct a group containing two elements $g$, $h$
such that $|g| = 2$, $|h| = 2$, and $|gh| = n$. (Hint: For $n > 1$, $D_{2n}$ will do.) [§1.6]

2.7. Find all elements of $D_{2n}$ that commute with every other element. (The parity
of n plays a role.) 

2.8. Find the orders of the groups of symmetries of the five ‘platonic solids’. 

2.9. Verify carefully that ‘congruence mod $n$’ is an equivalence relation.

2.10. Prove that Z/nZ consists of precisely n elements. 

2.11. ▷ Prove that the square of every odd integer is congruent to 1 modulo 8.
[§VII.5.1]

2.12. Prove that there are no nonzero integers $a$, $b$, $c$ such that $a^2 + b^2 = 3c^2$. (Hint: By
studying the equation $[a]_4^2 + [b]_4^2 = 3[c]_4^2$ in Z/4Z, show that $a$, $b$, $c$ would all have
to be even. Letting $a = 2k$, $b = 2\ell$, $c = 2m$, you would have $k^2 + \ell^2 = 3m^2$. What’s
wrong with that?)

2.13. ▷ Prove that if $\gcd(m, n) = 1$, then there exist integers $a$ and $b$ such that

\[ am + bn = 1. \]

(Use Corollary 2.5.) Conversely, prove that if $am + bn = 1$ for some integers $a$ and
$b$, then $\gcd(m, n) = 1$. [2.15, §V.2.1, V.2.4]

2.14. ▷ State and prove an analog of Lemma 2.2, showing that the multiplication
on Z/nZ is a well-defined operation. [§2.3, §III.1.2]

2.15. ¬ Let $n > 0$ be an odd integer.

• Prove that if $\gcd(m, n) = 1$, then $\gcd(2m + n, 2n) = 1$. (Use Exercise 2.13.)

• Prove that if $\gcd(r, 2n) = 1$, then $\gcd(\frac{r + n}{2}, n) = 1$. (Ditto.)

• Conclude that the function $[m]_n \mapsto [2m + n]_{2n}$ is a bijection between (Z/nZ)*
and (Z/2nZ)*.


14 Two relations are evident. To ‘see’ the third one, hold your right hand in front of and
away from you, pointing your fingers at the vertices of an imaginary regular pentagon. Flip the
pentagon by turning the hand toward you; rotate it counterclockwise w.r.t. the line of sight by
72°; flip it again by pointing it away from you; and rotate it counterclockwise a second time. This
returns the hand to the initial position. What does this tell you?
The number $\phi(n)$ of elements of (Z/nZ)* is Euler’s $\phi$-function. The reader has just
proved that if $n$ is odd, then $\phi(2n) = \phi(n)$. Much more general formulas will be
given later on (cf. Exercise V.6.8). [VII.5.11]

2.16. Find the last digit of $1238237^{18238450}$. (Work in Z/10Z.)

2.17. ▷ Show that if $m \equiv m' \bmod n$, then $\gcd(m, n) = 1$ if and only if $\gcd(m', n) =
1$. [§2.3]

2.18. For $d \le n$, define an injective function $\mathbb{Z}/d\mathbb{Z} \to S_n$ preserving the operation,
that is, such that the sum of equivalence classes in Z/dZ corresponds to the product
of the corresponding permutations.

2.19. ▷ Both (Z/5Z)* and (Z/12Z)* consist of 4 elements. Write their multiplication
tables, and prove that no re-ordering of the elements will make them match.
(Cf. Exercise 1.6.) [§4.3]


3. The category Grp 

Groups will be the objects of the category Grp. In this section we define the 
morphisms in the category and deal with simple properties of these morphisms. 

3.1. Group homomorphisms. As we know, a group consists of two distinct
types of information: a set $G$ and an operation¹⁵

\[ m_G : G \times G \to G \]

satisfying certain properties. For two groups $(G, m_G)$ and $(H, m_H)$, a group homomorphism

\[ \varphi : (G, m_G) \to (H, m_H) \]

is first of all a function (usually given the same name, $\varphi$ in this case) between the
underlying sets; but this function must ‘know about’ the operations $m_G$ on $G$, $m_H$
on $H$. What is the most natural requirement of this sort?

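Note that the set-function $\varphi : G \to H$ determines a function

\[ (\varphi \times \varphi) : G \times G \to H \times H : \]

we could invoke the universal property of products to obtain this function (cf. Exercise 3.1),
but since we are dealing with sets, there is no need for fancy language
here: just define the function by

\[ (\forall (a, b) \in G \times G) : \quad (\varphi \times \varphi)(a, b) = (\varphi(a), \varphi(b)). \]

There is a diagram combining all these maps:

\[
\begin{array}{ccc}
G \times G & \xrightarrow{\ \varphi \times \varphi\ } & H \times H \\
{\scriptstyle m_G} \downarrow & & \downarrow {\scriptstyle m_H} \\
G & \xrightarrow{\quad \varphi \quad} & H
\end{array}
\]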

15 In §1, $m_G$ was denoted •; here we need to keep track of operations on different groups, so
for a moment we will use a symbol recording the group (and evoking ‘multiplication’).
What requirement could be more natural than asking that this diagram commute?

Definition 3.1. The set-function $\varphi : G \to H$ defines a group homomorphism if the
diagram displayed above commutes. ⌟


This is a seemingly complicated way of saying something simple: since $\varphi$ and
$m_G$, $m_H$ are functions of sets, commutativity means the following. For all $a, b \in G$,
the two ways to travel through the diagram give

\[
(a, b) \mapsto a \cdot b \mapsto \varphi(a \cdot b)
\qquad\text{and}\qquad
(a, b) \mapsto (\varphi(a), \varphi(b)) \mapsto \varphi(a) \cdot \varphi(b),
\]

where we now write · for both operations: in $G$ on the left, in $H$ on the right.
Commutativity of the diagram means that we must get the same result in both
cases; therefore, Definition 3.1 can be rephrased as

the set-function $\varphi : G \to H$ is a group homomorphism if $\forall a, b \in G$

\[ \varphi(a \cdot b) = \varphi(a) \cdot \varphi(b). \]


In other words, $\varphi$ is a homomorphism if it ‘preserves the structure’. This may
sound more familiar to our reader. As usual, the reason to bring in diagrams (as 
in Definition 3.1) is that this would make it easy to transfer part of the discussion 
to other categories. 

If the context is clear, one may simply write ‘homomorphism’, omitting the 
qualifier ‘group’. 
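For finite groups, the rephrased condition $\varphi(a \cdot b) = \varphi(a) \cdot \varphi(b)$ can be tested exhaustively. The sketch below is only an illustration with made-up names (`is_homomorphism`, `reduce_mod_3`, `shift`); it works with the additive groups Z/6Z and Z/3Z of §2.3, each class being represented by its smallest non-negative representative.

```python
def is_homomorphism(phi, G, op_G, op_H):
    """Check phi(a * b) == phi(a) * phi(b) for all a, b in the finite group G."""
    return all(phi(op_G(a, b)) == op_H(phi(a), phi(b)) for a in G for b in G)


Z6 = range(6)


def add6(a, b):
    return (a + b) % 6


def add3(a, b):
    return (a + b) % 3


def reduce_mod_3(a):
    """[a]_6 -> [a]_3 on representatives 0..5."""
    return a % 3


def shift(a):
    """A set-function Z/6Z -> Z/3Z that ignores the group structure."""
    return (a + 1) % 3


print(is_homomorphism(reduce_mod_3, Z6, add6, add3))  # preserves the operation
print(is_homomorphism(shift, Z6, add6, add3))         # does not
```

The second function already fails at $a = b = 0$, since it does not send the identity to the identity.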

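3.2. Grp: Definition. For $G$, $H$ groups¹⁶ we define

\[ \mathrm{Hom}_{\mathsf{Grp}}(G, H) \]

to be the set of group homomorphisms $G \to H$.

If $G$, $H$, $K$ are groups and $\varphi : G \to H$, $\psi : H \to K$ are two group homomorphisms,
it is easy to check that the composition $\psi \circ \varphi : G \to K$ is a group
homomorphism: from the diagram point of view, this amounts to observing that
the ‘outer rectangle’ in

\[
\begin{array}{ccccc}
\multicolumn{5}{c}{(\psi \circ \varphi) \times (\psi \circ \varphi)} \\
G \times G & \xrightarrow{\ \varphi \times \varphi\ } & H \times H & \xrightarrow{\ \psi \times \psi\ } & K \times K \\
\downarrow & & \downarrow & & \downarrow \\
G & \xrightarrow{\quad \varphi \quad} & H & \xrightarrow{\quad \psi \quad} & K \\
\multicolumn{5}{c}{\psi \circ \varphi}
\end{array}
\]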

16 We are yielding to the usual abuse of language that lets us omit explicit mention of the 
operation. 
must commute if the two ‘inner rectangles’ commute. For arbitrary $a, b \in G$, this
means

\begin{align*}
(\psi \circ \varphi)(a \cdot b) = \psi(\varphi(a \cdot b)) &\overset{(1)}{=} \psi(\varphi(a) \cdot \varphi(b)) \overset{(2)}{=} \psi(\varphi(a)) \cdot \psi(\varphi(b)) \\
&= (\psi \circ \varphi)(a) \cdot (\psi \circ \varphi)(b)
\end{align*}

where (1) holds because $\varphi$ is a homomorphism and (2) holds because $\psi$ is a homomorphism.

Further, it is clear that composition is associative (because it is for set-functions)
and that the identity function $\mathrm{id}_G : G \to G$ is a group homomorphism. Therefore,
Grp is indeed a category.

3.3. Pause for reflection. The careful reader might raise an objection: the group
axioms prescribe the existence of an identity element $e_G$ and of an ‘inverse’, that
is, a specific function

\[ \iota_G : G \to G, \qquad \iota_G(g) := g^{-1}. \]

Shouldn’t the definition of morphism in Grp keep track of this type of data? The
definition we have given only keeps track of the multiplication map $m_G$.

The reason why we can get away with this is that preserving $m$ automatically
preserves $e$ and $\iota$:

Proposition 3.2. Let $\varphi : G \to H$ be a group homomorphism. Then

• $\varphi(e_G) = e_H$;

• $\forall g \in G$, $\varphi(g^{-1}) = \varphi(g)^{-1}$.

In terms of diagrams, the second assertion amounts to saying that

\[
\begin{array}{ccc}
G & \xrightarrow{\ \varphi\ } & H \\
{\scriptstyle \iota_G} \downarrow & & \downarrow {\scriptstyle \iota_H} \\
G & \xrightarrow{\ \varphi\ } & H
\end{array}
\]

must commute.

Proof. The first item follows from the definition of homomorphism and cancellation:
since $e_G = e_G \cdot e_G$,

\[ e_H \cdot \varphi(e_G) = \varphi(e_G) = \varphi(e_G \cdot e_G) = \varphi(e_G) \cdot \varphi(e_G), \]

which implies $e_H = \varphi(e_G)$ by ‘cancelling $\varphi(e_G)$’.

The proof of the second assertion is similar: $\forall g \in G$,

\[ \varphi(g^{-1}) \cdot \varphi(g) = \varphi(g^{-1} \cdot g) = \varphi(e_G) = e_H = \varphi(g)^{-1} \cdot \varphi(g), \]

implying $\varphi(g^{-1}) = \varphi(g)^{-1}$ by cancellation. □
3.4. Products et al. The categories Grp and Set look rather alike at first: given
a group, we can ‘forget’ the information of multiplication and we are left with a set;
given a group homomorphism, we can forget that it preserves the multiplication
and we are left with a set-function. A concise way to express this fact is that there
is a ‘functor’ Grp ⇝ Set (called in fact a ‘forgetful’ functor); we will deal with
functors more extensively much later on, starting in Chapter VIII.

However, there are important differences between these two categories. For 
example, recall that Set has a unique initial object (that is, ∅) and this is not the
same as the final objects (that is, the singletons). Also recall that a trivial group 
is a group consisting of a single element (Example 1.3). 

Proposition 3.3. Trivial groups are both initial and final in Grp. 

This makes trivial groups ‘zero-objects’ of the category Grp.

Proof. It should be clear that trivial groups are final: there is only one function 
from a set to a singleton, that is, the constant function; this is vacuously a group 
homomorphism. 

To see that trivial groups are initial, let $T = \{e\}$ be a trivial group; for any
group $G$, define $\varphi : T \to G$ by $\varphi(e) = e_G$. This is clearly a group homomorphism,
and it is the only possible one since every group homomorphism must send the 
identity to the identity (Proposition 3.2). □ 

Here is a similarity: Grp has products; in fact, the product of two groups G, H 
is supported on the product $G \times H$ of the underlying sets.

To see this, we need to define a multiplication on $G \times H$; the catchword here
is componentwise: define the operation in $G \times H$ by performing the operation on
each component separately. Explicitly, define $\forall g_1, g_2 \in G$, $\forall h_1, h_2 \in H$

\[ (g_1, h_1) \cdot (g_2, h_2) := (g_1 g_2, h_1 h_2). \]

This operation defines a group structure on $G \times H$: the operation is associative, the
identity is $(e_G, e_H)$, and the inverse of $(g, h)$ is $(g^{-1}, h^{-1})$. All needed verifications
are left to the reader. The group $G \times H$ is called the direct product of the groups $G$
and $H$.

Also note that the natural projections

\[ G \xleftarrow{\ \pi_G\ } G \times H \xrightarrow{\ \pi_H\ } H \]

(defined as set-functions as in §I.2.4) are group homomorphisms: again, this follows
immediately from the definitions.

Proposition 3.4. With operation defined componentwise, G x H is a product 
in Grp. 

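Proof. Recall (§I.5.4) that this means that $G \times H$ satisfies the following universal
property: for any group $A$ and any choice of group homomorphisms $\varphi_G : A \to G$,
$\varphi_H : A \to H$, there exists a unique group homomorphism $\varphi_G \times \varphi_H$ making the
diagram

\[
\begin{array}{ccc}
 & & G \\
 & \overset{\varphi_G}{\nearrow} & \uparrow {\scriptstyle \pi_G} \\
A & \xrightarrow{\ \varphi_G \times \varphi_H\ } & G \times H \\
 & \underset{\varphi_H}{\searrow} & \downarrow {\scriptstyle \pi_H} \\
 & & H
\end{array}
\]

commute.

Now, a unique set-function $\varphi_G \times \varphi_H$ exists making the diagram commute,
because the set $G \times H$ is a product of $G$ and $H$ in Set. So we only need to check
that $\varphi_G \times \varphi_H$ is a group homomorphism, and this is immediate (if cumbersome):
$\forall a, b \in A$,

\begin{align*}
\varphi_G \times \varphi_H(ab) &= (\varphi_G(ab), \varphi_H(ab)) = (\varphi_G(a)\varphi_G(b), \varphi_H(a)\varphi_H(b)) \\
&= (\varphi_G(a), \varphi_H(a))(\varphi_G(b), \varphi_H(b)) = (\varphi_G \times \varphi_H(a))(\varphi_G \times \varphi_H(b)).
\end{align*}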

□ 
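The componentwise construction is simple enough to try out on small examples. The following sketch is a finite illustration only, under the assumption that Z/2Z and Z/3Z are represented by the integers 0, 1 and 0, 1, 2 with addition modulo 2 and 3; the helper `direct_product` and the sample maps out of Z/4Z are hypothetical names introduced for this example.

```python
from itertools import product


def direct_product(G, op_G, H, op_H):
    """Return the set G x H together with the componentwise operation."""
    elements = list(product(G, H))

    def op(p, q):
        return (op_G(p[0], q[0]), op_H(p[1], q[1]))

    return elements, op


def add2(a, b):
    return (a + b) % 2


def add3(a, b):
    return (a + b) % 3


GH, op = direct_product(range(2), add2, range(3), add3)
identity = (0, 0)

# Group axioms for the componentwise operation.
associative = all(op(op(p, q), r) == op(p, op(q, r)) for p in GH for q in GH for r in GH)
has_identity = all(op(identity, p) == p == op(p, identity) for p in GH)
has_inverses = all(any(op(p, q) == identity for q in GH) for p in GH)

# A pair of homomorphisms out of A = Z/4Z: reduction mod 2, and the trivial map to Z/3Z.
A = range(4)


def phi_G(a):
    return a % 2


def phi_H(a):
    return 0


def pair(a):
    """The induced map A -> G x H, a |-> (phi_G(a), phi_H(a))."""
    return (phi_G(a), phi_H(a))


pair_is_hom = all(pair((a + b) % 4) == op(pair(a), pair(b)) for a in A for b in A)

print(len(GH), associative, has_identity, has_inverses, pair_is_hom)
```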

What about coproducts? They do exist in Grp, but their construction requires 
handling presentations more proficiently than we do right now, and general coprod- 
ucts of groups will not be used in the rest of the book; so the reader will have to 
deal with them on his or her own. For an important example, see Exercise 3.8; 
more will show up in Exercises 5.6 and 5.7, since free groups are themselves par- 
ticular cases of coproducts. The reader will finally produce the coproduct of any 
two groups explicitly in Exercise 8.7. For now, just realize that the disjoint union, 
which works as a coproduct in Set (Proposition 1.5.6), is not an option in Grp: there 
is no reasonable group structure on the disjoint union. The coproduct of G and H 
in Grp is denoted G * H and is called the free product of G and H. 

3.5. Abelian groups. The category Ab whose objects are abelian groups, and 
whose morphisms are group homomorphisms, will in a sense be more important for 
us than the category Grp. In many ways, as we will see, Ab is a ‘nicer’ category¹⁷
than Grp. Again the trivial groups are both initial and final (that is, ‘zero’) objects; 
products exist and coincide with products in Grp. But here is a difference: unlike 
in Grp, coproducts in Ab coincide with products. That is, if G and H are abelian 
groups, then the product $G \times H$ (with the two natural homomorphisms $G \to G \times H$,
$H \to G \times H$) satisfies the universal property for coproducts in Ab (cf. Exercise 3.3).
When working as a coproduct, the product $G \times H$ of two abelian groups is often
called their direct sum and is denoted $G \oplus H$.

There is a pretty subtlety here, which may highlight the power of the language:
even if $G$ and $H$ are commutative, the product $G \times H$ does not (necessarily) satisfy
the universal property for coproducts in Grp, even if it does in Ab. For an explicit
example, see Exercise 3.6.

17 As we will see in due time (Proposition III.5.3), Ab is one instance of a general class of
categories of ‘modules over a commutative ring $R$’ (for $R = \mathbb{Z}$). Unlike Grp, these categories are
abelian, which makes them very well-behaved. We will learn two or three things about abelian
categories in Chapter IX.


Exercises 

3.1. ▷ Let $\varphi : G \to H$ be a morphism in a category C with products. Explain why
there is a unique morphism

\[ (\varphi \times \varphi) : G \times G \to H \times H. \]

(This morphism is defined explicitly for C = Set in §3.1.) [§3.1, 3.2]

3.2. Let $\varphi : G \to H$, $\psi : H \to K$ be morphisms in a category with products, and
consider morphisms between the products $G \times G$, $H \times H$, $K \times K$ as in Exercise 3.1.
Prove that

\[ (\psi\varphi) \times (\psi\varphi) = (\psi \times \psi)(\varphi \times \varphi). \]

(This is part of the commutativity of the diagram displayed in §3.2.)

3.3. ▷ Show that if $G$, $H$ are abelian groups, then $G \times H$ satisfies the universal
property for coproducts in Ab (cf. §I.5.5). [§3.5, 3.6, §III.6.1]

3.4. Let $G$, $H$ be groups, and assume that $G \cong H \times G$. Can you conclude that $H$
is trivial? (Hint: No. Can you construct a counterexample?)

3.5. Prove that Q is not the direct product of two nontrivial groups. 

3.6. ▷ Consider the product of the cyclic groups $C_2$, $C_3$ (cf. §2.3): $C_2 \times C_3$. By
Exercise 3.3, this group is a coproduct of $C_2$ and $C_3$ in Ab. Show that it is not a
coproduct of $C_2$ and $C_3$ in Grp, as follows:

• find injective homomorphisms $C_2 \to S_3$, $C_3 \to S_3$;

• arguing by contradiction, assume that $C_2 \times C_3$ is a coproduct of $C_2$, $C_3$, and
deduce that there would be a group homomorphism $C_2 \times C_3 \to S_3$ with certain
properties;

• show that there is no such homomorphism.

[§3.5]

3.7. Show that there is a surjective homomorphism $\mathbb{Z} * \mathbb{Z} \to C_2 * C_3$. ($*$ denotes
coproduct in Grp; cf. §3.4.)

One can think of $\mathbb{Z} * \mathbb{Z}$ as a group with two generators $x$, $y$, subject to no
relations whatsoever. (We will study a general version of such groups in §5; see
Exercise 5.6.)

3.8. ▷ Define a group $G$ with two generators $x$, $y$, subject (only) to the relations
$x^2 = e_G$, $y^3 = e_G$. Prove that $G$ is a coproduct of $C_2$ and $C_3$ in Grp. (The reader
will obtain an even more concrete description for $C_2 * C_3$ in Exercise 9.14; it is
called the modular group.) [§3.4, 9.14]

3.9. Show that fiber products and coproducts exist in Ab. (Cf. Exercise 1.5.12.) 
4. Group homomorphisms

4.1. Examples. For any two groups $G$, $H$, the set $\mathrm{Hom}_{\mathsf{Grp}}(G, H)$ is certainly
nonempty: at the very least we can define a homomorphism $G \to H$ by sending
every element of $G$ to the identity $e_H$ of $H$. Thus, $\mathrm{Hom}_{\mathsf{Grp}}(G, H)$ is a ‘pointed set’
(cf. Example I.3.8).

There is a purely categorical way to think about this: since Grp has zero-objects 
{*} (recall that trivial groups are both initial and final in Grp; cf. Proposition 3.3), 
there are unique morphisms 

\[ G \to \{*\}, \qquad \{*\} \to H \]

in Grp; their composition is the distinguished element of $\mathrm{Hom}_{\mathsf{Grp}}(G, H)$ mentioned
above; we will call this element the trivial morphism. 

More ‘meaningful’ examples can be constructed by considering the groups encountered
in §2: for instance, the function $D_6 \to S_3$ defined in §2.2 is a homomorphism.
Such examples will likely all be instances of the notion of group action. In
general, an ‘action’ of a group $G$ on an object $A$ of a category C is a homomorphism

\[ G \to \mathrm{Aut}_{\mathsf{C}}(A); \]

that is, a group $G$ ‘acts’ on an object if the elements of $G$ determine isomorphisms
of that object to itself, in a way compatible with compositions. If C = Set, this
means that elements of $G$ determine specific permutations of the set $A$. For example,
the symmetries of an equilateral triangle (that is, elements of $D_6$) determine
permutations of the vertices of that triangle (that is, permutations of a set with
three elements), and they do so compatibly with composition; this is what gives
us homomorphisms $D_6 \to S_3$. We say that $D_6$ ‘acts on the set of vertices’ of the
triangle.

The reader can (and should) construct many more examples of this kind. We 
will have much more to say about actions of groups and other algebraic entities in
later sections. 

Here is an example with a different flavor: the exponential function is a homomorphism
from $(\mathbb{R}, +)$ to the group $(\mathbb{R}^{>0}, \cdot)$ of positive real numbers, with ordinary
multiplication as operation. Indeed, $e^{a+b} = e^a e^b$. A similar (and very important)
class of examples may be obtained as follows: let $G$ be any group and $g \in G$ any
element of $G$; define an ‘exponential map’ $\varepsilon_g : \mathbb{Z} \to G$ by

\[ (\forall a \in \mathbb{Z}) : \quad \varepsilon_g(a) := g^a. \]

Then $\varepsilon_g$ is (clearly) a group homomorphism. The element $g$ generates $G$ if and only
if $\varepsilon_g$ is surjective.
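To get a feeling for the exponential map, one may compute it in a small multiplicative group. The sketch below is an illustration under stated assumptions: it takes $G = (\mathbb{Z}/7\mathbb{Z})^*$ with classes represented by $1, \dots, 6$, evaluates $g^a$ only for a finite window of non-negative exponents, and uses the hypothetical names `exp_map` and `image`.

```python
P = 7  # we work in (Z/7Z)*, represented by the integers 1..6


def exp_map(g, a):
    """epsilon_g(a) = g^a in (Z/7Z)*, for a non-negative integer a."""
    return pow(g, a, P)


def image(g, window=12):
    """Values of epsilon_g on the exponents 0..window-1 (a finite window)."""
    return {exp_map(g, a) for a in range(window)}


# Homomorphism property from (Z, +) to ((Z/7Z)*, .), checked on a finite window.
hom_ok = all(
    exp_map(3, a + b) == (exp_map(3, a) * exp_map(3, b)) % P
    for a in range(20)
    for b in range(20)
)

units = set(range(1, P))
print(hom_ok)
print(image(3) == units)   # 3 generates (Z/7Z)*
print(image(2))            # 2 does not: its powers are 1, 2, 4
```

In this example the window of 12 exponents is more than enough, since the powers of an element of a group with 6 elements repeat after at most 6 steps.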

One concrete instance of this homomorphism (in the abelian environment, thus
using multiples rather than powers) is the ‘quotient’ function $\pi_n : \mathbb{Z} \to \mathbb{Z}/n\mathbb{Z}$,

\[ a \mapsto a \cdot [1]_n = [a]_n : \]

with the notation introduced above, this is $\varepsilon_{[1]_n}$. This function is surjective; hence
$[1]_n$ generates Z/nZ. In fact, as observed in §2.3 (Corollary 2.5), $[m]_n$ generates
Z/nZ if and only if $\gcd(m, n) = 1$.
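If $m \mid n$, there is a homomorphism

\[ \pi^n_m : \mathbb{Z}/n\mathbb{Z} \to \mathbb{Z}/m\mathbb{Z} \]

making the diagram

\[
\begin{array}{ccc}
 & \mathbb{Z} & \\
{\scriptstyle \pi_n} \swarrow & & \searrow {\scriptstyle \pi_m} \\
\mathbb{Z}/n\mathbb{Z} & \xrightarrow{\ \pi^n_m\ } & \mathbb{Z}/m\mathbb{Z}
\end{array}
\]

commute: that is,

\[ \pi^n_m([a]_n) = [a]_m; \]

the reader should check carefully that this function is well-defined (Exercise 4.1).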

If $m_1$ and $m_2$ are both divisors of $n$, we have homomorphisms $\pi_{m_1}^n$, $\pi_{m_2}^n$ from
$\mathbb{Z}/n\mathbb{Z}$ to both $\mathbb{Z}/m_1\mathbb{Z}$ and $\mathbb{Z}/m_2\mathbb{Z}$ and hence to their direct product. For instance,
since $6 = 2 \cdot 3$, there is a homomorphism

$$\mathbb{Z}/6\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/3\mathbb{Z}$$

(or, in ‘multiplicative notation’, $C_6 \to C_2 \times C_3$). Explicitly,

$$[0]_6 \mapsto ([0]_2, [0]_3), \quad [1]_6 \mapsto ([1]_2, [1]_3), \quad [2]_6 \mapsto ([0]_2, [2]_3),$$

$$[3]_6 \mapsto ([1]_2, [0]_3), \quad [4]_6 \mapsto ([0]_2, [1]_3), \quad [5]_6 \mapsto ([1]_2, [2]_3).$$

Note that this homomorphism is a bijection; as we will see in a moment (§4.3), this
makes it an isomorphism; in particular, $C_6$ is also a product of $C_2$ and $C_3$ in Grp.
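
The six values listed above can be recomputed mechanically. The Python sketch below
is an illustration only (the name `to_product` is hypothetical, and classes are again
represented by their smallest nonnegative representatives); it checks that the map
$\mathbb{Z}/6\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/3\mathbb{Z}$ is a bijection and preserves addition.

```python
from itertools import product


def to_product(a: int) -> tuple:
    """Send [a]_6 to the pair ([a]_2, [a]_3)."""
    return (a % 2, a % 3)


images = [to_product(a) for a in range(6)]

# Bijection: the six images are exactly the six elements of the product.
is_bijection = sorted(images) == sorted(product(range(2), range(3)))

# Homomorphism: addition in the product is componentwise.
is_homomorphism = all(
    to_product((a + b) % 6)
    == ((to_product(a)[0] + to_product(b)[0]) % 2,
        (to_product(a)[1] + to_product(b)[1]) % 3)
    for a in range(6) for b in range(6)
)

for a, image in enumerate(images):
    print(f"[{a}]_6 -> {image}")
print(is_bijection, is_homomorphism)
```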

One can concoct a homomorphism $\mathbb{Z}/n\mathbb{Z} \to \mathbb{Z}/m\mathbb{Z}$ also if $n \mid m$: for example,
the function $\mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}$ defined by

$$[0]_2 \mapsto [0]_4, \quad [1]_2 \mapsto [2]_4$$

is clearly a group homomorphism. Unlike $\pi_m^n$, this homomorphism is not nicely
compatible$^{18}$ with the homomorphisms $\pi_n$.

On the other hand, is there a nontrivial group homomorphism (for example)
$C_4 \to C_7$? Note that there are $7^4 = 2{,}401$ set-functions from $C_4$ to $C_7$ (cf. Exercise
I.2.10); the question is whether any of these functions (besides the trivial
homomorphism sending everything to e) preserves the operation. We already know 
that a homomorphism must send the identity to the identity (Proposition 3.2), and 
that already rules out all but $7^3 = 343$ functions (why?); still, it is unrealistic to write
all of them out explicitly to see if any is a homomorphism. 

The reader should think about this before we spill the beans in the next sub- 
section. 



18 Also, note that while $\pi_m^n$ preserves multiplication as well as sum, this new homomorphism
does not; that is, it is not a ‘ring homomorphism’. This is immediately visible in the given
example: $[1]_2 \cdot [1]_2 = [1]_2$, but $[2]_4 \cdot [2]_4 = [0]_4$.

4.2. Homomorphisms and order. Group homomorphisms are set-functions preserving
the group structure; as such, they must preserve many features of the theory.
Proposition 3.2 is an instance of this principle: group homomorphisms must
preserve identities and inverses. It is also clear that if $\varphi : G \to H$ is a group
homomorphism and $g$ is an element of finite order in $G$, then $\varphi(g)$ must be an element
of finite order in $H$: indeed, if $g^n = e_G$ for some $n > 0$, then

$$\varphi(g)^n = \varphi(g^n) = \varphi(e_G) = e_H.$$

In fact, this observation establishes a more precise statement: 

Proposition 4.1. Let $\varphi : G \to H$ be a group homomorphism, and let $g \in G$ be an
element of finite order. Then $|\varphi(g)|$ divides $|g|$.

Proof. As observed, $\varphi(g)^{|g|} = e_H$; applying Lemma 1.10 gives the statement. □

Example 4.2. There are no nontrivial homomorphisms $\mathbb{Z}/n\mathbb{Z} \to \mathbb{Z}$: indeed, the
image of every element of $\mathbb{Z}/n\mathbb{Z}$ must have finite order, and the only element with
finite order in $(\mathbb{Z}, +)$ is $0$.

There are no nontrivial homomorphisms $\varphi : C_4 \to C_7$. Indeed, the orders of
elements in $C_4$ divide 4 (cf. Proposition 2.3), and the orders of elements in $C_7$
divide 7. Thus, the order of each $\varphi(g)$ must divide both 4 and 7; this forces
$|\varphi(g)| = 1$ for all $g$, that is, $\varphi(g) = e$ for all $g \in C_4$. ⌟

Of course the order itself is not preserved: for example, $1 \in \mathbb{Z}$ has infinite
order, while $[1]_n = \pi_n(1) \in \mathbb{Z}/n\mathbb{Z}$ has order $n$ (with notation as in §4.1). Order is
preserved through isomorphisms, as we will see in a moment.
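
The conclusion about $C_4 \to C_7$ can also be confirmed by the brute-force search that was
dismissed as unrealistic by hand. The Python sketch below is an illustration under
stated assumptions: the cyclic groups are modeled additively as $\mathbb{Z}/4\mathbb{Z}$ and $\mathbb{Z}/7\mathbb{Z}$, and
the helper name `homomorphisms` is hypothetical. It runs through every function that
sends the identity to the identity and keeps those preserving the operation.

```python
from itertools import product


def homomorphisms(n: int, m: int) -> list:
    """All group homomorphisms Z/nZ -> Z/mZ, found by exhaustive search.

    A function is stored as the tuple (f(0), f(1), ..., f(n-1)).
    """
    found = []
    # A candidate sends 0 to 0; the images of 1, ..., n-1 are free choices.
    for rest in product(range(m), repeat=n - 1):
        f = (0,) + rest
        if all(f[(a + b) % n] == (f[a] + f[b]) % m
               for a in range(n) for b in range(n)):
            found.append(f)
    return found


print(homomorphisms(4, 7))  # homomorphisms C_4 -> C_7
print(homomorphisms(2, 4))  # homomorphisms Z/2Z -> Z/4Z
```

The first search returns only the trivial homomorphism, in agreement with Example 4.2;
the second also finds the map $[1]_2 \mapsto [2]_4$ of §4.1.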

4.3. Isomorphisms. An isomorphism of groups $\varphi : G \to H$ is (of course) an
isomorphism in Grp, that is, a group homomorphism admitting an inverse

$$\varphi^{-1} : H \to G$$

which is also a group homomorphism. Taking our cue from Set, if a homomorphism
of groups is an isomorphism, then it must in particular be a bijection between the
underlying sets. Luckily, the converse also holds:

Proposition 4.3. Let $\varphi : G \to H$ be a group homomorphism. Then $\varphi$ is an
isomorphism of groups if and only if it is a bijection.

Proof. One implication is immediate, as pointed out above. For the other implication,
assume $\varphi : G \to H$ is a bijective group homomorphism. As a bijection, $\varphi$
has an inverse in Set:

$$\varphi^{-1} : H \to G;$$

we simply need to check that this is a group homomorphism. Let $h_1$, $h_2$ be elements
of $H$, and let $g_1 = \varphi^{-1}(h_1)$, $g_2 = \varphi^{-1}(h_2)$ be the corresponding elements of $G$. Then

$$\varphi^{-1}(h_1 \cdot h_2) = \varphi^{-1}(\varphi(g_1) \cdot \varphi(g_2)) = \varphi^{-1}(\varphi(g_1 \cdot g_2)) = g_1 \cdot g_2 = \varphi^{-1}(h_1) \cdot \varphi^{-1}(h_2)$$

as needed. □

Example 4.4. The function $D_6 \to S_3$ defined in §2.2 is an isomorphism of groups,
since it is a bijective group homomorphism. So is the exponential function $(\mathbb{R}, +) \to
(\mathbb{R}^{>0}, \cdot)$ mentioned in §4.1. If the exponential function $\epsilon_g : \mathbb{Z} \to G$ determined by
an element $g \in G$ (as in §4.1) is an isomorphism, we say that $G$ is an ‘infinite cyclic’
group.

The function $\pi_2^6 \times \pi_3^6 : C_6 \to C_2 \times C_3$ studied (‘additively’) in §4.1 is an
isomorphism. ⌟

Definition 4.5. Two groups $G$, $H$ are isomorphic if they are isomorphic in Grp
in the sense of §I.4.1, that is (by Proposition 4.3), if there is a bijective group
homomorphism $G \to H$. ⌟

We have observed once and for all in §I.4.1 that ‘isomorphic’ is automatically
an equivalence relation. We write $G \cong H$ if $G$ and $H$ are isomorphic.

Automorphisms of a group $G$ are isomorphisms $G \to G$; these form a group
$\mathrm{Aut}_{\mathsf{Grp}}(G)$ (cf. §I.4.1), usually denoted $\mathrm{Aut}(G)$.

Example 4.6. We have introduced our template of cyclic groups in §2.3. The
notion of isomorphism allows us to give a formal definition:

Definition 4.7. A group $G$ is cyclic if it is isomorphic to $\mathbb{Z}$ or to $C_n = \mathbb{Z}/n\mathbb{Z}$ for
some$^{19}$ $n$. ⌟

Thus, $C_2 \times C_3$ is cyclic, of order 6, since $C_2 \times C_3 \cong C_6$. More generally
(Exercise 4.9) $C_m \times C_n$ is cyclic if $\gcd(m, n) = 1$.

The reader will check easily (Exercise 4.3) that a group of order n is cyclic if 
and only if it contains an element of order n. 

There is a somewhat surprising source of cyclic groups: if p is prime, the 
group ((Z/pZ)*, •) is cyclic. We will prove a more general statement when we have 
accumulated more machinery (Theorem IV.6.10), but the adventurous reader can 
already enjoy a proof by working out Exercise 4.11. This is a relatively deep fact; 
note that, for example, (Z/12Z)* is not cyclic (cf. Exercise 2.19 and Exercise 4.10). 
The fact that (Z/pZ)* is cyclic for p prime means that there must be integers a 
such that every nonmultiple of p is congruent to a power of a; the usual proofs of 
this fact are not constructive, that is, they do not explicitly produce an integer with 
this property. There is a very pretty connection between the order of an element of 
the cyclic group (Z/pZ)* and the so-called ‘cyclotomic polynomials’; but that will 
have to wait for a little field theory (cf. Exercise VII.5.15). 

As we have seen, the groups $D_6$ and $S_3$ are isomorphic. Are $C_6$ and $S_3$ isomorphic?
There are 46,656 functions between the sets $C_6$ and $S_3$, of which 720 are
bijections and 120 are bijections preserving the identity. The reader is welcome to
list all 120 and attempt to verify by hand if any of them is a homomorphism. But
maybe there is a better strategy to answer such questions... ⌟

Isomorphic objects of a category are essentially indistinguishable in that cate- 
gory. Thus, isomorphic groups share every group-theoretic structure. In particular, 

19 This includes the possibility that $n = 1$, that is, trivial groups are cyclic.

Proposition 4.8. Let $\varphi : G \to H$ be an isomorphism.

• $(\forall g \in G) : |\varphi(g)| = |g|$;

• $G$ is commutative if and only if $H$ is commutative.

Proof. The first assertion follows from Proposition 4.1: the order of $\varphi(g)$ divides
the order of $g$, and on the other hand the order of $g = \varphi^{-1}(\varphi(g))$ must divide the
order of $\varphi(g)$; thus the two orders must be equal.

The proof of the second assertion is left to the reader. □

Further instances of this principle will be assumed without explicit mention. 

Example 4.9. $C_6 \not\cong S_3$, since one is commutative and the other is not. Here is
another reason: in $C_6$ there is 1 element of order one, 1 of order two, 2 of order
three, and 2 of order six; in $S_3$ the situation is different: 1 element of order one, 3
of order two, 2 of order three. Thus, none of the 120 bijections $C_6 \to S_3$ preserving
the identity is a group homomorphism.

Note: Two finite commutative groups are isomorphic if and only if they have
the same number of elements of any given order, but we are not yet in a position
to prove this; the reader will verify this fact in due time (Exercise IV.6.13). The
commutativity hypothesis is necessary: there do exist pairs of nonisomorphic finite
groups with the same number of elements of any given order (same exercise). ⌟
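
The two order counts in Example 4.9 are easy to reproduce. In the Python sketch below,
offered only as an illustration, $C_6$ is modeled as $\mathbb{Z}/6\mathbb{Z}$ under addition and $S_3$ as the
permutations of $(0, 1, 2)$ written as tuples; the helper names are hypothetical.

```python
from collections import Counter
from itertools import permutations


def order_in_cyclic(a: int, n: int) -> int:
    """Order of [a]_n in the additive group Z/nZ."""
    k, total = 1, a % n
    while total != 0:
        total = (total + a) % n  # add a once more
        k += 1
    return k


def order_of_permutation(p: tuple) -> int:
    """Order of a permutation p of (0, ..., len(p) - 1) under composition."""
    identity = tuple(range(len(p)))
    k, power = 1, p
    while power != identity:
        power = tuple(p[i] for i in power)  # compose with p once more
        k += 1
    return k


# Each Counter maps an order to the number of elements having that order.
orders_C6 = Counter(order_in_cyclic(a, 6) for a in range(6))
orders_S3 = Counter(order_of_permutation(p) for p in permutations(range(3)))

print(sorted(orders_C6.items()))
print(sorted(orders_S3.items()))
```

Since an isomorphism preserves the order of every element (Proposition 4.8), the
mismatch between the two tallies is enough to distinguish the groups.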

4.4. Homomorphisms of abelian groups. We have already mentioned that Ab
is in some ways ‘better behaved’ than Grp, and we are ready to highlight another
instance of this observation. As we have seen, $\mathrm{Hom}_{\mathsf{Grp}}(G, H)$ is a pointed set for
any two groups $G$, $H$. In Ab, we can say much more: $\mathrm{Hom}_{\mathsf{Ab}}(G, H)$ is a group (in fact,
an abelian group) for any two abelian groups $G$, $H$.

The operation in $\mathrm{Hom}_{\mathsf{Ab}}(G, H)$ is ‘inherited’ from the operation in $H$: if $\varphi, \psi :
G \to H$ are two group homomorphisms, let $\varphi + \psi$ be the function defined by

$$(\forall a \in G) : \quad (\varphi + \psi)(a) := \varphi(a) + \psi(a).$$

Is $\varphi + \psi$ a group homomorphism? Yes, because $\forall a, b \in G$

$$(\varphi + \psi)(a + b) = \varphi(a + b) + \psi(a + b) = (\varphi(a) + \varphi(b)) + (\psi(a) + \psi(b))$$

$$\overset{!}{=} (\varphi(a) + \psi(a)) + (\varphi(b) + \psi(b)) = (\varphi + \psi)(a) + (\varphi + \psi)(b).$$

Note that the equality marked by ! uses crucially the fact that $H$ is commutative.

With this operation, $\mathrm{Hom}_{\mathsf{Ab}}(G, H)$ is clearly a group: the associativity of $+$ is
inherited from that of the operation in $H$; the trivial homomorphism is the identity
element, and the inverse$^{20}$ of $\varphi : G \to H$ is defined (not surprisingly) by

$$(\forall a \in G) : \quad (-\varphi)(a) = -\varphi(a).$$

In fact, note that these conclusions may be drawn as soon as $H$ is commutative:
$\mathrm{Hom}_{\mathsf{Grp}}(G, H)$ is a group if $H$ is commutative (even if $G$ is not). In fact, if $H$ is
a commutative group, then $H^A = \mathrm{Hom}_{\mathsf{Set}}(A, H)$ is a group for all sets $A$; we will
come back to this group in §5.4.

20 Unfortunate clash of terminology! We mean the ‘inverse’ as in ‘group inverse’, not as a
possible function $H \to G$.
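
As a small sanity check of the pointwise operation, the Python sketch below lists the
homomorphisms $\mathbb{Z}/6\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}$ and verifies that they are closed under pointwise sum and
negation. The choice of these two groups and the helper names `homs`, `add`, `neg` are
assumptions made for the illustration; a function is stored as its tuple of values.

```python
from itertools import product


def homs(n: int, m: int) -> list:
    """All homomorphisms Z/nZ -> Z/mZ, as tuples (f(0), ..., f(n-1))."""
    return [f for f in product(range(m), repeat=n)
            if all(f[(a + b) % n] == (f[a] + f[b]) % m
                   for a in range(n) for b in range(n))]


def add(f: tuple, g: tuple, m: int) -> tuple:
    """Pointwise sum: (f + g)(a) = f(a) + g(a) in Z/mZ."""
    return tuple((x + y) % m for x, y in zip(f, g))


def neg(f: tuple, m: int) -> tuple:
    """Pointwise negative: (-f)(a) = -f(a) in Z/mZ."""
    return tuple((-x) % m for x in f)


H = homs(6, 4)
closed = all(add(f, g, 4) in H for f in H for g in H)
has_inverses = all(neg(f, 4) in H for f in H)

print(H)
print(closed, has_inverses)
```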


Exercises 


4.1. ▷ Check that the function $\pi_m^n$ defined in §4.1 is well-defined and makes the
diagram commute. Verify that it is a group homomorphism. Why is the hypothesis
$m \mid n$ necessary? [§4.1]

4.2. Show that the homomorphism $\pi_2^4 \times \pi_2^4 : C_4 \to C_2 \times C_2$ is not an isomorphism.
In fact, is there any nontrivial isomorphism $C_4 \to C_2 \times C_2$?

4.3. ▷ Prove that a group of order $n$ is isomorphic to $\mathbb{Z}/n\mathbb{Z}$ if and only if it contains
an element of order $n$. [§4.3]

4.4. Prove that no two of the groups (Z, +), (Q, +), (R, +) are isomorphic to one 
another. Can you decide whether (R, +), (C, +) are isomorphic to one another? 
(Cf. Exercise VI.1.1.) 

4.5. Prove that the groups $(\mathbb{R} \smallsetminus \{0\}, \cdot)$ and $(\mathbb{C} \smallsetminus \{0\}, \cdot)$ are not isomorphic.

4.6. We have seen that $(\mathbb{R}, +)$ and $(\mathbb{R}^{>0}, \cdot)$ are isomorphic (Example 4.4). Are the
groups $(\mathbb{Q}, +)$ and $(\mathbb{Q}^{>0}, \cdot)$ isomorphic?

4.7. Let $G$ be a group. Prove that the function $G \to G$ defined by $g \mapsto g^{-1}$ is a
homomorphism if and only if $G$ is abelian. Prove that $g \mapsto g^2$ is a homomorphism
if and only if $G$ is abelian.

4.8. ¬ Let $G$ be a group, and let $g \in G$. Prove that the function $\gamma_g : G \to G$ defined
by $(\forall a \in G) : \gamma_g(a) = gag^{-1}$ is an automorphism of $G$. (The automorphisms $\gamma_g$ are
called ‘inner’ automorphisms of $G$.) Prove that the function $G \to \mathrm{Aut}(G)$ defined
by $g \mapsto \gamma_g$ is a homomorphism. Prove that this homomorphism is trivial if and
only if $G$ is abelian. [6.7, 7.11, IV.1.5]

4.9. ▷ Prove that if $m$, $n$ are positive integers such that $\gcd(m, n) = 1$, then
$C_{mn} \cong C_m \times C_n$. [§4.3, 4.10, §IV.6.1, V.6.8]

4.10. ▷ Let $p \neq q$ be odd prime integers; show that $(\mathbb{Z}/pq\mathbb{Z})^*$ is not cyclic. (Hint:
Use Exercise 4.9 to compute the order $N$ of $(\mathbb{Z}/pq\mathbb{Z})^*$, and show that no element
can have order $N$.) [§4.3]

4.11. ▷ In due time we will prove the easy fact that if $p$ is a prime integer, then
the equation $x^d = 1$ can have at most $d$ solutions in $\mathbb{Z}/p\mathbb{Z}$. Assume this fact, and
prove that the multiplicative group $G = (\mathbb{Z}/p\mathbb{Z})^*$ is cyclic. (Hint: Let $g \in G$ be an
element of maximal order; use Exercise 1.15 to show that $h^{|g|} = 1$ for all $h \in G$.
Therefore....) [§4.3, 4.15, 4.16, §IV.6.3]

4.12. ¬ • Compute the order of $[9]_{31}$ in the group $(\mathbb{Z}/31\mathbb{Z})^*$.

• Does the equation $x^3 - 9 = 0$ have solutions in $\mathbb{Z}/31\mathbb{Z}$? (Hint: Plugging in
all 31 elements of $\mathbb{Z}/31\mathbb{Z}$ is too laborious and will not teach you much. Instead,
use the result of the first part: if $c$ is a solution of the equation, what can you say
about $|c|$?) [VII.5.15]

4.13. ¬ Prove that $\mathrm{Aut}_{\mathsf{Grp}}(\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}) \cong S_3$. [IV.5.14]

4.14. ▷ Prove that the order of the group of automorphisms of a cyclic group $C_n$ is
the number of positive integers $r \le n$ that are relatively prime to $n$. (This is called
Euler’s $\phi$-function; cf. Exercise 6.14.) [§IV.1.4, IV.1.22, §IV.2.5]

4.15. ¬ Compute the group of automorphisms of $(\mathbb{Z}, +)$. Prove that if $p$ is prime,
then $\mathrm{Aut}_{\mathsf{Grp}}(C_p) \cong C_{p-1}$. (Use Exercise 4.11.) [IV.5.12]

4.16. ¬ Prove Wilson’s theorem: a positive integer $p$ is prime if and only if

$$(p - 1)! \equiv -1 \mod p.$$

(For one direction, use Exercises 1.8 and 4.11. For the other, assume $d$ is a proper
divisor of $p$, and note that $d$ divides $(p - 1)!$; therefore ...) [IV.4.11]

4.17. For a few small (but not too small) primes p, find a generator of (Z/pZ)*. 

4.18. Prove the second part of Proposition 4.8. 


5. Free groups 

5.1. Motivation. Having become more familiar with homomorphisms, we can
now contemplate one fancier example of a group. The motivation underlying this
new construction may be summarized as follows: given a set $A$, whose elements have
no special ‘group-theoretic’ property, we want to construct a group $F(A)$ containing
$A$ ‘in the most efficient way’.

For example, if $A = \emptyset$, then a trivial group will do. If $A = \{a\}$ is a singleton,
then a trivial group will not do: because although a trivial group $\{a\}$ would itself
be a singleton, that one element $a$ in it would have to be the identity, and that is
certainly a very special group-theoretic property. Instead, we construct an infinite
cyclic group $\langle a \rangle$ whose elements are ‘formal powers’ $a^n$, $n \in \mathbb{Z}$, and we identify $a$
with the power $a^1$:

$$\langle a \rangle := \{\cdots, a^{-2}, a^{-1}, a^0 = e, a^1 = a, a^2, a^3, \cdots\};$$

we take all these powers to be distinct and define multiplication in the evident
way, so that the exponential map

$$\epsilon_a : \mathbb{Z} \to \langle a \rangle, \quad \epsilon_a(n) := a^n$$

is an isomorphism. The fact that ‘all powers are distinct’ is the formal way to
implement the fact that there is nothing special about $a$: in the group $F(\{a\}) = \langle a \rangle$,
$a$ obeys no condition other than the inevitable $a^0 = e$.

Summarizing: if A is a singleton, then we may take F(A) to be an infinite 
cyclic group. 

The task is to formalize the heuristic motivation given above and construct a
group $F(A)$ for every set $A$. As we often do, we will now ask the reader to put
away this book and to try to figure out on his or her own what this may mean and
how it may be accomplished.

5.2. Universal property. Hoping that the reader has now acquired an individual
viewpoint on the issue, here is the standard answer: the heuristic motivation is
formalized by means of a suitable universal property. Given a set $A$, our group
$F(A)$ will have to ‘contain’ $A$; therefore it is natural to consider the category $\mathscr{F}^A$
whose objects are pairs $(j, G)$, where $G$ is a group and

$$j : A \to G$$

is a set-function$^{21}$ from $A$ to $G$ and morphisms

$$(j_1, G_1) \to (j_2, G_2)$$

are commutative diagrams of set-functions

$$\begin{array}{ccc}
G_1 & \xrightarrow{\;\varphi\;} & G_2 \\
{\scriptstyle j_1} \nwarrow & & \nearrow {\scriptstyle j_2} \\
 & A &
\end{array}$$

in which $\varphi$ is required to be a group homomorphism.

The reader will be reminded of the categories we considered in Example I.3.7:
the only difference here is that we are mixing objects and morphisms of one category
(that is, Grp) with objects and morphisms of another (related) category (that
is, Set). The fact that we are considering all possible functions $A \to G$ is a way to
implement the fact that we have no a priori group-theoretic information about $A$:
we do not want to put any restriction on what may happen to the elements of $A$
once they are mapped to a group $G$; hence we consider all possibilities at once.

A free group $F(A)$ on $A$ will be (the group component of) an initial object
in $\mathscr{F}^A$. This choice implements the fact that $A$ should map to $F(A)$ in the ‘most
efficient way’: any other way to map $A$ to a group can be reconstructed from
this one, by composing with a group homomorphism. In the language of universal
properties, we can state this as follows: $F(A)$ is a free group on the set $A$ if there is a
set-function $j : A \to F(A)$ such that, for all groups $G$ and set-functions $f : A \to G$,
there exists a unique group homomorphism $\varphi : F(A) \to G$ such that the diagram

$$\begin{array}{ccc}
F(A) & \xrightarrow{\;\varphi\;} & G \\
{\scriptstyle j} \uparrow & \nearrow {\scriptstyle f} & \\
A & &
\end{array}$$

commutes. By general nonsense (Proposition I.5.4), this universal property defines
$F(A)$ up to isomorphism, if this group exists. But does $F(A)$ exist?

Before giving a ‘concrete’ construction of $F(A)$, let’s check that if $A = \{a\}$ is
a singleton, then $F(A) \cong \mathbb{Z}$, as proposed in §5.1. The function $j : A \to \mathbb{Z}$ will send
$a$ to $1 \in \mathbb{Z}$. For any group $G$, giving a set-function $f : A \to G$ amounts to choosing
one element $g = f(a) \in G$. Now, if $g \in G$, then there is a unique homomorphism
$\varphi : \mathbb{Z} \to G$ making the diagram

$$\begin{array}{ccc}
\mathbb{Z} & \xrightarrow{\;\varphi\;} & G \\
{\scriptstyle j} \uparrow & \nearrow {\scriptstyle f} & \\
\{a\} & &
\end{array}$$

commute: because this forces $\varphi(1) = \varphi \circ j(a) = f(a) = g$, and then the
homomorphism condition forces $\varphi(n) = g^n$. That is, $\varphi$ is necessarily the exponential
map $\epsilon_g$ considered in §4.1. Therefore, infinite cyclic groups do satisfy the universal
property for free groups over a singleton.

21 We could assume that $j$ is injective, identifying $A$ with a subset of $G$; the construction
would be completely analogous, and the resulting group would be the same. However, considering
arbitrary functions leads to a stronger, more useful, universal property.

5.3. Concrete construction. As we know, terminal objects of a category need 
not exist. So we have to convince the reader that free groups F(A) exist, for every 
set A. 

Given any set $A$, we are going to think of $A$ as an ‘alphabet’ and construct
‘words’ whose letters are elements of $A$ or ‘inverses’ of elements of $A$. To formalize
this, consider a set $A'$ isomorphic to $A$ and disjoint from it; call $a^{-1}$ the element
in $A'$ corresponding to $a \in A$. A word on the set $A$ is an ordered list

$$(a_1, a_2, \cdots, a_n),$$

which we denote by the juxtaposition

$$w = a_1 a_2 \cdots a_n,$$

where each ‘letter’ $a_i$ is either an element $a \in A$ or an element $a^{-1} \in A'$. We will
denote the set of words on $A$ by $W(A)$; the number $n$ of letters is the ‘length’ of $w$;
we include in $W(A)$ the ‘empty word’ $w = (\,)$, consisting of no letters.

For example, if $A = \{a\}$ is a singleton, then an element of $W(A)$ may look like

$$a^{-1}a^{-1}aaaa^{-1}aa^{-1}.$$

An element of $W(\{x, y\})$ may look like

$$xxx^{-1}yy^{-1}xxy^{-1}x^{-1}yy^{-1}xy^{-1}x.$$

Now the notation we have chosen hints that elements in $W(A)$ may be redundant:
for example,

$$xyy^{-1}x \quad \text{and} \quad xx$$

are distinct words, but they ought to end up being the same element of a group
having to do with words. Therefore, we want to have a process of ‘reduction’ which
takes a word and cleans it up by performing all cancellations. Note that we have
to do this ‘by hand’, since we have not come close yet to defining an operation or
making formal sense of considering $a^{-1}$ to be the ‘inverse’ of $a$.

Describing the reduction process is invariably awkward: it is a completely evident
procedure, but writing it down precisely and elegantly is a challenge. We will
settle for the following.

• Define an ‘elementary’ reduction $r : W(A) \to W(A)$: given $w \in W(A)$, search
for the first occurrence (from left to right) of a pair $aa^{-1}$ or $a^{-1}a$, and let
$r(w)$ be the word obtained by removing such a pair. In the two examples
given above,

$$r(a^{-1}a^{-1}aaaa^{-1}aa^{-1}) = a^{-1}aaa^{-1}aa^{-1},$$

$$r(xxx^{-1}yy^{-1}xxy^{-1}x^{-1}yy^{-1}xy^{-1}x) = xyy^{-1}xxy^{-1}x^{-1}yy^{-1}xy^{-1}x.$$

• Note that $r(w) = w$ precisely when ‘no cancellation is possible’. We say that
$w$ is a ‘reduced word’ in this case.

Lemma 5.1. If $w \in W(A)$ has length $n$, then $r^{\lfloor n/2 \rfloor}(w)$ is a reduced word.$^{22}$

Proof. Indeed, either $r(w) = w$ or the length of $r(w)$ is less than the length of
$w$; but one cannot decrease the length of $w$ more than $n/2$ times, since each
non-identity application of $r$ decreases the length by two. □

• Now define the ‘reduction’ $R : W(A) \to W(A)$ by setting $R(w) = r^{\lfloor n/2 \rfloor}(w)$,
where $n$ is the length of $w$. By the lemma, $R(w)$ is always a reduced word.
For example, $R(a^{-1}a^{-1}aaaa^{-1}aa^{-1})$ is the empty word, since

$$r^4(a^{-1}a^{-1}aaaa^{-1}aa^{-1}) = r^3(a^{-1}aaa^{-1}aa^{-1}) = r^2(aa^{-1}aa^{-1}) = r(aa^{-1}) = (\,);$$

and $R(xxx^{-1}yy^{-1}xxy^{-1}x^{-1}yy^{-1}xy^{-1}x) = xxxy^{-1}y^{-1}x$, as the reader may
check.

Let F(A) be the set of reduced words on A, that is, the image of the reduction map 
R we have just defined. 
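
The elementary reduction $r$ and the reduction $R$ translate directly into code. The
Python sketch below is one possible encoding, not part of the text's construction: a
letter is a pair `(symbol, exponent)` with exponent $+1$ or $-1$, a word is a tuple of
letters, and the helper `parse` (hypothetical) reads an uppercase character as the
inverse of the corresponding lowercase letter.

```python
def elementary_reduction(word: tuple) -> tuple:
    """r: remove the leftmost adjacent pair a a^-1 or a^-1 a, if there is one."""
    for i in range(len(word) - 1):
        (s, e), (t, f) = word[i], word[i + 1]
        if s == t and e == -f:
            return word[:i] + word[i + 2:]
    return word  # no cancellation is possible: the word is reduced


def reduce_word(word: tuple) -> tuple:
    """R: apply r exactly floor(n/2) times, where n is the length of the word."""
    for _ in range(len(word) // 2):
        word = elementary_reduction(word)
    return word


def parse(text: str) -> tuple:
    """Hypothetical shorthand: 'xY' stands for the word x y^-1."""
    return tuple((c.lower(), -1 if c.isupper() else 1) for c in text)


print(reduce_word(parse("AAaaaAaA")))        # first example word of the text
print(reduce_word(parse("xxXyYxxYXyYxYx")))  # second example word of the text
```

With this encoding the first word reduces to the empty tuple and the second to the
encoding of $xxxy^{-1}y^{-1}x$, matching the computations above.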

We are ready to (finally) define free groups ‘concretely’. Define a binary operation
on $F(A)$ by juxtaposition & reduction: for reduced words $w$, $w'$, define $w \cdot w'$
as the reduction of the juxtaposition of $w$ and $w'$,

$$w \cdot w' := R(ww').$$

It is essentially evident that $F(A)$ is a group under this operation:

• The operation is associative.

• The empty word $e = (\,)$ is the identity in $F(A)$, since $ew = we = w$ (no
reduction is necessary).

• If $w$ is a reduced word, the inverse of $w$ is obtained by reversing the order of
the letters of $w$ and replacing each $a \in A$ by $a^{-1} \in A'$ and each $a^{-1}$ by $a$.

The most cumbersome of these statements to prove formally is associativity; it
follows easily from (for example) Exercise 5.4.
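
The operation and the inverse just described can be sketched in Python as follows.
This is an illustration under stated assumptions: letters are pairs `(symbol, exponent)`
with exponent $\pm 1$, words are tuples of letters, and the reduction is carried out with
a stack rather than by iterating left-to-right elementary reductions, which relies on
the order-independence asserted in Exercise 5.4. All function names are hypothetical.

```python
def reduce_word(word: tuple) -> tuple:
    """Cancel adjacent pairs a a^-1 and a^-1 a until none is left."""
    stack = []
    for symbol, exponent in word:
        if stack and stack[-1] == (symbol, -exponent):
            stack.pop()  # the new letter cancels the previous one
        else:
            stack.append((symbol, exponent))
    return tuple(stack)


def multiply(w1: tuple, w2: tuple) -> tuple:
    """Product in F(A): juxtapose, then reduce."""
    return reduce_word(w1 + w2)


def inverse(w: tuple) -> tuple:
    """Reverse the letters and flip every exponent."""
    return tuple((symbol, -exponent) for symbol, exponent in reversed(w))


u = (("x", 1), ("y", -1))   # x y^-1
v = (("y", 1), ("x", 1))    # y x
w = (("x", -1), ("x", -1))  # x^-1 x^-1

print(multiply(u, v))                       # x x
print(multiply(u, inverse(u)))              # the empty word
print(multiply(multiply(u, v), w) == multiply(u, multiply(v, w)))
```

The last line tests associativity on one triple only; it illustrates the statement and
does not replace the argument of Exercise 5.4.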

There is a function $j : A \to F(A)$, defined by sending the element $a \in A$ to the
word consisting of the single ‘letter’ $a$.

Proposition 5.2. The pair $(j, F(A))$ satisfies the universal property for free groups
on $A$.

22 $\lfloor q \rfloor$ denotes the largest integer $\le q$.

Proof. This is also essentially evident, once one has absorbed all the notation.
Any function $f : A \to G$ to a group extends uniquely to a map $\varphi : F(A) \to
G$, determined by the homomorphism condition and by the requirement that the
diagram commutes, which fixes its value on one-letter words $a \in A$ (as well as on
$a^{-1} \in A'$).

To check more formally that $\varphi$ exists as a homomorphism, one can proceed as
follows. If $f : A \to G$ is any function, we can extend $f$ to a set-function

$$\tilde\varphi : W(A) \to G$$

by insisting that on one-letter words $a$ or $a^{-1}$ (for $a \in A$),

$$\tilde\varphi(a) = f(a), \quad \tilde\varphi(a^{-1}) = f(a)^{-1},$$

and that $\tilde\varphi$ is compatible with juxtaposition:

$$\tilde\varphi(ww') = \tilde\varphi(w)\tilde\varphi(w')$$

for any two words $w$, $w'$. The key point now is that reduction is invisible for $\tilde\varphi$:

$$\tilde\varphi(R(w)) = \tilde\varphi(w),$$

since this is clearly the case for elementary reductions; therefore, since $\varphi : F(A) \to
G$ agrees with $\tilde\varphi$ on reduced words, we have for $w, w' \in F(A)$

$$\varphi(w \cdot w') = \tilde\varphi(w \cdot w') = \tilde\varphi(R(ww')) = \tilde\varphi(ww') = \tilde\varphi(w)\tilde\varphi(w') = \varphi(w)\varphi(w') :$$

that is, $\varphi$ is a homomorphism, as needed. □
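
The extension used in the proof can be tried out on a concrete target group. In the
Python sketch below, an illustration only, the target is $S_3$ with permutations of
$(0, 1, 2)$ written as tuples, the set-function `f` on $\{x, y\}$ is an arbitrary choice, and
the reduction is stack-based (relying on the order-independence of Exercise 5.4). The
function `phi_tilde` plays the role of the extension to all words.

```python
def compose(p: tuple, q: tuple) -> tuple:
    """Composition of permutations given as tuples: i -> p[q[i]]."""
    return tuple(p[i] for i in q)


def invert(p: tuple) -> tuple:
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def reduce_word(word: tuple) -> tuple:
    """Cancel adjacent inverse pairs, using a stack."""
    stack = []
    for symbol, exponent in word:
        if stack and stack[-1] == (symbol, -exponent):
            stack.pop()
        else:
            stack.append((symbol, exponent))
    return tuple(stack)


def phi_tilde(f: dict, word: tuple) -> tuple:
    """Multiply the images of the letters of a word, in order, inside S_3."""
    result = (0, 1, 2)  # identity permutation
    for symbol, exponent in word:
        g = f[symbol] if exponent == 1 else invert(f[symbol])
        result = compose(result, g)
    return result


f = {"x": (1, 0, 2), "y": (1, 2, 0)}            # an arbitrary function {x, y} -> S_3
w1 = (("x", 1), ("y", 1), ("y", -1), ("x", 1))  # x y y^-1 x
w2 = (("x", -1), ("y", 1))                      # x^-1 y

# Reduction is invisible to the extension ...
invisible = phi_tilde(f, reduce_word(w1)) == phi_tilde(f, w1)
# ... hence juxtaposition followed by reduction is sent to the product in S_3.
multiplicative = (phi_tilde(f, reduce_word(w1 + w2))
                  == compose(phi_tilde(f, w1), phi_tilde(f, w2)))
print(invisible, multiplicative)
```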

Example 5.3. It is easy to ‘visualize’ $F(\{a\}) \cong \mathbb{Z}$; but it is already somewhat
challenging for the free group on two generators, $F(\{x, y\})$. The best we can do is
the following: behold the infinite graph$^{23}$

[Figure: an infinite tree in which four edges meet at every vertex]

obtained by starting at a point (the center of the picture), then branching out in
four directions by a length of 1, then branching out similarly by a length of 1/2,
then by 1/4, then by 1/8, then... (and we stopped there, to avoid cluttering the
picture too much). Then every element of $F(\{x, y\})$ corresponds in a rather natural
way to exactly one dot in this diagram. Indeed, we can place the empty word at
the center; and we can agree that every $x$ in a word takes us one step to the right,
every $x^{-1}$ to the left, every $y$ up, and every $y^{-1}$ down. For example, the word
$yx^{-1}yx$ takes us here:

[Figure: the same graph, with the vertex reached by the word $yx^{-1}yx$ marked]

The reader will surely encounter this group elsewhere: it is the fundamental
group of the ‘figure 8’. ⌟

23 This is an example of the Cayley graph of a group (cf. Exercise 8.6): a graph whose vertices
correspond to the elements of the group and whose edges connect vertices according to the action
of generators.

5.4. Free abelian groups. We can pose in Ab the same question answered above
for Grp: that is, ask for the abelian group $F^{ab}(A)$ which most efficiently contains
the set $A$, provided that we do not have any additional information on the elements
of $A$. Of course we do know something about the elements of $A$ this time: they
will have to commute with each other in $F^{ab}(A)$. This plays no role if $A = \{a\}$ is
a singleton, and therefore $F^{ab}(\{a\}) = F(\{a\}) \cong \mathbb{Z}$; but the requirement is different
for larger sets, so we should expect a different answer in general.

The formalization of the heuristic requirement is precisely the same universal
property that gave us free groups, but (of course) stated in Ab: $F^{ab}(A)$ is a free
abelian group on the set $A$ if there is a set-function $j : A \to F^{ab}(A)$ such that,
for all abelian groups $G$ and set-functions $f : A \to G$, there exists a unique group
homomorphism $\varphi : F^{ab}(A) \to G$ such that the following diagram commutes:

$$\begin{array}{ccc}
F^{ab}(A) & \xrightarrow{\;\varphi\;} & G \\
{\scriptstyle j} \uparrow & \nearrow {\scriptstyle f} & \\
A & &
\end{array}$$

Again, Proposition I.5.4 guarantees that $F^{ab}(A)$ is unique up to isomorphism,
if it exists; but we have to prove it exists! This is in some way simpler than for
Grp, in the sense that $F^{ab}(A)$ is easier to understand, at least for finite sets $A$.

To fix ideas, we will first describe the answer for a finite set, say $A = \{1, \cdots, n\}$.
We will denote by $\mathbb{Z}^{\oplus n}$ the direct sum

$$\underbrace{\mathbb{Z} \oplus \cdots \oplus \mathbb{Z}}_{n \text{ times}};$$

recall (§3.5) that this group ‘is the same as’ the product$^{24}$ $\mathbb{Z}^n$ (but we view it as a
coproduct). There is a function $j : A \to \mathbb{Z}^{\oplus n}$, defined by

$$j(i) := (0, \cdots, 0, \underset{i\text{-th place}}{1}, 0, \cdots, 0) \in \mathbb{Z}^{\oplus n}.$$

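Claim 5.4. For $A = \{1, \cdots, n\}$, $\mathbb{Z}^{\oplus n}$ is a free abelian group on $A$.

Proof. Note that every element of $\mathbb{Z}^{\oplus n}$ can be written uniquely in the form
$\sum_{i=1}^n m_i\, j(i)$: indeed,

$$(m_1, \cdots, m_n) = (m_1, 0, \cdots, 0) + (0, m_2, 0, \cdots, 0) + \cdots + (0, \cdots, 0, m_n)$$

$$= m_1 (1, 0, \cdots, 0) + m_2 (0, 1, 0, \cdots, 0) + \cdots + m_n (0, \cdots, 0, 1)$$

$$= m_1\, j(1) + \cdots + m_n\, j(n),$$

and $(m_1, \cdots, m_n) = (0, \cdots, 0)$ if and only if all $m_i$ are 0.

Now let $f : A \to G$ be any function from $A = \{1, \cdots, n\}$ to an abelian group $G$.
We define $\varphi : \mathbb{Z}^{\oplus n} \to G$ by

$$\varphi\Big(\sum_{i=1}^n m_i\, j(i)\Big) := \sum_{i=1}^n m_i\, f(i) :$$

indeed, we have no choice: this definition is forced by the needed commutativity
of the diagram

$$\begin{array}{ccc}
\mathbb{Z}^{\oplus n} & \xrightarrow{\;\varphi\;} & G \\
{\scriptstyle j} \uparrow & \nearrow {\scriptstyle f} & \\
A & &
\end{array}$$

and by the homomorphism condition. Thus $\varphi$ is certainly uniquely determined, and
we just have to check that it is a homomorphism. This is where the commutativity
of $G$ enters:

$$\varphi\Big(\sum_{i=1}^n m'_i\, j(i)\Big) + \varphi\Big(\sum_{i=1}^n m''_i\, j(i)\Big) = \sum_{i=1}^n m'_i\, f(i) + \sum_{i=1}^n m''_i\, f(i) = \sum_{i=1}^n (m'_i + m''_i)\, f(i)$$

because $G$ is commutative,

$$= \varphi\Big(\sum_{i=1}^n (m'_i + m''_i)\, j(i)\Big) = \varphi\Big(\sum_{i=1}^n m'_i\, j(i) + \sum_{i=1}^n m''_i\, j(i)\Big)$$

as needed. □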
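
The forced formula for $\varphi$ is short enough to test directly. The Python sketch below
is an illustration under assumptions that are not in the text: the abelian group $G$ is
taken to be $\mathbb{Z}/12\mathbb{Z}$, the values of $f$ on $A = \{1, 2, 3\}$ are chosen arbitrarily, and
elements of $\mathbb{Z}^{\oplus 3}$ are stored as tuples of integers.

```python
MODULUS = 12        # assumption: the target group G is Z/12Z
F_VALUES = (3, 4, 10)  # assumption: f(1) = 3, f(2) = 4, f(3) = 10 in Z/12Z


def j(i: int, n: int = 3) -> tuple:
    """The i-th standard generator (0, ..., 0, 1, 0, ..., 0), for i = 1, ..., n."""
    return tuple(1 if k == i else 0 for k in range(1, n + 1))


def phi(m: tuple) -> int:
    """phi(m_1, ..., m_n) = m_1 f(1) + ... + m_n f(n), computed in Z/12Z."""
    return sum(mi * fi for mi, fi in zip(m, F_VALUES)) % MODULUS


def add(m1: tuple, m2: tuple) -> tuple:
    """Componentwise sum in the direct sum of n copies of Z."""
    return tuple(a + b for a, b in zip(m1, m2))


# The diagram commutes: phi(j(i)) = f(i).
print([phi(j(i)) for i in (1, 2, 3)])

# phi is additive on a sample pair of elements.
m1, m2 = (2, -1, 5), (-7, 3, 1)
print(phi(add(m1, m2)) == (phi(m1) + phi(m2)) % MODULUS)
```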


24 Indeed, it is common to denote this group by $\mathbb{Z}^n$, omitting the $\oplus$. No confusion is likely,
but we will try to distinguish the two to emphasize that they play different categorical roles.

Remark 5.5. A less hands-on, more high-brow argument can be given by contemplating
the universal property defining free abelian groups vis-a-vis the universal
property for coproducts; cf. Exercise 5.7. ⌟

Now for the general case: let $A$ be any set. As we have seen, $H^A = \mathrm{Hom}_{\mathsf{Set}}(A, H)$
has a natural abelian group structure if $H$ is an abelian group (§4.4); elements
of $H^A$ are arbitrary set-functions $\alpha : A \to H$. We can define a subset $H^{\oplus A}$ of $H^A$
as follows:

$$H^{\oplus A} := \{\alpha : A \to H \mid \alpha(a) \neq e_H \text{ for only finitely many elements } a \in A\}.$$

The operation in $H^A$ induces an operation in $H^{\oplus A}$, which makes $H^{\oplus A}$ into a
group$^{25}$.

The reader should note that $H^{\oplus A}$ is the whole of $H^A$ if $A$ is a finite set; and
that $\mathbb{Z}^{\oplus A} \cong \mathbb{Z}^{\oplus n}$ if $A = \{1, \cdots, n\}$: indeed, $(m_1, \cdots, m_n) \in \mathbb{Z}^{\oplus n}$ may be identified
with the function $\{1, \cdots, n\} \to \mathbb{Z}$ sending $i$ to $m_i$.

For $H = \mathbb{Z}$ there is a natural function $j : A \to \mathbb{Z}^{\oplus A}$, obtained by sending $a \in A$
to the function $j_a : A \to \mathbb{Z}$ defined by

$$j_a(x) := \begin{cases} 1 & \text{if } x = a, \\ 0 & \text{if } x \neq a. \end{cases}$$

Note that for $A = \{1, \cdots, n\}$ and identifying $\mathbb{Z}^{\oplus A} \cong \mathbb{Z}^{\oplus n}$, this function $j$ is the
same function denoted $j$ earlier.
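
Finitely supported functions $A \to \mathbb{Z}$ have a convenient machine representation even
when $A$ is infinite. The Python sketch below is one possible encoding, chosen for this
illustration: an element of $\mathbb{Z}^{\oplus A}$ is a dictionary recording only the nonzero values,
so that the empty dictionary is the identity. The function names are hypothetical.

```python
def j(a) -> dict:
    """The function j_a: value 1 at a and 0 everywhere else."""
    return {a: 1}


def add(alpha: dict, beta: dict) -> dict:
    """Pointwise sum of two finitely supported functions."""
    total = dict(alpha)
    for key, value in beta.items():
        total[key] = total.get(key, 0) + value
        if total[key] == 0:
            del total[key]  # keep only the nonzero values
    return total


def scale(m: int, alpha: dict) -> dict:
    """The multiple m * alpha."""
    return {key: m * value for key, value in alpha.items()} if m != 0 else {}


# The element 2 j(a) - 3 j(b), with A any set containing the symbols "a" and "b".
element = add(scale(2, j("a")), scale(-3, j("b")))
print(element)
print(add(j("a"), scale(-1, j("a"))))  # j(a) - j(a) is the zero function
```

The support of a sum is contained in the union of the two supports, which is why the
pointwise operation never leaves the set of finitely supported functions.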

Proposition 5.6. For every set $A$, $F^{ab}(A) \cong \mathbb{Z}^{\oplus A}$.

Proof. The key point is again that every element of $\mathbb{Z}^{\oplus A}$ may be written uniquely
as a finite sum

$$\sum_{a \in A} m_a\, j(a), \quad m_a \neq 0 \text{ for only finitely many } a;$$

once this is understood, the argument is precisely the same as for Claim 5.4. □

25 Thus $H^{\oplus A}$ is a subgroup of $H^A$; cf. §6.

Exercises

5.1. Does the category $\mathscr{F}^A$ defined in §5.2 have final objects? If so, what are they?

5.2. Since trivial groups $T$ are initial in Grp, one may be led to think that $(e, T)$
should be initial in $\mathscr{F}^A$, for every $A$: $e$ would be defined by sending every element
of $A$ to the (only) element in $T$; and for any other group $G$, there is a unique
homomorphism $T \to G$. Explain why $(e, T)$ is not initial in $\mathscr{F}^A$ (unless $A = \emptyset$).

5.3. ▷ Use the universal property of free groups to prove that the map $j : A \to F(A)$
is injective, for all sets $A$. (Hint: It suffices to show that for every two elements
$a$, $b$ of $A$ there is a group $G$ and a set-function $f : A \to G$ such that $f(a) \neq f(b)$.
Why? How do you construct $f$ and $G$?) [§III.6.3]

5.4. ▷ In the ‘concrete’ construction of free groups, one can try to reduce words
by performing cancellations in any order; the process of ‘elementary reductions’
used in the text (that is, from left to right) is only one possibility. Prove that the
result of iterating cancellations on a word is independent of the order in which the
cancellations are performed. Deduce the associativity of the product in $F(A)$ from
this. [§5.3]

5.5. Verify explicitly that $H^{\oplus A}$ is a group.

5.6. ▷ Prove that the group $F(\{x, y\})$ (visualized in Example 5.3) is a coproduct
$\mathbb{Z} * \mathbb{Z}$ of $\mathbb{Z}$ by itself in the category Grp. (Hint: With due care, the universal property
for one turns into the universal property for the other.) [§3.4, 3.7, 5.7]

5.7. ▷ Extend the result of Exercise 5.6 to free groups $F(\{x_1, \ldots, x_n\})$ and to free
abelian groups $F^{ab}(\{x_1, \ldots, x_n\})$. [§3.4, §5.4]

5.8. Still more generally, prove that $F(A \amalg B) = F(A) * F(B)$ and that $F^{ab}(A \amalg B) =
F^{ab}(A) \oplus F^{ab}(B)$ for all sets $A$, $B$. (That is, the constructions $F$, $F^{ab}$ ‘preserve
coproducts’.)

5.9. Let $G = \mathbb{Z}^{\oplus \mathbb{N}}$. Prove that $G \times G \cong G$.

5.10. -> Let F = F at (A). 

• Define an equivalence relation ~ on F by setting /' ~ / if and only if /— /' = 2g 
for some g € F. Prove that F/~ is a finite set if and only if A is finite, and in 
that case |F/~| = 2^. 

• Assume F at (B) = F ab (A). If A is finite, prove that B is also, and that A = B 
as sets. (This result holds for free gronps as well, and without any finiteness 
hypothesis. See Exercises 7.13 and VI. 1.20.) 

{7.4, 7.13) 
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6. Subgroups 

6.1. Definition. Let (G, •) be a group, and let ( H , •) be another group, whose 
underlying set H is a. subset of G. 

Definition 6.1. (ff, •) is a subgroup of G if the inclusion function i : H G is a 
group homomorphism. _i 

For example, the trivial group consisting of the single element ec is a subgroup 
of G. 

If (ff, •) is a subgroup of (G, •), then Vh l5 h? £ H: 

(*) i(hi • h 2 ) = i{hi) • i(h 2 ). 

We say that the operation • on H is 'induced’ from the operation ■ on G; in practice 
one omits explicit mention of i and of the operations, and (*) guarantees that no 
ambiguity will arise from this. 

The subgroup condition may be streamlined. A subset H of a group G determines
a subgroup if the operation · in G induces (by (*)) a binary operation in H
(we say that H is closed with respect to the operation in G), satisfying the group
axioms. Since the identity and inverses are preserved through homomorphisms
(Proposition 3.2), the identity $e_H$ of H will have to coincide with the identity $e_G$
of G and the inverse of an element $h \in H$ has to be the same as the inverse of that
element in G. The most economical way to say all this is

Proposition 6.2. A nonempty subset H of a group G is a subgroup if and only if

$(\forall a, b \in H) : ab^{-1} \in H.$

Proof. It is clear that if H is a subgroup, then the stated condition holds: indeed,
if $b \in H$, then the inverse of b must also be in H and H is closed under the operation
of G.

Conversely, assume the stated condition holds; we have to check that H is
closed under the operation of G, the induced operation on H is associative, and
it admits an identity element and inverses (that is, it contains $e_G$ and is closed
under taking inverses in G). Since H is nonempty, we can find an element $h \in H$.
Choosing a = b = h, we see that

$e_G = hh^{-1} = ab^{-1} \in H;$

thus H contains the identity. Given any $h \in H$, choosing $a = e_G$ and b = h shows
that

$h^{-1} = e_G h^{-1} = ab^{-1} \in H;$

thus H contains the inverse of any of its elements. Given any $h_1, h_2 \in H$, choose
$a = h_1$, $b = h_2^{-1}$; the stated condition says that

$h_1 h_2 = h_1 (h_2^{-1})^{-1} = ab^{-1} \in H,$

proving that H is closed under the operation.

Finally, the fact that the operation is associative in G implies immediately that
the induced operation is associative in H, concluding the proof that H, with the
induced operation, is a group. □
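The one-step criterion of Proposition 6.2 is easy to test by machine in a small
example. The following is a minimal sketch, assuming we model Z/nZ by the integers
0, ..., n−1 with addition mod n (so that $ab^{-1}$ reads a − b in additive notation);
the function name is_subgroup is ours and purely illustrative.

```python
def is_subgroup(H, n):
    """Test the criterion of Proposition 6.2 in Z/nZ, written additively:
    H is nonempty and a - b lies in H for all a, b in H."""
    H = {h % n for h in H}
    return bool(H) and all((a - b) % n in H for a in H for b in H)


print(is_subgroup({0, 4, 8}, 12))  # True
print(is_subgroup({0, 4, 7}, 12))  # False: 0 - 4 = 8 is not in the subset
print(is_subgroup(set(), 12))      # False: the empty set is excluded
```

Note that the test never checks for the identity or for inverses separately: as in
the proof, both follow from the single condition.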
This criterion makes it particularly straightforward to check simple facts
concerning subgroups. For example,

Lemma 6.3. If $\{H_\alpha\}_{\alpha \in A}$ is any family of subgroups of a group G, then

$H = \bigcap_{\alpha \in A} H_\alpha$

is a subgroup of G.

Proof. This follows right away from Proposition 6.2: H is nonempty, because
$e_G \in H_\alpha$ for all $\alpha$, so $e_G \in H$; and

$a, b \in H \implies (\forall \alpha \in A) : a, b \in H_\alpha \implies (\forall \alpha \in A) : ab^{-1} \in H_\alpha \implies ab^{-1} \in H,$

proving that H is a subgroup of G. □

Similarly, 

Lemma 6.4. Let $\varphi : G \to G'$ be a group homomorphism, and let H' be a subgroup
of G'. Then $\varphi^{-1}(H')$ is a subgroup of G.

Proof. Recall (end of §I.2.5) that $\varphi^{-1}(H')$ consists of all $g \in G$ such that
$\varphi(g) \in H'$. Since $\varphi(e_G) = e_{G'} \in H'$, this set is nonempty. If $a, b \in \varphi^{-1}(H')$,
then $\varphi(a)$ and $\varphi(b)$ are in H', and hence

$\varphi(ab^{-1}) = \varphi(a)\varphi(b)^{-1} \in H';$

thus, $ab^{-1} \in \varphi^{-1}(H')$. This implies that $\varphi^{-1}(H')$ is a subgroup of G, by
Proposition 6.2. □

6.2. Examples: Kernel and image. Every group homomorphism $\varphi : G \to G'$
determines two interesting subgroups:

• the kernel of $\varphi$, $\ker\varphi \subseteq G$; and

• the image of $\varphi$, $\operatorname{im}\varphi \subseteq G'$.

Definition 6.5. The kernel of $\varphi : G \to G'$ is the subset of G consisting of elements
mapping to the identity in G':

$\ker\varphi := \{g \in G \mid \varphi(g) = e_{G'}\} = \varphi^{-1}(e_{G'}).$ ⌟

Since $\{e_{G'}\}$ is a subgroup of G', Lemma 6.4 shows that $\ker\varphi$ is indeed a
subgroup of G. For an (even) more explicit argument, note that $\ker\varphi$ is nonempty,
since $e_G \in \ker\varphi$; and if a, b are in $\ker\varphi$, then

$\varphi(ab^{-1}) = \varphi(a)\varphi(b)^{-1} = e_{G'} e_{G'}^{-1} = e_{G'},$

proving that $ab^{-1} \in \ker\varphi$. This shows that $\ker\varphi$ is a subgroup of G, by
Proposition 6.2.

The verification that $\operatorname{im}\varphi$ is a subgroup is left to the reader. In fact, the reader
should check that the image of any subgroup of G is a subgroup of G'.
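For a concrete instance of kernel and image, here is a small sketch under our own
assumptions: we take G = G' = Z/12Z (modeled by 0, ..., 11) and the homomorphism
[x] ↦ [3x], chosen only for illustration; the helper names are hypothetical.

```python
def kernel_and_image(n, k):
    """For phi: Z/nZ -> Z/nZ, phi([x]) = [k*x], return (ker phi, im phi)."""
    def phi(x):
        return (k * x) % n

    ker = {x for x in range(n) if phi(x) == 0}
    im = {phi(x) for x in range(n)}
    return ker, im


def closed_under_difference(H, n):
    """The criterion of Proposition 6.2 in additive notation."""
    return bool(H) and all((a - b) % n in H for a in H for b in H)


ker, im = kernel_and_image(12, 3)
print(sorted(ker), sorted(im))  # [0, 4, 8] [0, 3, 6, 9]
print(closed_under_difference(ker, 12), closed_under_difference(im, 12))  # True True
```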

We will soon (§7.1) see that kernels are ‘special’ subgroups. As with most 
constructions of importance in algebra, they satisfy a universal property, which 
may be expressed as follows. 
Proposition 6.6. Let $\varphi : G \to G'$ be a homomorphism. Then the inclusion
$i : \ker\varphi \hookrightarrow G$ is final in the category²⁶ of group homomorphisms $\alpha : K \to G$ such that
$\varphi \circ \alpha$ is the trivial map.

In other words, every group homomorphism $\alpha : K \to G$ such that $\varphi \circ \alpha$ is the
trivial homomorphism (denoted ‘0’ in the diagram) factors uniquely through $\ker\varphi$:

[Diagram: $\alpha : K \to G$ followed by $\varphi : G \to G'$, with composite 0; the inclusion
$\ker\varphi \hookrightarrow G$; and the unique induced map $K \to \ker\varphi$.]

Proof. If $\alpha : K \to G$ is such that $\varphi \circ \alpha$ is the trivial map, then $\forall k \in K$

$\varphi \circ \alpha(k) = \varphi(\alpha(k)) = e_{G'},$

that is, $\alpha(k) \in \ker\varphi$. We can (and must) then let $K \to \ker\varphi$ simply be $\alpha$ itself,
with restricted target. □

Proposition 6.6 indicates how one might define a notion analogous to ‘kernel’ in 
very general settings. This viewpoint will be championed much later in this book, 
especially in Chapter IX. 

Remark 6.7. The argument shows that in fact kernels of group homomorphisms
satisfy a somewhat stronger universal property: any set-function $\alpha : K \to G$
such that the image of $\varphi \circ \alpha$ is the identity in G' must factor (as a set-function)
through $\ker\varphi$. ⌟

6.3. Example: Subgroup generated by a subset. If $A \subseteq G$ is any subset, we
have a unique group homomorphism

$\varphi_A : F(A) \to G,$

by the universal property of free groups. The image of this homomorphism is a
subgroup of G, the subgroup generated by A in G, often denoted²⁷ $\langle A \rangle$.

Of course, if G is abelian, then $\varphi_A$ factors through $F^{ab}(A)$, so we may replace
$F(A)$ by $F^{ab}(A)$ in this case.

The ‘concrete’ description of free groups (§5.3) leads to the following description
of $\langle A \rangle$: it consists of all products in G of the form

$a_1 \cdot a_2 \cdot a_3 \cdots a_n,$

where each $a_i$ is either an element of A, the inverse of an element of A, or the
identity. This is clearly the most ‘economical’ way to manufacture a subgroup of
G, given the elements of A.
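The description of $\langle A \rangle$ as the set of all finite products of elements of A and
their inverses can be turned into a small search procedure when G is finite. The
sketch below makes several assumptions of our own: permutations of {0, ..., n−1} are
stored as tuples p with i ↦ p[i], compose(p, q) means ‘p after q’, and all function
names are hypothetical.

```python
def compose(p, q):
    """Composition 'p after q' of permutations stored as tuples (i -> p[i])."""
    return tuple(p[i] for i in q)


def inverse(p):
    """Inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def generated_subgroup(gens, n):
    """All finite products of elements of gens and of their inverses (in S_n)."""
    identity = tuple(range(n))
    letters = set(gens) | {inverse(g) for g in gens}
    elements = {identity}
    frontier = {identity}
    while frontier:
        # Extend every word found in the last round by one more letter.
        new = {compose(w, a) for w in frontier for a in letters} - elements
        elements |= new
        frontier = new
    return elements


swap, cycle = (1, 0, 2), (1, 2, 0)
print(len(generated_subgroup([swap], 3)))         # 2
print(len(generated_subgroup([cycle], 3)))        # 3
print(len(generated_subgroup([swap, cycle], 3)))  # 6
```

The loop stops as soon as multiplying by one more letter produces nothing new, which
must happen in a finite group.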


²⁶ The reader should specify what the morphisms are in this category.
²⁷ If $A = \{g_1, \dots, g_r\}$ is a finite set, one writes $\langle g_1, \dots, g_r \rangle$.






II. Groups, first encounter 


The reader who has not (yet) developed a taste for free groups may prefer
the following alternative description: $\langle A \rangle$ is the intersection of all subgroups of G
containing A,

$\langle A \rangle = \bigcap_{H \text{ subgroup of } G,\ H \supseteq A} H.$

Indeed, the intersection on the right-hand side is a subgroup of G by Lemma 6.3,
it contains A, and it is clearly the smallest subgroup satisfying this condition.

If $A = \{g\}$ consists of a single element, then $F(A) \cong \mathbb{Z}$ and $\varphi_A : \mathbb{Z} \to G$ is
nothing but the ‘exponential map’ $\epsilon_g$ (cf. §4.1); $\langle A \rangle = \langle g \rangle$ is then the image of this
map:

$\langle g \rangle = \operatorname{im}(\epsilon_g) = \{\dots, g^{-2}, g^{-1}, e, g, g^2, \dots\}.$

The subgroup $\langle g \rangle$ is the ‘cyclic subgroup generated by g’: indeed, $\langle g \rangle$ is cyclic in the
sense of Definition 4.7; the reader can easily check this fact already (Exercise 6.4); it
will also be recovered as an immediate consequence of the construction of quotients
(cf. §7.5).

Definition 6.8. A group G is finitely generated if there exists a finite subset $A \subseteq G$
such that $G = \langle A \rangle$. ⌟

For example, cyclic groups are finitely generated (in fact, they are generated
by a singleton). By definition, a group is finitely generated if and only if there is a
surjective homomorphism

$F(\{1, \dots, n\}) \twoheadrightarrow G$

for some n. One of the most memorable results proven in this book will give
a classification of finitely generated abelian groups: we will be able to prove that
every such group is a direct sum of cyclic groups (Theorem IV.6.6, Exercise VI.2.19,
and the generalization given in Theorem VI.5.6). The situation for general groups
is considerably more complex. The classification of finite (simple) groups is one of
the major achievements of twentieth-century mathematics, and it is spread over at
least 10,000 pages of research articles. To appreciate the difference in complexity,
note that there are 42 abelian groups of order 1024 up to isomorphism (as the
reader will be able to establish in due time: Exercise IV.6.6); allegedly, there are
49,487,365,422 if we count noncommutative ones as well²⁸.

6.4. Example: Subgroups of cyclic groups. We are ready to determine all
subgroups of all cyclic groups, that is, all subgroups of Z and of Z/nZ, for all n > 0
(because every cyclic group is isomorphic to one of these; cf. Definition 4.7). The
result is easy to remember: subgroups of cyclic groups are themselves cyclic groups.
It is convenient to start from Z. For $d \in \mathbb{Z}$ we let

$d\mathbb{Z} := \langle d \rangle = \{m \in \mathbb{Z} \mid \exists q \in \mathbb{Z},\ m = dq\};$

that is, dZ denotes the set of integer multiples of d. Of course this is nothing but
the ‘cyclic subgroup of Z generated by d’.

Proposition 6.9. Let $G \subseteq \mathbb{Z}$ be a subgroup. Then $G = d\mathbb{Z}$ for some $d \ge 0$.

²⁸ This comparison is a little unfair, however, since it so happens that more than 99% of all
groups of order < 2000 have order 1024.
The proof will actually show that if $G \subseteq \mathbb{Z}$ is nontrivial, then d is the smallest
positive element of G, and the reader is invited to remember this useful fact.

Remark 6.10. By Proposition 6.9, every nontrivial subgroup of Z is in fact
isomorphic to Z. Putting this a little strangely, it says that every subgroup of the
free group on one generator is free. It is in fact true that every subgroup of a
(finitely generated) free group is free; we will not prove this fact, although the
diligent reader will get a taste of the argument in Exercise 9.16. In any case, beware
that free groups on two generators already contain subgroups isomorphic to free
groups on arbitrarily many generators. Indeed, the commutator subgroup
(cf. Exercise 7.12) $[F, F]$ for $F = F(\{x, y\})$ is isomorphic to a free group on infinitely many
generators (unfortunately, we will not prove this beautiful statement either). ⌟

Proof of Proposition 6.9. If G = {0}, then G = 0Z. If not, note that G must
contain positive integers: indeed, if $a \in G$ and a < 0, then $-a \in G$ and −a > 0.
We can then let d be the smallest positive integer²⁹ in G, and we claim G = dZ.

The inclusion $d\mathbb{Z} \subseteq G$ is clear. To verify the inclusion $G \subseteq d\mathbb{Z}$, let $m \in G$, and
apply ‘division with remainder’ to write

$m = dq + r,$

with $0 \le r < d$. Since $m \in G$ and $d\mathbb{Z} \subseteq G$ and since G is a subgroup, we see that

$r = m - dq \in G.$

But d is the smallest positive integer in G, and $r \in G$ is smaller than d; so r cannot
be positive. This shows r = 0, that is, $m = qd \in d\mathbb{Z}$; $G \subseteq d\mathbb{Z}$ follows, and we are
done. □

The ‘quotient’ homomorphism $\pi_n : \mathbb{Z} \to \mathbb{Z}/n\mathbb{Z}$ (cf. §4.1) allows us to establish
the analogous result for finite cyclic groups:

Proposition 6.11. Let n > 0 be an integer and let $G \subseteq \mathbb{Z}/n\mathbb{Z}$ be a subgroup. Then
G is the cyclic subgroup of Z/nZ generated by $[d]_n$, for some divisor d of n.

Proof. Let $\pi_n : \mathbb{Z} \to \mathbb{Z}/n\mathbb{Z}$ be the quotient map, and consider $G' := \pi_n^{-1}(G)$. By
Lemma 6.4, G' is a subgroup of Z; by Proposition 6.9, G' is a cyclic subgroup of Z,
generated by a nonnegative integer d. It follows that

$G = \pi_n(G') = \pi_n(\langle d \rangle) = \langle [d]_n \rangle;$

thus G is indeed a cyclic subgroup of Z/nZ, generated by a class $[d]_n$. Further,
since $n \in G'$ (because $\pi_n(n) = [n]_n = [0]_n \in G$) and $G' = d\mathbb{Z}$, we see that d divides
n, as claimed. □
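Proposition 6.11 can be confirmed by brute force for a small modulus. The sketch
below is only a sanity check under our own modeling assumptions (Z/nZ as the integers
0, ..., n−1; hypothetical function names): it lists every subset of Z/nZ passing the
test of Proposition 6.2 and compares the result with the cyclic subgroups $\langle [d]_n \rangle$
for the positive divisors d of n.

```python
from itertools import combinations


def subgroups_of_Zn(n):
    """Brute force: all subsets of Z/nZ containing 0 and closed under a - b."""
    found = []
    others = range(1, n)
    for size in range(n):
        for extra in combinations(others, size):
            H = frozenset((0,) + extra)
            if all((a - b) % n in H for a in H for b in H):
                found.append(H)
    return found


def cyclic(d, n):
    """The cyclic subgroup of Z/nZ generated by the class of d."""
    return frozenset((d * q) % n for q in range(n))


n = 12
subs = set(subgroups_of_Zn(n))
from_divisors = {cyclic(d, n) for d in range(1, n + 1) if n % d == 0}
print(subs == from_divisors, len(subs))  # True 6
```

The search is exponential in n, so it is only meant for very small cases.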

²⁹ We are secretly appealing to the ‘well-ordering principle’. That every set of positive integers
should have a smallest element is one of those facts about Z (like the availability of
division-with-remainder) that we are assuming the reader is already familiar with.

As a consequence of Proposition 6.11, there is a bijection between the set of
subgroups of Z/nZ and the set of positive divisors of n. For example, Z/12Z has
exactly 6 subgroups, because 12 has 6 positive divisors: 1, 2, 3, 4, 6, and 12. Here
is the corresponding list of subgroups:

$\langle [1]_{12} \rangle = \{[0]_{12}, [1]_{12}, [2]_{12}, [3]_{12}, [4]_{12}, [5]_{12}, [6]_{12}, [7]_{12}, [8]_{12}, [9]_{12}, [10]_{12}, [11]_{12}\},$
$\langle [2]_{12} \rangle = \{[0]_{12}, [2]_{12}, [4]_{12}, [6]_{12}, [8]_{12}, [10]_{12}\},$
$\langle [3]_{12} \rangle = \{[0]_{12}, [3]_{12}, [6]_{12}, [9]_{12}\},$
$\langle [4]_{12} \rangle = \{[0]_{12}, [4]_{12}, [8]_{12}\},$
$\langle [6]_{12} \rangle = \{[0]_{12}, [6]_{12}\},$
$\langle [12]_{12} \rangle = \{[0]_{12}\}.$


Also note that if $d_1$, $d_2$ are both divisors of n, and $d_1 \mid d_2$, then $\langle [d_1]_n \rangle \supseteq \langle [d_2]_n \rangle$.
That is, the correspondence between subgroups of Z/nZ and divisors of n preserves
the natural lattice structure carried by these sets. We can draw these lattices for
Z/6Z as follows:

[Figure: the lattice of positive divisors of 6 and the lattice of subgroups of Z/6Z.]

where lines connect multiples in one picture and subsets in the other. The reader
will draw the lattice of subgroups of $S_3$, noting that it looks completely different
from the one for Z/6Z.

Contemplating subgroups of cyclic groups has pretty (and useful) ‘number- 
theoretic’ consequences; cf. Exercise 6.14. 


6.5. Monomorphisms. We end this section with some categorical considerations. 

If H is a subgroup of G, the inclusion $H \hookrightarrow G$ is an example of a monomorphism
in Grp in the ‘categorical’ sense of §I.4.2. In fact, it is easy to characterize all
monomorphisms $\varphi \in \mathrm{Hom}_{\mathsf{Grp}}(G, G')$ (where G, G' are any groups):

Proposition 6.12. The following are equivalent:

(a) $\varphi$ is a monomorphism;

(b) $\ker\varphi = \{e_G\}$;

(c) $\varphi : G \to G'$ is injective (as a set-function).

Proof. (a) $\implies$ (b): Assume (a) holds, and consider the two parallel compositions

$\ker\varphi \underset{e}{\overset{i}{\rightrightarrows}} G \xrightarrow{\ \varphi\ } G',$

where i is the inclusion and e is the trivial map. Both $\varphi \circ i$ and $\varphi \circ e$ are the trivial
map; since $\varphi$ is a monomorphism, this implies i = e. But i = e implies that $\ker\varphi$
is trivial, that is, (b) holds.

(b) $\implies$ (c): Assume $\ker\varphi = \{e_G\}$. Then

$\varphi(g_1) = \varphi(g_2) \implies \varphi(g_1)\varphi(g_2)^{-1} = e_{G'} \implies \varphi(g_1 g_2^{-1}) = e_{G'}$
$\implies g_1 g_2^{-1} \in \ker\varphi \implies g_1 g_2^{-1} = e_G \implies g_1 = g_2.$

This shows that $\varphi$ is injective, as needed.

(c) $\implies$ (a): If $\varphi$ is injective, then it satisfies the defining property for
monomorphisms in Set: that is, for any set Z and any two set-functions $\alpha', \alpha'' : Z \to G$,

$\varphi \circ \alpha' = \varphi \circ \alpha'' \iff \alpha' = \alpha''.$

This must hold in particular if Z has a group structure and $\alpha'$, $\alpha''$ are group
homomorphisms, so $\varphi$ is a monomorphism in Grp. □
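The equivalence of (b) and (c) can be observed directly in a small family of
examples. In the sketch below (our own illustration, with a hypothetical function
name) we run through the homomorphisms [x] ↦ [kx] of Z/nZ to itself and compare
‘trivial kernel’ with ‘injective’ for each k.

```python
def check_mono_criterion(n):
    """For each map x -> k*x on Z/nZ, record (k, kernel is trivial, map is injective)."""
    results = []
    for k in range(n):
        images = [(k * x) % n for x in range(n)]
        trivial_kernel = [x for x in range(n) if images[x] == 0] == [0]
        injective = len(set(images)) == n
        results.append((k, trivial_kernel, injective))
    return results


rows = check_mono_criterion(12)
print(all(tk == inj for _, tk, inj in rows))  # True
print([k for k, tk, _ in rows if tk])         # [1, 5, 7, 11]
```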

The equivalence (a) $\iff$ (c) may lead the reader to think that from the point
of view of monomorphisms, Grp and Set are pretty much alike. This is not quite so:
while it is true that homomorphisms with a left-inverse are necessarily
monomorphisms, as in Set (cf. Exercise 6.15), the converse is not true in Grp
(cf. Exercise 6.16).


Exercises 


6.1. ¬ (If you know about matrices.) The group of invertible n × n matrices with
entries in R is denoted $GL_n(\mathbb{R})$ (Example 1.5). Similarly, $GL_n(\mathbb{C})$ denotes the group
of n × n invertible matrices with complex entries. Consider the following sets of
matrices:

• $SL_n(\mathbb{R}) = \{M \in GL_n(\mathbb{R}) \mid \det(M) = 1\}$;

• $SL_n(\mathbb{C}) = \{M \in GL_n(\mathbb{C}) \mid \det(M) = 1\}$;

• $O_n(\mathbb{R}) = \{M \in GL_n(\mathbb{R}) \mid MM^t = M^tM = I_n\}$;

• $SO_n(\mathbb{R}) = \{M \in O_n(\mathbb{R}) \mid \det(M) = 1\}$;

• $U_n(\mathbb{C}) = \{M \in GL_n(\mathbb{C}) \mid MM^\dagger = M^\dagger M = I_n\}$;

• $SU_n(\mathbb{C}) = \{M \in U_n(\mathbb{C}) \mid \det(M) = 1\}$.

Here $I_n$ stands for the n × n identity matrix, $M^t$ is the transpose of M, $M^\dagger$ is the
conjugate transpose of M, and det(M) denotes the determinant³⁰ of M. Find all
possible inclusions among these sets, and prove that in every case the smaller set
is a subgroup of the larger one.

These sets of matrices have compelling geometric interpretations: for example,
$SO_3(\mathbb{R})$ is the group of ‘rotations’ in $\mathbb{R}^3$. [8.8, 9.1, III.1.4, VI.6.16]

³⁰ If you are not familiar with some of these notions, that's OK: leave this exercise and similar
ones alone if that is the case. We will come back to linear algebra and matrices in Chapter VI
and following.
6.2. ¬ Prove that the set of 2 × 2 matrices

$\begin{pmatrix} a & b \\ 0 & d \end{pmatrix}$

with a, b, d in C and $ad \ne 0$ is a subgroup of $GL_2(\mathbb{C})$. More generally, prove that the
set of n × n complex matrices $(a_{ij})_{1 \le i,j \le n}$ with $a_{ij} = 0$ for i > j and $a_{11} \cdots a_{nn} \ne 0$
is a subgroup of $GL_n(\mathbb{C})$. (These matrices are called ‘upper triangular’, for evident
reasons.) [IV.1.20]

6.3. ¬ Prove that every matrix in $SU_2(\mathbb{C})$ may be written in the form

$\begin{pmatrix} a + bi & c + di \\ -c + di & a - bi \end{pmatrix}$

where $a, b, c, d \in \mathbb{R}$ and $a^2 + b^2 + c^2 + d^2 = 1$. (Thus, $SU_2(\mathbb{C})$ may be realized as
a three-dimensional sphere embedded in $\mathbb{R}^4$; in particular, it is simply connected.)
[8.9, III.2.5]

6.4. ▷ Let G be a group, and let $g \in G$. Verify that the image of the exponential
map $\epsilon_g : \mathbb{Z} \to G$ is a cyclic group (in the sense of Definition 4.7). [§6.3, §7.5]

6.5. Let G be a commutative group, and let n > 0 be an integer. Prove that
$\{g^n \mid g \in G\}$ is a subgroup of G. Prove that this is not necessarily the case if G is
not commutative.

6.6. Prove that the union of a family of subgroups of a group G is not necessarily
a subgroup of G. In fact:

• Let H, H' be subgroups of a group G. Prove that $H \cup H'$ is a subgroup of G
only if $H \subseteq H'$ or $H' \subseteq H$.

• On the other hand, let $H_0 \subseteq H_1 \subseteq H_2 \subseteq \cdots$ be subgroups of a group G. Prove
that $\bigcup_{i \ge 0} H_i$ is a subgroup of G.

6.7. ¬ Show that inner automorphisms (cf. Exercise 4.8) form a subgroup of
Aut(G); this subgroup is denoted Inn(G). Prove that Inn(G) is cyclic if and only
if Inn(G) is trivial if and only if G is abelian. (Hint: Assume that Inn(G) is cyclic;
with notation as in Exercise 4.8, this means that there exists an element $a \in G$
such that $\forall g \in G\ \exists n \in \mathbb{Z}\ \gamma_g = \gamma_a^n$. In particular, $gag^{-1} = a^n a a^{-n} = a$. Thus a
commutes with every g in G. Therefore . . . ) Deduce that if Aut(G) is cyclic, then
G is abelian. [7.10, IV.1.5]

6.8. Prove that an abelian group G is finitely generated if and only if there is a
surjective homomorphism

$\underbrace{\mathbb{Z} \oplus \cdots \oplus \mathbb{Z}}_{n \text{ times}} \twoheadrightarrow G$

for some n.

6.9. Prove that every finitely generated subgroup of Q is cyclic. Prove that Q is 
not finitely generated. 
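6.10. ¬ The set of 2 × 2 matrices with integer entries and determinant 1 is denoted
$SL_2(\mathbb{Z})$:

$SL_2(\mathbb{Z}) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \text{ such that } a, b, c, d \in \mathbb{Z},\ ad - bc = 1 \right\}.$

Prove that $SL_2(\mathbb{Z})$ is generated by the matrices

$s = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $t = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.$

(Hint: This is a little tricky. Let H be the subgroup generated by s and t. Given a
matrix $m = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$ in $SL_2(\mathbb{Z})$, it suffices to show that you can obtain the identity
by multiplying m by suitably chosen elements of H. Prove that

$\begin{pmatrix} 1 & -q \\ 0 & 1 \end{pmatrix}$ and $\begin{pmatrix} 1 & 0 \\ q & 1 \end{pmatrix}$

are in H, and note that

$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \begin{pmatrix} 1 & -q \\ 0 & 1 \end{pmatrix} = \begin{pmatrix} a & b - qa \\ c & d - qc \end{pmatrix}$ and $\begin{pmatrix} a & b \\ c & d \end{pmatrix} \begin{pmatrix} 1 & 0 \\ q & 1 \end{pmatrix} = \begin{pmatrix} a + qb & b \\ c + qd & d \end{pmatrix}.$

Note that if c and d are both nonzero, one of these two operations may be used
to decrease the absolute value of one of them. Argue that suitable applications of
these operations reduce to the case in which c = 0 or d = 0. Prove directly that
$m \in H$ in that case.) [7.5]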


6.11. Since direct sums are coproducts in Ab, the classification theorem for abelian
groups mentioned in the text says that every finitely generated abelian group is a
coproduct of cyclic groups in Ab. The reader may be tempted to conjecture that
every finitely generated group is a coproduct in Grp. Show that this is not the case,
by proving that $S_3$ is not a coproduct of cyclic groups.


6.12. Let m, n be positive integers, and consider the subgroup $\langle m, n \rangle$ of Z they
generate. By Proposition 6.9,

$\langle m, n \rangle = d\mathbb{Z}$

for some positive integer d. What is d, in relation to m, n?


6.13. ¬ Draw and compare the lattices of subgroups of $C_2 \times C_2$ and $C_4$. Draw the
lattice of subgroups of $S_3$, and compare it with the one for $C_6$. [7.1]

6.14. ▷ If m is a positive integer, denote by $\phi(m)$ the number of positive integers
$r \le m$ that are relatively prime to m (that is, for which the gcd of r and m is 1);
this is called Euler's $\phi$- (or ‘totient’) function. For example, $\phi(12) = 4$. In other
words, $\phi(m)$ is the order of the group $(\mathbb{Z}/m\mathbb{Z})^*$; cf. Proposition 2.6.

Put together the following observations:

• $\phi(m)$ = the number of generators of $C_m$,

• every element of $C_n$ generates a subgroup of $C_n$,

• the discussion following Proposition 6.11 (in particular, every subgroup of $C_n$ is
isomorphic to $C_m$, for some $m \mid n$),

to obtain a proof of the formula

$\sum_{m > 0,\ m \mid n} \phi(m) = n.$

(For example, $\phi(1) + \phi(2) + \phi(3) + \phi(4) + \phi(6) + \phi(12) = 1 + 1 + 2 + 2 + 2 + 4 = 12$.)
[4.14, §6.4, 8.15, V.6.8, §VII.5.2]
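Before attempting the proof, it may be reassuring to see the formula hold numerically.
The following sketch simply evaluates both sides; it is a check, not an argument, and
the function names phi and divisor_sum are ours.

```python
from math import gcd


def phi(m):
    """Euler's totient: the number of 1 <= r <= m with gcd(r, m) = 1."""
    return sum(1 for r in range(1, m + 1) if gcd(r, m) == 1)


def divisor_sum(n):
    """Sum of phi(m) over the positive divisors m of n."""
    return sum(phi(m) for m in range(1, n + 1) if n % m == 0)


print(phi(12))          # 4
print(divisor_sum(12))  # 12
```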

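6.15. ▷ Prove that if a group homomorphism $\varphi : G \to G'$ has a left-inverse, that
is, a group homomorphism $\psi : G' \to G$ such that $\psi \circ \varphi = \mathrm{id}_G$, then $\varphi$ is a
monomorphism. [§6.5, 6.16]

6.16. ▷ Counterpoint to Exercise 6.15: the homomorphism $\varphi : \mathbb{Z}/3\mathbb{Z} \to S_3$ given by

$\varphi([0]) = \begin{pmatrix} 1 & 2 & 3 \\ 1 & 2 & 3 \end{pmatrix}, \quad \varphi([1]) = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix}, \quad \varphi([2]) = \begin{pmatrix} 1 & 2 & 3 \\ 2 & 3 & 1 \end{pmatrix}$

is a monomorphism; show that it has no left-inverse in Grp. (Knowing about normal
subgroups will make this problem particularly easy.) [§6.5]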


7. Quotient groups 

7.1. Normal subgroups. Before tackling ‘quotient groups’, we should clarify in
what sense kernels are special subgroups, as claimed in §6.2.

Definition 7.1. A subgroup N of a group G is normal if $\forall g \in G$, $\forall n \in N$,

$gng^{-1} \in N.$ ⌟

Note that every subgroup of a commutative group is normal (because then $\forall g \in G$,
$gng^{-1} = n \in N$). However, in general not all subgroups are normal: examples
may be found already in $S_3$ (cf. Exercise 7.1). There exist noncommutative groups
in which every subgroup is normal (one example is the ‘quaternionic group’ $Q_8$;
cf. Exercise III.1.12 (iv)), but they are very rare.

Lemma 7.2. If $\varphi : G \to G'$ is any group homomorphism, then $\ker\varphi$ is a normal
subgroup of G.

Proof. We already know that $\ker\varphi$ is a subgroup of G; to verify it is normal note
that $\forall g \in G$, $\forall n \in \ker\varphi$

$\varphi(gng^{-1}) = \varphi(g)\varphi(n)\varphi(g^{-1}) = \varphi(g) e_{G'} \varphi(g)^{-1} = e_{G'},$

proving that $gng^{-1} \in \ker\varphi$. □

Loosely speaking, therefore, kernel $\implies$ normal. In fact more is true, as we
will see in a little while; for now I don’t want to spoil the surprise for the reader.
(Can the reader guess?)

There is a convenient shorthand to express conditions such as normality: if
$g \in G$ and $A \subseteq G$ is any subset, we denote by gA, Ag, respectively, the following
subsets of G:

$gA := \{h \in G \mid (\exists a \in A) : h = ga\},$
$Ag := \{h \in G \mid (\exists a \in A) : h = ag\}.$

Then the normality condition can be expressed by

$(\forall g \in G) : gNg^{-1} \subseteq N,$

or in a number of other ways:

$gNg^{-1} = N$ or $gN \subseteq Ng$ or $gN = Ng$

for all $g \in G$. The reader should check that these are indeed equivalent conditions
(Exercise 7.3) and keep in mind that ‘gN = Ng’ does not mean that g commutes
with every element of N; it means that if $n \in N$, then there are elements $n', n'' \in N$,
in general different from n, such that gn = n'g (so that $gN \subseteq Ng$) and ng = gn''
(so that $Ng \subseteq gN$).
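The warning about ‘gN = Ng’ can be seen in the smallest noncommutative example.
The sketch below rests on assumptions of our own: permutations of {0, 1, 2} are
stored as tuples p with i ↦ p[i], compose(p, q) means ‘p after q’, and N is taken to
be the subgroup generated by a 3-cycle. The helper names are hypothetical.

```python
def compose(p, q):
    """Composition 'p after q' of permutations stored as tuples (i -> p[i])."""
    return tuple(p[i] for i in q)


def left_coset(g, N):
    return {compose(g, n) for n in N}


def right_coset(N, g):
    return {compose(n, g) for n in N}


identity, swap, cycle = (0, 1, 2), (1, 0, 2), (1, 2, 0)
N = {identity, cycle, compose(cycle, cycle)}

# The two cosets agree as sets ...
print(left_coset(swap, N) == right_coset(N, swap))   # True
# ... although swap does not commute with the element cycle of N.
print(compose(swap, cycle) == compose(cycle, swap))  # False
```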

7.2. Quotient group. Recall that we have the notion of a quotient of a set by
an equivalence relation (§I.1.5) and that this notion satisfies a universal property
(clumsily stated in §I.5.3). It is natural to investigate this notion in Grp.

We consider then an equivalence relation ~ on (the set underlying) a group G;
we seek a group G/~ and a group homomorphism $\pi : G \to G/{\sim}$ satisfying the
appropriate universal property, that is, initial with respect to group homomorphisms
$\varphi : G \to G'$ such that $a \sim b \implies \varphi(a) = \varphi(b)$.

It is natural to try to construct the group G/~ by defining an operation • on the
set G/~. The situation is tightly constrained by the requirement that the quotient
map $\pi : G \to G/{\sim}$ (as in §I.2.6) be a group homomorphism: for if $[a] = \pi(a)$,
$[b] = \pi(b)$ are elements of G/~ (that is, equivalence classes with respect to ~),
then the homomorphism condition forces

$[a] \bullet [b] = \pi(a) \bullet \pi(b) = \pi(ab) = [ab].$

But is this operation well-defined? This amounts to conditions on the equivalence
relation, which we proceed to unearth.

For the operation to be well-defined ‘in the first factor’, it is necessary that if
[a] = [a'], then [ab] = [a'b] regardless of what b is; that is,

$(\forall g \in G) : a \sim a' \implies ag \sim a'g.$

Similarly, for the operation to be well-defined in the second factor we need

$(\forall g \in G) : a \sim a' \implies ga \sim ga'.$

Luckily, this is all that there is to it:

Proposition 7.3. With notation as above, the operation

$[a] \bullet [b] := [ab]$

defines a group structure on G/~ if and only if $\forall a, a', g \in G$

$a \sim a' \implies ga \sim ga'$ and $ag \sim a'g.$

In this case the quotient function $\pi : G \to G/{\sim}$ is a homomorphism and is universal
with respect to homomorphisms $\varphi : G \to G'$ such that $a \sim a' \implies \varphi(a) = \varphi(a')$.
Proof. We have already noted that the condition is necessary. To prove it is
sufficient, with the stated consequences, assume $\forall a, a', g \in G$

$a \sim a' \implies ga \sim ga'$ and $ag \sim a'g.$

Then the operation

$[a] \bullet [b] := [ab]$

is well-defined, and we have to verify that it defines a group structure on G/~. The
associativity of • is inherited from the associativity of G: $\forall a, b, c \in G$

$([a] \bullet [b]) \bullet [c] = [ab] \bullet [c] = [(ab)c] = [a(bc)] = [a] \bullet [bc] = [a] \bullet ([b] \bullet [c]).$

The class $[e_G]$ is an identity with respect to this operation: $\forall g \in G$

$[g] \bullet [e_G] = [g e_G] = [g], \quad [e_G] \bullet [g] = [e_G g] = [g].$

The class $[g^{-1}]$ is the inverse of $[g]$:

$[g^{-1}] \bullet [g] = [g^{-1}g] = [e_G], \quad [g] \bullet [g^{-1}] = [gg^{-1}] = [e_G].$

This shows G/~ is indeed a group, and we have already observed that $\pi : G \to G/{\sim}$
is a homomorphism: this is what led us to the definition of •.

To prove that G/~ satisfies the universal property, assume

$\varphi : G \to G'$

is a group homomorphism such that $a \sim a' \implies \varphi(a) = \varphi(a')$. Since (cf. §I.5.3)
the set G/~ satisfies the corresponding universal property in Set, we know that
there exists a unique set-function

$\tilde\varphi : G/{\sim} \to G',$

defined³¹ by $\tilde\varphi([a]) := \varphi(a)$. So we only need to check that this function $\tilde\varphi$ is in fact
a group homomorphism, and this is immediate:

$\tilde\varphi([a] \bullet [b]) = \tilde\varphi([ab]) = \varphi(ab) = \varphi(a)\varphi(b) = \tilde\varphi([a])\tilde\varphi([b])$

for all $[a], [b] \in G/{\sim}$, as needed. □


We will say that ~ is compatible with the group structure of G if the condition 
given in Proposition 7.3 holds. Since the operation • on the quotient G/~ is 
uniquely determined by the operation on G, we yield to the usual abuse of language 
and omit it. If ~ is compatible, we will call G/~ the quotient group of G by ~. 
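Compatibility is a finite check when G is finite, so it can be tested mechanically.
The sketch below is an illustration under our own assumptions: the group is Z/12Z
with addition mod 12, an equivalence relation is given by its list of classes, and
is_compatible is a hypothetical helper that tests the condition of Proposition 7.3.

```python
def is_compatible(elements, op, classes):
    """Check: a ~ b implies g*a ~ g*b and a*g ~ b*g, for ~ given by a partition."""
    cls = {a: i for i, c in enumerate(classes) for a in c}
    return all(
        cls[op(g, a)] == cls[op(g, b)] and cls[op(a, g)] == cls[op(b, g)]
        for c in classes for a in c for b in c for g in elements
    )


n = 12
Z12 = range(n)


def add(a, b):
    return (a + b) % n


mod4 = [{a for a in Z12 if a % 4 == r} for r in range(4)]  # a ~ b iff a = b mod 4
halves = [set(range(6)), set(range(6, 12))]                # an arbitrary partition

print(is_compatible(Z12, add, mod4))    # True
print(is_compatible(Z12, add, halves))  # False: 0 ~ 5 but 1 and 6 are not related
```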


7.3. Cosets. The conditions obtained in Proposition 7.3,

(†) $(\forall g \in G) : a \sim b \implies ga \sim gb,$

(††) $(\forall g \in G) : a \sim b \implies ag \sim bg,$

lead to a complete description of all compatible relations on a group G. In fact, 
each of these two conditions leads to a description of the relations satisfying it, and 
we will analyze them separately; the reader should keep in mind that we have a 
group structure on G/~ only if both are satisfied. 

Let’s begin with (†). Here is the description:

³¹ The point of §I.5.3 is precisely that this function is well-defined.
Proposition 7.4. Let ~ be an equivalence relation on a group G, satisfying (†).
Then

• the equivalence class of $e_G$ is a subgroup H of G; and

• $a \sim b \iff a^{-1}b \in H \iff aH = bH$.

Proof. Let $H \subseteq G$ be the equivalence class of the identity; $H \ne \emptyset$ as $e_G \in H$. For
$a, b \in H$, we have $e_G \sim b$ and hence $b^{-1} \sim e_G$ (applying (†), multiplying on the left
by $b^{-1}$); hence $ab^{-1} \sim a$ (by (†) again, multiplying on the left by a); and hence

$ab^{-1} \sim a \sim e_G$

by the transitivity of ~ and since $a \in H$. This shows that $ab^{-1} \in H$ for all $a, b \in H$,
proving that H is a subgroup (by Proposition 6.2).

Next, assume $a, b \in G$ and $a \sim b$. Multiplying on the left by $a^{-1}$, (†) implies
$e_G \sim a^{-1}b$, that is, $a^{-1}b \in H$. Since H is closed under the operation, this implies
$a^{-1}bH \subseteq H$, hence $bH \subseteq aH$; as ~ is symmetric, the same reasoning gives
$aH \subseteq bH$; and hence aH = bH. Thus, we have proved

$a \sim b \implies a^{-1}b \in H \implies aH = bH.$

Finally, assume aH = bH. Then $a = ae_G \in bH$, and hence $a^{-1}b \in H$. By definition
of H, this means $e_G \sim a^{-1}b$. Multiplying on the left by a shows (by (†) again) that
$a \sim b$, completing the proof. □

Proposition 7.4 shows that the equivalence classes of an equivalence relation
satisfying (†) are in fact all in the form

aH

for a fixed subgroup H, as a ranges in G. These important subsets determined by
a subgroup H deserve a name.

Definition 7.5. The left-cosets of a subgroup H in a group G are the sets aH, for
$a \in G$. The right-cosets of H are the sets Ha, $a \in G$. ⌟

Now, a ‘converse’ to Proposition 7.4 holds:

Proposition 7.6. If H is any subgroup of a group G, the relation $\sim_L$ defined by

$(\forall a, b \in G) : a \sim_L b \iff a^{-1}b \in H$

is an equivalence relation satisfying (†).

Proof. This is straightforward and is mostly left to the reader (Exercise 7.8). To
see that the relation satisfies (†), note that

$a \sim_L b \implies a^{-1}b \in H \implies a^{-1}(g^{-1}g)b \in H \implies (ga)^{-1}(gb) \in H \implies ga \sim_L gb$

for all $g \in G$. □

Taken together, Propositions 7.4 and 7.6 show

Proposition 7.7. There is a one-to-one correspondence between subgroups of G
and equivalence relations on G satisfying (†); for the relation $\sim_L$ corresponding to
a subgroup H, $G/{\sim_L}$ may be described as the set of left-cosets aH of H.
The reader should have no difficulty producing the mirror statements (and
proofs) giving a similarly exhaustive description of all equivalence relations
satisfying (††). The end result will be

Proposition 7.8. There is a one-to-one correspondence between subgroups of G
and equivalence relations on G satisfying (††); for the relation $\sim_R$ corresponding to
a subgroup H, $G/{\sim_R}$ may be described as the set of right-cosets Ha of H.

The relation corresponding to H in this second way is defined by

$a \sim_R b \iff ab^{-1} \in H \iff Ha = Hb.$

What may be surprising at first is that the relations $\sim_L$ and $\sim_R$ corresponding
to the same subgroup H may very well not be the same relation. That is, left-cosets
and right-cosets of a subgroup need not coincide. Of course eH = He = H, and
more generally

$(\forall h \in H) : hH = Hh = H.$

Further

$(\forall a \in G) : a \in aH \cap Ha;$

hence, if aH = Hb, then in fact necessarily aH = Ha. This is of course automatically
true if G is commutative, but it is simply not the case in general.


Example 7.9. Let $G = S_3$, and let H be the subgroup consisting of the identity
and the 1 ↔ 2 switch:

$H = \left\{ \begin{pmatrix} 1 & 2 & 3 \\ 1 & 2 & 3 \end{pmatrix}, \begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix} \right\}.$

[Displays: a left-coset of H in $S_3$ (‘Then ...’) and a right-coset of H (‘while ...’),
which do not coincide.] ⌟

This state of affairs simply reflects the fact that the two conditions (†) and (††)
are different: there is no reason to expect that if one holds, the other one should
also hold (unless G is commutative, of course). Once more, keep in mind that both
have to hold for the quotient G/~ to be a group, compatibly with the operation in
G; cf. Proposition 7.3.
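The cosets in an example of this kind are quickly computed by machine. The sketch
below is our own and makes assumptions that need not match the conventions of the
text: permutations act on {0, 1, 2} rather than {1, 2, 3}, they are stored as tuples
p with i ↦ p[i], and compose(p, q) means ‘p after q’. With the opposite composition
convention the roles of left and right are exchanged, but the conclusion is the same.

```python
from itertools import permutations


def compose(p, q):
    """Composition 'p after q' of permutations stored as tuples (i -> p[i])."""
    return tuple(p[i] for i in q)


G = list(permutations(range(3)))
H = [(0, 1, 2), (1, 0, 2)]  # the identity and the switch of the first two symbols

left_cosets = {frozenset(compose(a, h) for h in H) for a in G}
right_cosets = {frozenset(compose(h, a) for h in H) for a in G}

print(len(left_cosets), len(right_cosets))  # 3 3
print(left_cosets == right_cosets)          # False
```

Both families partition the six elements of the group into three pairs, but the two
partitions are different.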


7.4. Quotient by normal subgroups. To stress the main point again, an arbitrary
subgroup H of G leads to two partitions of G, which we have denoted

$G/{\sim_L} = \{aH \mid a \in G\}, \quad G/{\sim_R} = \{Ha \mid a \in G\}.$

The relation $\sim_L$ satisfies property (†) listed at the beginning of §7.3; $\sim_R$ satisfies
(††). A priori, these relations, and hence the corresponding partitions, are different.

The condition that $\sim_L$ and $\sim_R$ coincide (as is necessarily the case, for example,
if G is commutative) translates into a condition on H: for such ‘special’ subgroups,
(†) and (††) get back together. The good news is that this condition is easy to
identify and is not new to the reader.
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Proposition 7.10. The relations ~£, corresponding to a subgroup H coincide 
if and only if H is normal. 

Proof. Two relations coincide if the corresponding partitions agree. Therefore 
■<=*• left- and right-cosets of H coincide <=>• (Vg € G ) : gH = Hg. 

But this is one of the equivalent conditions defining the notion of normal subgroup 
(cf §7.1), proving the statement. □ 

The innocenHooking Proposition 7.10 is of fundamental importance. If H is 
normal, then the one equivalence relation ~=~l=~r. corresponding to H satisfies 
both (f) and (f j) (by Proposition 7.4 and its mirror statement), and hence (by 
Proposition 7.3) the quotient 

G/~ = {aH | a 6 G} = {Ha | a e G} 
has a natural group structure. 

Definition 7.11. Let H be a normal subgroup of a group G. The quotient group
of G modulo H, denoted³² G/H, is the group $G/{\sim}$ obtained from the relation $\sim$
defined above. In terms of (left-) cosets, the product in G/H is defined by

\[ (aH)(bH) := (ab)H. \]

The identity element $e_{G/H}$ of the quotient group G/H is the coset of the identity,
$e_G H = H$. ⌟
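As a concrete check of Proposition 7.10 and Definition 7.11, here is a minimal sketch that works in S3 with permutations written as tuples. The helper names (`compose`, `left_cosets`, `right_cosets`, `is_normal`, `coset_product_well_defined`) and the choice of the two subgroups are assumptions made for illustration only. It compares the two coset partitions and tests whether the product (aH)(bH) := (ab)H depends on the chosen representatives.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p) on {0, 1, 2}."""
    return tuple(p[q[i]] for i in range(len(q)))

G = list(permutations(range(3)))          # the symmetric group S3

def left_cosets(H):
    return {frozenset(compose(g, h) for h in H) for g in G}

def right_cosets(H):
    return {frozenset(compose(h, g) for h in H) for g in G}

def is_normal(H):
    # H is normal iff the left- and right-coset partitions agree.
    return left_cosets(H) == right_cosets(H)

def coset_product_well_defined(H):
    """Check that (aH)(bH) := (ab)H does not depend on the representatives."""
    for A in left_cosets(H):
        for B in left_cosets(H):
            targets = {frozenset(compose(compose(a, b), h) for h in H)
                       for a in A for b in B}
            if len(targets) != 1:
                return False
    return True

e = (0, 1, 2)
A3 = [e, (1, 2, 0), (2, 0, 1)]            # the rotations: a normal subgroup
T = [e, (1, 0, 2)]                        # generated by a single transposition

print(is_normal(A3), coset_product_well_defined(A3))
print(is_normal(T), coset_product_well_defined(T))
```

For the rotation subgroup both checks succeed; for the subgroup of order 2 both fail, in line with the fact that the quotient construction needs normality.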


By Proposition 7.3, the quotient function 

\[ \pi : G \to G/H \]

sending $g \in G$ to gH = Hg is a group homomorphism and is universal with respect
to group homomorphisms $\varphi : G \to G'$ such that $aH = bH \implies \varphi(a) = \varphi(b)$. This
universal property is extremely useful, so we will grace it with theorem status:

Theorem 7.12. Let H be a normal subgroup of a group G. Then for every group
homomorphism $\varphi : G \to G'$ such that $H \subseteq \ker\varphi$ there exists a unique group
homomorphism $\tilde\varphi : G/H \to G'$ so that the diagram

        G -------φ-------> G'
         \                ^
        π \              / ∃! φ̃
           v            /
             G/H

commutes.

Proof. We only need to match the stated universal property with the one we 
proved in Proposition 7.3, and indeed, 

\[ H \subseteq \ker\varphi \iff (\forall h \in H): \varphi(h) = e_{G'} \]

is equivalent to

\[ (\forall a, b \in G): ab^{-1} \in H \implies \varphi(ab^{-1}) = e_{G'}, \]




³² In a large display we sometimes use the full 'fraction' notation $\frac{G}{H}$.

that is, to

\[ (\forall a, b \in G): ab^{-1} \in H \implies \varphi(a) = \varphi(b) \]
and finally, keeping in mind how the relation $\sim$ corresponding to H is defined,

\[ (\forall a, b \in G): a \sim b \implies \varphi(a) = \varphi(b), \]

the condition giving the universal property in Proposition 7.3. □
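The universal property can be watched at work on a small case. The sketch below is only an illustration under assumed choices: G = Z/12Z written additively, the homomorphism `phi` to Z/2Z given by reduction mod 2, and two candidate subgroups. The function `induced_map` (a hypothetical helper) tries to define the induced map on cosets and reports failure when `phi` is not constant on some coset.

```python
n = 12                      # G = Z/12Z, written additively

def phi(x):
    """The homomorphism Z/12Z -> Z/2Z, x |-> x mod 2 (an assumed example)."""
    return x % 2

def coset(a, H):
    """The coset a + H in Z/12Z."""
    return frozenset((a + h) % n for h in H)

def induced_map(H):
    """Return the induced map G/H -> Z/2Z as a dict from cosets to values,
    or None when phi is not constant on some coset (so no such map exists)."""
    table = {}
    for a in range(n):
        C = coset(a, H)
        values = {phi(x) for x in C}
        if len(values) != 1:
            return None
        table[C] = phi(a)
    return table

H_inside = [0, 4, 8]        # contained in ker(phi), the even residues
H_outside = [0, 3, 6, 9]    # not contained in ker(phi)

phi_tilde = induced_map(H_inside)
# The triangle commutes: phi_tilde(pi(a)) == phi(a) for every a.
commutes = all(phi_tilde[coset(a, H_inside)] == phi(a) for a in range(n))
# phi_tilde respects the operation on cosets: (a + H) + (b + H) = (a + b) + H.
is_hom = all(
    phi_tilde[coset((a + b) % n, H_inside)]
    == (phi_tilde[coset(a, H_inside)] + phi_tilde[coset(b, H_inside)]) % 2
    for a in range(n) for b in range(n)
)
print(len(phi_tilde), commutes, is_hom, induced_map(H_outside))
```

Uniqueness is visible in the construction: the value on a coset is forced by the requirement that the triangle commute.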

7.5. Example. The reader is already very familiar with an important class of
examples: the cyclic groups Z/nZ. Indeed, in §2.3 we defined Z/nZ as the set of
equivalence classes in Z with respect to the congruence equivalence relation

\[ (\forall a, b \in \mathbb{Z}): a \equiv b \mod n \iff n \mid (b - a). \]

Now we recognize that $n \mid (b - a)$ is equivalent to

\[ b - a \in n\mathbb{Z}, \]

which is the relation $\sim_L$ corresponding (in ‘abelian’ notation) to the subgroup nZ
of Z. This subgroup is of course normal, since Z is abelian. The ‘congruence classes
mod n’ are nothing but the cosets of the subgroup nZ in Z; using abelian notation
for cosets, we could write

\[ [a]_n = a + (n\mathbb{Z}). \]

Of course the operation defined on Z/nZ in §2.3 matches precisely the one defined
above for quotient groups. This justifies the notation Z/nZ introduced in §2.3.

The reader can already appreciate in this simple context the usefulness of
Theorem 7.12. Let $g \in G$ be an element of order n and consider the exponential map

\[ \epsilon_g : \mathbb{Z} \to G, \quad N \mapsto g^N. \]

By Corollary 1.11,

\[ \ker\epsilon_g = \{N \in \mathbb{Z} \mid N \text{ is a multiple of } |g|\} = n\mathbb{Z}. \]

Theorem 7.12 then implies right away that $\epsilon_g$ factors through the quotient:

\[ \mathbb{Z} \xrightarrow{\ \pi\ } \mathbb{Z}/n\mathbb{Z} \xrightarrow{\ \exists!\ } G. \]

That is, there is an induced map

\[ \mathbb{Z}/n\mathbb{Z} \to \langle g \rangle. \]

In fact, the ‘canonical decomposition’ of §I.2.8 implies that this is an isomorphism
(verifying that $\langle g \rangle$ is cyclic in the sense of Definition 4.7, as the reader should have
checked ‘by hand’ already in Exercise 6.4). We will formalize this observation in
general in the next section.

Also note that $|g| = n = |\langle g \rangle|$ in this case.
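The factorization of the exponential map can be checked numerically. The following sketch assumes, purely as an example, the group of units modulo 13 and the element g = 5; the names `order` and `eps` are illustrative. It verifies that the exponential map is constant on cosets of nZ and that the induced map on Z/nZ is injective, hence a bijection onto the subgroup generated by g.

```python
def order(g, m):
    """Multiplicative order of g modulo m (assumes gcd(g, m) == 1)."""
    k, x = 1, g % m
    while x != 1:
        x = (x * g) % m
        k += 1
    return k

m, g = 13, 5
n = order(g, m)

def eps(N):
    """Exponential map N |-> g^N in (Z/13Z)^*, for N >= 0."""
    return pow(g, N, m)

# eps is constant on cosets of nZ, so it factors through Z/nZ ...
well_defined = all(eps(N) == eps(N + n * t) for N in range(n) for t in range(5))
# ... and the induced map on Z/nZ takes n distinct values.
image = {eps(r) for r in range(n)}
print(n, well_defined, len(image) == n, sorted(image))
```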
7.6. kernel ⟺ normal. If H is a normal subgroup, we have now constructed
in gory detail a group G/H and a surjective homomorphism

\[ \pi : G \to G/H. \]

What is the kernel of $\pi$? The identity of G/H is the coset $e_G H$, that is, H itself.
Therefore

\[ \ker\pi = \{g \in G \mid gH = H\} = H. \]

This observation completes the circle of ideas begun in §7.1: there we had noticed
that every kernel (of a group homomorphism) is a normal subgroup; and now
we have verified that every normal subgroup is in fact a kernel (of some group
homomorphism). We encapsulate this in the slogan

    kernel ⟺ normal:

in group theory³³, ‘kernel’ and ‘normal subgroup’ are equivalent concepts.

For example, every subgroup in an abelian group is the kernel of some homo- 
morphism: yet another indication that life is simpler in Ab than in Grp. 


Exercises 


7.1. ▷ List all subgroups of $S_3$ (cf. Exercise 6.13) and determine which subgroups
are normal and which are not normal. [§7.1]

7.2. Is the image of a group homomorphism necessarily a normal subgroup of the 
target? 

7.3. ▷ Verify that the equivalent conditions for normality given in §7.1 are indeed
equivalent. [§7.1]

7.4. Prove that the relation defined in Exercise 5.10 on a free abelian group
$F = F^{ab}(A)$ is compatible with the group structure. Determine the quotient $F/{\sim}$ as a
better known group.

7.5. ¬ Define an equivalence relation $\sim$ on $SL_2(\mathbb{Z})$ by letting $A \sim A' \iff A' = \pm A$.
Prove that $\sim$ is compatible with the group structure. The quotient $SL_2(\mathbb{Z})/{\sim}$
is denoted $PSL_2(\mathbb{Z})$ and is called the modular group; it would be a serious contender
in a contest for ‘the most important group in mathematics’, due to its role in
algebraic geometry and number theory. Prove that $PSL_2(\mathbb{Z})$ is generated by the
(cosets of the) matrices

\[ \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 1 & -1 \\ 1 & 0 \end{pmatrix}. \]

(You will not need to work very hard, if you use the result of Exercise 6.10.) Note
that the first has order 2 in $PSL_2(\mathbb{Z})$, the second has order 3, and their product has
infinite order. [9.14]

³³ We will run into analogous observations in ring theory, where we will verify that kernels
and ideals coincide, and for modules, as kernels and submodules again coincide.

7.6. Let G be a group, and let n be a positive integer. Consider the relation

\[ a \sim b \iff (\exists g \in G)\ ab^{-1} = g^n. \]

• Show that in general ~ is not an equivalence relation. 

• Prove that ~ is an equivalence relation if G is commutative, and determine the 
corresponding subgroup of G. 

7.7. Let G be a group, n a positive integer, and let $H \subseteq G$ be the subgroup
generated by all elements of order n in G. Prove that H is normal.

7.8. ▷ Prove Proposition 7.6. [§7.3]

7.9. State and prove the ‘mirror’ statements of Propositions 7.4 and 7.6, leading
to the description of relations satisfying (††).

7.10. ¬ Let G be a group, and $H \subseteq G$ a subgroup. With notation as in Exercise 6.7,
show that H is normal in G if and only if $\forall \gamma \in \operatorname{Inn}(G)$, $\gamma(H) \subseteq H$.

Conclude that if H is normal in G, then there is an interesting homomorphism
$\operatorname{Inn}(G) \to \operatorname{Aut}(H)$. [8.25]

7.11. ¬ Let G be a group, and let [G, G] be the subgroup of G generated by all
elements of the form $aba^{-1}b^{-1}$. (This is the commutator subgroup of G; we will
return to it in §IV.3.3.) Prove that [G, G] is normal in G. (Hint: With notation
as in Exercise 4.8, $g \cdot aba^{-1}b^{-1} \cdot g^{-1} = \gamma_g(aba^{-1}b^{-1})$.) Prove that G/[G, G] is
commutative. [7.12, §IV.3.3]

7.12. ▷ Let F = F(A) be a free group, and let $f : A \to G$ be a set-function
from the set A to a commutative group G. Prove that f induces a unique
homomorphism $F/[F, F] \to G$, where [F, F] is the commutator subgroup of F defined
in Exercise 7.11. (Use Theorem 7.12.) Conclude that $F/[F, F] \cong F^{ab}(A)$. (Use
Proposition I.5.4.) [§6.4, 7.13, VI.1.20]

7.13. ¬ Let A, B be sets and F(A), F(B) the corresponding free groups. Assume
$F(A) \cong F(B)$. If A is finite, prove that B is also and $A \cong B$. (Use Exercise 7.12
to upgrade Exercise 5.10.) [5.10, VI.1.20]


8. Canonical decomposition and Lagrange’s 
theorem 

We will collect in this section a number of observations on the structure of quotient 
groups. All these results are straightforward, given the background work done so 
far. Some of them are often given fancy names such as first isomorphism theorem 
in the literature; we are not too fond of such terminology: the universal property 
proven in Theorem 7.12 is really the only thing we need to take along, and it serves 
us wonderfully well. The ‘isomorphism theorems’ are all immediate applications of 
this universal property.

8.1. Canonical decomposition. The first observation comes from the canonical
decomposition for set-functions, obtained in §I.2.8: every set-function may be
viewed as the composition of a surjective map, followed by a bijective map, followed
by an injective map. We now know enough to state the corresponding (very useful)
results in Grp:

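Theorem 8.1. Every group homomorphism $\varphi : G \to G'$ may be decomposed as
follows:

\[ G \twoheadrightarrow G/\ker\varphi \xrightarrow[\ \sim\ ]{\ \tilde\varphi\ } \operatorname{im}\varphi \hookrightarrow G', \]

where the isomorphism $\tilde\varphi$ in the middle is the homomorphism induced by $\varphi$ (as in
Theorem 7.12).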

It is important that the reader agree that we have already proved anything
that deserves to be proven here. We know that the projection on the left and the
inclusion on the right are homomorphisms and $\tilde\varphi$ comes from Theorem 7.12. The
decomposition is the same one obtained at the level of set-functions in §I.2.8; in
particular, the function in the middle is a bijection. Since bijective homomorphisms
are isomorphisms (Proposition 4.3), it is an isomorphism.

Theorem 8.1 should induce the following Pavlovian reaction: exposed to any
group homomorphism $\varphi : G \to G'$, the reader should instantaneously view $G/\ker\varphi$
as (canonically identified with) a subgroup of G'. What is usually called the ‘first
isomorphism theorem’ is the particular case corresponding to surjective
homomorphisms:

Corollary 8.2. Suppose $\varphi : G \to G'$ is a surjective group homomorphism. Then

\[ G' \cong \frac{G}{\ker\varphi}. \]

Proof. $\operatorname{im}\varphi = G'$ in Theorem 8.1. □

This result is very useful — it comes in extremely handy when proving that two 
groups are isomorphic, both in theoretical contexts (as we will see in the rest of 
this section) and in concrete instances. 

Example 8.3. If $H_1 \subseteq G_1$ and $H_2 \subseteq G_2$, then the product $H_1 \times H_2$ may be
viewed as a subset of $G_1 \times G_2$. It is clear that if $G_1$, $G_2$ are groups and $H_1$, $H_2$
are subgroups, then $H_1 \times H_2$ is a subgroup of $G_1 \times G_2$. The following claim is a
prototype application of Corollary 8.2:

Claim 8.4. If $H_1 \subseteq G_1$ and $H_2 \subseteq G_2$ are normal subgroups, then $H_1 \times H_2$ is a
normal subgroup of the group $G_1 \times G_2$ and

\[ \frac{G_1 \times G_2}{H_1 \times H_2} \cong \frac{G_1}{H_1} \times \frac{G_2}{H_2}. \]

Indeed, composing the projections

\[ \pi_1 : G_1 \times G_2 \to G_1, \qquad \pi_2 : G_1 \times G_2 \to G_2 \]

with the morphisms to the quotients gives surjective homomorphisms

\[ \pi'_1 : G_1 \times G_2 \to \frac{G_1}{H_1}, \qquad \pi'_2 : G_1 \times G_2 \to \frac{G_2}{H_2} \]

and hence a homomorphism

\[ \pi : G_1 \times G_2 \to \frac{G_1}{H_1} \times \frac{G_2}{H_2} \]

by the universal property of products. Explicitly,

\[ \pi(g_1, g_2) = (g_1 H_1, g_2 H_2); \]
in particular, $\pi$ is surjective and

\[ \ker\pi = \{(g_1, g_2) \in G_1 \times G_2 \mid (g_1 H_1, g_2 H_2) = (H_1, H_2)\} \]
\[ = \{(g_1, g_2) \in G_1 \times G_2 \mid g_1 \in H_1,\ g_2 \in H_2\} \]
\[ = H_1 \times H_2. \]

The claim then follows immediately from Corollary 8.2.

The result (of course) extends to more factors in the product. Any such check
should become second nature and is usually left to the reader. ⌟


Example 8.5. As a particular case of Claim 8.4, take $H_1 = \{e_{G_1}\} \subseteq G_1$ and
$H_2 = G_2 \subseteq G_2$:

\[ \frac{G_1 \times G_2}{G_2} \cong \frac{G_1}{\{e_{G_1}\}} \times \frac{G_2}{G_2} \cong G_1, \]

where on the left we identify $G_2$ with the subgroup $\{e_{G_1}\} \times G_2$. For instance³⁴
(cf. §4.1)

\[ \frac{C_6}{C_3} \cong \frac{C_2 \times C_3}{C_3} \cong C_2. \]

Example 8.6. The cyclic group $C_3$ may be viewed as a subgroup of the dihedral
group $D_6$: the rotations of a triangle give a copy of $C_3$ inside $D_6$. Then $C_3$ is
normal in $D_6$, and

\[ \frac{D_6}{C_3} \cong C_2. \]

This can of course be checked ‘by hand’. But note that there is an evident surjective
homomorphism $D_6 \to C_2$, whose kernel is $C_3$: map an element $\sigma$ of $D_6$ to the
identity in $C_2$ if it does not flip the triangle (that is, precisely when $\sigma \in C_3$),
and map it to the other element if it does. Corollary 8.2 implies the stated facts
immediately. ⌟
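Example 8.6 can be verified mechanically. The sketch below assumes the usual identification of the symmetries of a triangle with permutations of its three vertices, so that ‘flips’ correspond to odd permutations; the function names are illustrative. It checks that the sign map is a homomorphism, computes its kernel, and confirms that the cosets of the kernel match the image.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def sign(p):
    """+1 if p has an even number of inversions (no flip), -1 otherwise."""
    inv = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return 1 if inv % 2 == 0 else -1

D6 = list(permutations(range(3)))         # symmetries of a triangle, as vertex permutations

is_hom = all(sign(compose(p, q)) == sign(p) * sign(q) for p in D6 for q in D6)
kernel = sorted(p for p in D6 if sign(p) == 1)
image = {sign(p) for p in D6}
cosets = {frozenset(compose(g, k) for k in kernel) for g in D6}

print(is_hom, kernel, len(cosets) == len(image))
```

The kernel consists of the three rotations, and the two cosets correspond to the two elements of the target, as Corollary 8.2 predicts.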

Example 8.7. One can give a circle (denoted $S^1$) a group structure by identifying
its points with rotations of a plane about a point and adding them accordingly.
The function

\[ \rho : \mathbb{R} \to S^1 \]

mapping a number r to the result of a rotation by $2\pi r$ radians is then a surjective
group homomorphism; this is the identity precisely when we rotate by an integer
multiple of $2\pi$. Hence

\[ \ker\rho = \mathbb{Z} \subseteq \mathbb{R}. \]

By Corollary 8.2, therefore,

\[ \frac{\mathbb{R}}{\mathbb{Z}} \cong S^1. \]

(Cf. Exercise I.1.6.) Geometrically, this amounts to ‘wrapping’ R infinitely many
times around the circle, realizing R as the ‘universal cover’ of $S^1$; here, Z plays the
role of ‘fundamental group’ of $S^1$. ⌟

³⁴ Abuses of language such as the formula which follows (in which one is not explicitly
specifying how to realize $C_3$ as a subgroup of $C_6$, because there is really only one way to
do it) are unfortunately commonplace.

8.2. Presentations. Every group is a quotient of a free group, and every abelian 
group is a quotient of a free abelian group. Indeed, every group G can be surjected 
upon by a free group, and in many ways (at the very least, F(G) will do!). Abelian 
groups may be likewise surjected upon by free abelian groups. Then Corollary 8.2 
produces the needed isomorphism of G with a quotient of a free group. 

A presentation of a group G is an explicit isomorphism

\[ G \cong \frac{F(A)}{R}, \]

where A is a set and R is a subgroup of ‘relations’. In other words, a presentation
is an explicit surjection

\[ \rho : F(A) \twoheadrightarrow G \]

of which R is the kernel. This is especially useful if A is small, and R may be
described very explicitly; usually this is done by listing ‘enough’ relations, that is,
a set $\mathscr{R}$ of words $r_\alpha \in R = \ker\rho$ generating it in the sense that R is the smallest
normal subgroup³⁵ of F(A) containing $\mathscr{R}$.

Thus, a presentation of a group G is usually encoded as a pair $(A \mid \mathscr{R})$, where A
is a set and $\mathscr{R} \subseteq F(A)$ is a set of words, such that $G \cong F(A)/R$ with R as above.

A group is finitely presented if it admits a presentation $(A \mid \mathscr{R})$ in which both
A and $\mathscr{R}$ are finite. Finitely presented groups are not (necessarily) ‘small’: for
example, the free group on finitely many generators is (trivially) finitely presented.

We have already run into several examples of presentations. For instance,
the free group F(A) is presented by $(A \mid \emptyset)$. More interestingly, the description of
$S_3$ given in §2.1 ‘presents’ $S_3$ as a quotient of the free group $F(\{x, y\})$ (cf. Example 5.3)
by the smallest normal subgroup containing $x^2$, $y^3$, and $yx = xy^2$:
$(x, y \mid x^2, y^3, xyxy)$ in shorthand. From this point of view, it is clear that groups
admitting the same presentation (example: $S_3$ and $D_6$) are isomorphic.

The situation is less idyllic than it may seem at first, though: even if a presentation
of a group G is known, it may be very hard to establish whether two explicit
combinations of the generators coincide in G. This is known as the word problem,
and it has been shown to be undecidable in general³⁶.

³⁵ Note that this is a different requirement than the one adopted in §6.3, in which normality
plays no role.

In any case, now that we know about presentations of groups, finding coprod- 
ucts in Grp should be straightforward: see Exercise 8.7. 

There is a ‘mirror’ statement analogous to the fact that all groups are quotients 
of free groups: every group may be realized as a subgroup of a symmetric group. 
This elementary observation goes under the name of Cayley’s theorem; its natural 
place is within the discussion of group actions (cf. Theorem 9.5). 

8.3. Subgroups of quotients. The lattice of subgroups (cf. §6.4) of a quo- 
tient G/H can be described very explicitly in terms of the lattice of subgroups 
of the group G: simply keep the part of the lattice of subgroups of G corresponding 
to subgroups which contain H. 

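Example 8.8. Here is the effect of this operation on the lattice of subgroups of
$C_{12} = \mathbb{Z}/12\mathbb{Z}$ (labeled by generators; cf. §6.4), after quotienting by $H = \langle [6] \rangle \cong C_2$:

[Diagram: the lattice of subgroups of $C_{12}$, with top element $\langle [1] \rangle$; only the part consisting of the subgroups containing H is kept.]

The result matches the lattice of subgroups of $C_6 \cong C_{12}/C_2$. ⌟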

Here is why this works. First note that if $H \subseteq K$ are subgroups of a group G
and H is normal in G, then H is normal in K.

Proposition 8.9. Let H be a normal subgroup of a group G. Then for every
subgroup K of G containing H, K/H may be identified with a subgroup of G/H.
The function

\[ u : \{\text{subgroups } K \text{ of } G \text{ containing } H\} \to \{\text{subgroups of } G/H\} \]
defined by u(K) = K/H is a bijection preserving inclusions.

Proof. The group K/H consists of the cosets $aH \in G/H$ with $a \in K$, and in
this sense it is a subset (and clearly a subgroup) of G/H. It is also clear that if
$H \subseteq K \subseteq L$, then $u(K) = K/H \subseteq L/H = u(L)$; that is, u preserves inclusions.

³⁶ That is, there is no general algorithm that, given a presentation of a group G and two
words in the generators, will establish (in a finite time) whether those two words represent the
same element of G.

Thus we simply have to verify that u is a bijection, and for this it suffices to
produce an inverse function

\[ v : \{\text{subgroups of } G/H\} \to \{\text{subgroups } K \text{ of } G \text{ containing } H\}. \]

Then let K' be a subgroup of G/H; define v(K') to be the subset of G:

\[ K := \pi^{-1}(K') = \{a \in G \mid aH \in K'\}, \]

where $\pi : G \to G/H$ is the canonical projection. Then K is a subgroup of G (by
Lemma 6.4) and contains H (because $H = \pi^{-1}(e)$ and $e \in K'$). The reader will
check that u and v are inverses of each other. □
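The correspondence of Proposition 8.9 can be computed directly in the situation of Example 8.8. This is a small sketch under stated assumptions: Z/12Z is modeled by the residues 0..11, the quotient by H = {0, 6} is identified with Z/6Z through reduction mod 6, and the helper names are illustrative.

```python
def subgroups_of_cyclic(n):
    """All subgroups of Z/nZ as frozensets: one subgroup <d> for each divisor d of n."""
    return {frozenset(range(0, n, d)) for d in range(1, n + 1) if n % d == 0}

H = frozenset({0, 6})                                   # H = <[6]> inside Z/12Z
above_H = [K for K in subgroups_of_cyclic(12) if H <= K]

def u(K):
    """K/H, viewed inside (Z/12Z)/H = Z/6Z via a + H |-> a mod 6."""
    return frozenset(a % 6 for a in K)

images = {u(K) for K in above_H}
is_bijection = len(images) == len(above_H) and images == subgroups_of_cyclic(6)
preserves_inclusions = all(u(K) <= u(L) for K in above_H for L in above_H if K <= L)

print(len(above_H), is_bijection, preserves_inclusions)
```

Four subgroups of Z/12Z contain H, and they map bijectively onto the four subgroups of Z/6Z.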

In fact, the correspondence is even nicer, in the sense that it preserves normal- 
ity. The following statement is often called the third isomorphism theorem: 

Proposition 8.10. Let H be a normal subgroup of a group G, and let N be a
subgroup of G containing H. Then N/H is normal in G/H if and only if N is
normal in G, and in this case

\[ \frac{G/H}{N/H} \cong \frac{G}{N}. \]

Proof. If N is normal, then consider the projection

\[ G \to \frac{G}{N}: \]

the subgroup H is contained in N, which is the kernel of this homomorphism, so
we get (by the universal property of quotients, Theorem 7.12) an induced
homomorphism

\[ \frac{G}{H} \to \frac{G}{N}. \]

The subgroup N/H of G/H is the kernel of this homomorphism; therefore it is
normal.

Conversely, if N/H is normal in G/H, consider the composition

\[ G \to \frac{G}{H} \to \frac{G/H}{N/H}. \]

The kernel of this homomorphism is N; therefore N is normal. Further, this
homomorphism is surjective; hence the stated isomorphism $(G/H)/(N/H) \cong G/N$
follows immediately from Corollary 8.2. □

8.4. HK/H vs. K/(H ∩ K). Section 8.3 deals with two ‘nested’ subgroups $H \subseteq K$
of a group G. What if H, K are not nested?

The notation introduced in §7.1 extends to subsets of G: if $A \subseteq G$, $B \subseteq G$,
then AB denotes the subset

\[ AB := \{ab \mid a \in A,\ b \in B\}. \]

It would be nice if HK were guaranteed to be a subgroup of G as soon as H and K 
are subgroups, but this is simply not the case in general, if G is not commutative. 
It is, however, the case if one of the subgroups is normal. The following is often 
called the second isomorphism theorem.

Proposition 8.11. Let H, K be subgroups of a group G, and assume that H is
normal in G. Then

• HK is a subgroup of G, and H is normal in HK;

• $H \cap K$ is normal in K, and

\[ \frac{HK}{H} \cong \frac{K}{H \cap K}. \]

Proof. To verify that HK is a subgroup of G when H is normal, note that HK is
the union of all cosets Hk, with $k \in K$; that is,

\[ HK = \pi^{-1}(\pi(K)), \]

where $\pi : G \to G/H$ is the canonical projection. Since $\pi(K)$ is a subgroup of G/H,
HK is a subgroup by Lemma 6.4. It is clear that H is normal in HK.

For the second part, consider the homomorphism

\[ \varphi : K \to HK/H \]

sending $k \in K$ to the coset Hk (that is, the inclusion $K \hookrightarrow HK$ followed by the
canonical projection to the quotient). This is surjective: indeed, every element of
HK/H may be written as a coset

\[ Hhk, \quad h \in H,\ k \in K; \]

but Hhk = Hk, so $Hhk = \varphi(k)$ is in the image of $\varphi$. By the omnipresent
Corollary 8.2,

\[ \frac{HK}{H} \cong \frac{K}{\ker\varphi}. \]

What is $\ker\varphi$?

\[ \ker\varphi = \{k \in K \mid \varphi(k) = e\} = \{k \in K \mid Hk = H\} = \{k \in K \mid k \in H\} = H \cap K, \]

with the stated result. □
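A brute-force check of Proposition 8.11 in a small nonabelian case may help fix ideas. The sketch below assumes G = S4 acting on {0, 1, 2, 3}, H the Klein four-group of double transpositions, and K the subgroup fixing the point 3; these choices and the helper names are illustrative only.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

S4 = list(permutations(range(4)))
e = (0, 1, 2, 3)
H = {e, (1, 0, 3, 2), (2, 3, 0, 1), (3, 2, 1, 0)}   # double transpositions and identity
K = {p for p in S4 if p[3] == 3}                     # a copy of S3, not normal in S4

# H is normal in S4: gH = Hg for every g.
H_is_normal = all({compose(g, h) for h in H} == {compose(h, g) for h in H} for g in S4)

HK = {compose(h, k) for h in H for k in K}
HK_is_closed = all(compose(x, y) in HK for x in HK for y in HK)

def coset_H(x):
    """The coset Hx."""
    return frozenset(compose(h, x) for h in H)

# Kernel of k |-> Hk, and the two quotients compared in the proposition.
kernel = {k for k in K if coset_H(k) == frozenset(H)}
HK_mod_H = {coset_H(x) for x in HK}
K_mod_kernel = {frozenset(compose(k, j) for j in kernel) for k in K}

print(H_is_normal, HK_is_closed, kernel == H & K, len(HK), len(HK_mod_H), len(K_mod_kernel))
```

Here HK is all of S4, the intersection of H and K is trivial, and both quotients have six elements.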


8.5. The index and Lagrange’s theorem. The notation G/H is used to denote
the set of left-cosets³⁷ of H, even when H is not normal in G. Thus G/H is a set
in general, and it is a group when H is in fact normal in G.

Definition 8.12. The index of H in G, denoted [G : H], is the number of elements
|G/H| of G/H, when this is finite, and ∞ otherwise. ⌟

Thus, [G : H] (if finite) denotes the number of left-cosets of H in G, regardless
of whether H is normal in G.

Lemma 8.13. Let H be a subgroup of a group G. Then $\forall g \in G$ the functions

\[ H \to gH, \quad h \mapsto gh, \]

\[ H \to Hg, \quad h \mapsto hg \]

are bijections.

³⁷ This may seem an arbitrary choice (why not right-cosets?). It is. Writing from left to right
gives us a bias towards left-actions, and G acts nicely on the left on the set of left-cosets; this will
make better sense when we get to Example 9.4. In any case, there is a bijection between the set
of left-cosets and the set of right-cosets: Exercise 9.10.

Proof. Both functions are surjective by definition of coset. Cancellation implies
that they are injective. □

Corollary 8.14 (Lagrange’s theorem). If G is a finite group and $H \subseteq G$ is a
subgroup, then $|G| = [G : H] \cdot |H|$. In particular, |H| is a divisor of |G|.

Proof. Indeed, G is the disjoint union of |G/H| distinct cosets gH, and |gH| = |H|
by Lemma 8.13. □
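The counting argument behind Lagrange's theorem can be watched on a small example. The sketch below assumes G = S4 and takes H to be the subgroup fixing one point (not a normal subgroup); names are illustrative. It checks that all cosets have the size of H and that the left-cosets partition G.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

G = list(permutations(range(4)))           # S4, of order 24
H = [p for p in G if p[3] == 3]            # the subgroup fixing 3, of order 6

left = {frozenset(compose(g, h) for h in H) for g in G}
right = {frozenset(compose(h, g) for h in H) for g in G}

equal_sizes = all(len(C) == len(H) for C in left | right)
partition = sum(len(C) for C in left) == len(G) and set().union(*left) == set(G)

print(len(G), len(left), len(right), len(H), equal_sizes, partition)
```

The index is 4 whether one counts left- or right-cosets, even though the two partitions differ for this H.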


Lagrange’s theorem is more useful than it may appear at first. 

Example 8.15. The order |g| of any element g of a finite group G is a divisor of
|G|: indeed, |g| equals the order of the subgroup $\langle g \rangle$ generated by g.

Note: Therefore, $g^{|G|} = e_G$ for all finite groups G, all $g \in G$. ⌟

Example 8.16. If |G| is a prime integer p, then necessarily $G \cong \mathbb{Z}/p\mathbb{Z}$.

Indeed, let $g \in G$ be any element other than the identity; then $\langle g \rangle$ is a subgroup
of G, of order > 1. By Lagrange’s theorem, $|\langle g \rangle| = p = |G|$; that is, $G = \langle g \rangle$ is
cyclic of order p, as claimed. ⌟

Example 8.17 (Fermat’s little theorem). Let p be a prime integer, and let a be
any integer. Then $a^p \equiv a \mod p$.

Indeed, this is immediate if a is a multiple of p; if a is not a multiple of p, then
the class $[a]_p$ modulo p is nonzero, so it is an element of the group $(\mathbb{Z}/p\mathbb{Z})^*$, which
has order p − 1. Thus

\[ [a]_p^{p-1} = [1]_p \]

(Example 8.15); hence $[a]_p^p = [a]_p$ as claimed. ⌟
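Examples 8.15 and 8.17 lend themselves to a quick numerical check. In this sketch the helper names `order_mod` and `check_prime` and the list of primes are assumptions for illustration; the check of Fermat's congruence covers only a finite range of integers a.

```python
def order_mod(a, p):
    """Order of the class of a in (Z/pZ)^* (assumes p does not divide a)."""
    k, x = 1, a % p
    while x != 1:
        x = (x * a) % p
        k += 1
    return k

def check_prime(p):
    # Example 8.15: every order divides |(Z/pZ)^*| = p - 1.
    orders_divide = all((p - 1) % order_mod(a, p) == 0 for a in range(1, p))
    # Example 8.17: a^p = a mod p (checked for 0 <= a < 3p).
    fermat = all(pow(a, p, p) == a % p for a in range(0, 3 * p))
    return orders_divide and fermat

results = [check_prime(p) for p in (2, 3, 5, 7, 11, 13)]
print(results)

# For a non-prime modulus the congruence can fail: 2^4 is not congruent to 2 modulo 4.
composite_case = pow(2, 4, 4) == 2 % 4
print(composite_case)
```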


Warning: However, do not ask too much of Lagrange’s theorem. For example,
it does not say that if d is a divisor of |G|, then there exists a subgroup of G of
order d (the smallest counterexample is $A_4$, a group of order 12, which does not
contain subgroups of order 6; the reader will verify this in Exercise IV.4.17); it does
not even say that if p is a prime divisor of |G|, then there is an element of order
p in G. This latter statement happens to be true, but for ‘deeper’ reasons. The
abelian case is easy (cf. Exercise 8.17). The general case is called Cauchy’s theorem,
and we will deal with it later on (cf. Theorem IV.2.1).
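The counterexample mentioned in the warning is small enough to confirm by exhaustive search. The sketch below is a brute-force illustration, not a proof strategy: it models $A_4$ as the even permutations of four points and tests every subset of admissible size containing the identity. The function names are assumptions.

```python
from itertools import combinations, permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def sign(p):
    inv = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return 1 if inv % 2 == 0 else -1

A4 = [p for p in permutations(range(4)) if sign(p) == 1]
e = tuple(range(4))

def is_subgroup(S):
    """A nonempty finite subset closed under the operation is a subgroup."""
    return all(compose(x, y) in S for x in S for y in S)

def subgroup_orders(G, e):
    """Sorted list of the orders that actually occur among subgroups of G."""
    others = [g for g in G if g != e]
    found = set()
    for size in range(1, len(G) + 1):
        if len(G) % size != 0:          # by Lagrange, other sizes cannot occur
            continue
        for rest in combinations(others, size - 1):
            if is_subgroup(set(rest) | {e}):
                found.add(size)
                break
    return sorted(found)

orders = subgroup_orders(A4, e)
print(len(A4), orders)
```

The divisor 6 of 12 is missing from the list, while every other divisor occurs.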

The index is a well-behaved invariant. It is clearly multiplicative, in the sense
that if $H \subseteq K$ are subgroups of G, then

\[ [G : H] = [G : K] \cdot [K : H], \]

provided that these numbers are finite. Also, if H and K are subgroups of G and H
is normal (so that HK is a subgroup as well; cf. Proposition 8.11), then

\[ |HK| = \frac{|H| \cdot |K|}{|H \cap K|} \]

(again, provided this has a chance of making sense, that is, if the orders are finite):
this follows immediately from the isomorphism in Proposition 8.11 and index
considerations. In fact, the formula holds even without assuming that one of the
subgroups is normal in G. Do you see why? (Exercise 8.21.)

Further, if H and G are finite, then Lemma 8.13 implies immediately that the
index of H in G, defined as the number of left-cosets of H in G, also equals the
number of right-cosets of H. It is in fact easy to show that there always is a bijection
between the set G/H of left-cosets and the set of right-cosets (cf. Exercise 9.10),
regardless of finiteness hypotheses. The set of right-cosets of H in G is often
(reasonably) denoted H\G.

8.6. Epimorphisms and cokernels. The reader may expect that a mirror statement
to Proposition 6.12 should hold for group epimorphisms. This is almost true:
a homomorphism $\varphi : G \to H$ is an epimorphism (in the category Grp) if and only
if it is surjective. However, while one implication is easy, the proofs we know for
epimorphism ⟹ surjective in Grp are somewhat cumbersome.

The situation is leaner (as usual) in Ab: there is in Ab a good notion of cokernel;
this is part of what makes Ab an ‘abelian category’.

As is often the case, the reader may now want to pause a moment and try
to guess the right definition. Keeping in mind the universal property for kernels
(Proposition 6.6), can the reader come up with the universal property defining
‘cokernels’? Can the reader prove that these exist in Ab and detect epimorphisms?
Don’t look ahead!

Here is how the story goes. The universal property is (of course) obtained by
reversing the arrows in the property for kernels: given a homomorphism $\varphi : G \to G'$
of abelian groups, we want an abelian group $\operatorname{coker}\varphi$ equipped with a homomorphism

\[ \pi : G' \to \operatorname{coker}\varphi \]

which is initial with respect to all morphisms $\alpha$ such that $\alpha \circ \varphi = 0$. That is, every
homomorphism $\alpha : G' \to L$ such that $\alpha \circ \varphi$ is the trivial map must factor (uniquely)
through $\operatorname{coker}\varphi$:

\[ G \xrightarrow{\ \varphi\ } G' \xrightarrow{\ \pi\ } \operatorname{coker}\varphi \xrightarrow{\ \exists!\,\bar\alpha\ } L, \qquad \bar\alpha \circ \pi = \alpha, \quad \alpha \circ \varphi = 0. \]

Cokernels exist in Ab: the image of $\varphi$ is a subgroup of G', hence a normal
subgroup of G' since G' is abelian; the condition that $\alpha \circ \varphi$ is trivial says that
$\operatorname{im}\varphi \subseteq \ker\alpha$, and hence

\[ \operatorname{coker}\varphi := \frac{G'}{\operatorname{im}\varphi} \]

satisfies the universal property, by Theorem 7.12.

The ‘problem’ in Grp is that $\operatorname{im}\varphi$ is not guaranteed to be normal in G'; thus
the situation is more complex.

Also note that, in the abelian case, $G'/\operatorname{im}\varphi$ automatically satisfies a stronger
universal property: as stated, but with respect to any set-function $G' \to L$ which
is constant on cosets of $\operatorname{im}\varphi$.
We can now state a true mirror of Proposition 6.12, in Ab:

Proposition 8.18. Let $\varphi : G \to G'$ be a homomorphism of abelian groups. The
following are equivalent:

(a) $\varphi$ is an epimorphism;

(b) $\operatorname{coker}\varphi$ is trivial;

(c) $\varphi : G \to G'$ is surjective (as a set-function).

Proof. (a) ⟹ (b): Assume (a) holds, and consider the two parallel compositions

\[ G \xrightarrow{\ \varphi\ } G' \underset{e}{\overset{\pi}{\rightrightarrows}} \operatorname{coker}\varphi, \]

where $\pi$ is the canonical projection and e is the trivial map. Both $\pi \circ \varphi$ and $e \circ \varphi$ are
the trivial map; since $\varphi$ is an epimorphism, this implies $\pi = e$. But $\pi = e$ implies
that $\operatorname{coker}\varphi$ is trivial, that is, (b) holds.

(b) ⟹ (c): If $\operatorname{coker}\varphi = G'/\operatorname{im}\varphi$ is trivial, then $\operatorname{im}\varphi = G'$; hence $\varphi$ is
surjective.

(c) ⟹ (a): If $\varphi$ is surjective, then it satisfies the universal property for
epimorphisms in Set: for any set Z and any two set-functions $\alpha'$ and $\alpha'' : G' \to Z$,

\[ \alpha' \circ \varphi = \alpha'' \circ \varphi \iff \alpha' = \alpha''. \]

This must hold in particular if Z is endowed with a group structure and $\alpha'$, $\alpha''$ are
group homomorphisms, so $\varphi$ is an epimorphism in Grp. □
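The equivalence of (b) and (c) can be observed on a family of homomorphisms of abelian groups. The sketch below assumes, as an example only, the maps "multiply by k" on Z/12Z and computes the cokernel as the set of cosets of the image; the names `image` and `cokernel` are illustrative.

```python
from math import gcd

n = 12                                       # work in G = G' = Z/12Z

def image(k):
    """Image of the homomorphism x |-> k*x on Z/12Z."""
    return frozenset((k * x) % n for x in range(n))

def cokernel(k):
    """G'/im(phi), as the set of cosets a + im(phi)."""
    im = image(k)
    return {frozenset((a + b) % n for b in im) for a in range(n)}

rows = []
for k in range(n):
    surjective = len(image(k)) == n
    trivial_cokernel = len(cokernel(k)) == 1
    rows.append((k, len(cokernel(k)), surjective, trivial_cokernel))

print(rows)
```

In every row the last two entries agree, and the size of the cokernel equals gcd(k, 12).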

A cokernel may be defined in Grp; the universal property for the cokernel of
$\varphi : G \to G'$ is satisfied by G'/N, where N is the smallest³⁸ normal subgroup
of G' containing $\operatorname{im}\varphi$ (Exercise 8.22). But Proposition 8.18 fails, because the
implication (b) ⟹ (c) does not hold: in Grp it is no longer true that only surjective
homomorphisms have trivial cokernel (cf. Exercise 8.23).


Exercises 


8.1. If a group H may be realized as a subgroup of two groups $G_1$ and $G_2$ and if

\[ \frac{G_1}{H} \cong \frac{G_2}{H}, \]

does it follow that $G_1 \cong G_2$? Give a proof or a counterexample.

8.2. ¬ Extend Example 8.6 as follows. Suppose G is a group and $H \subseteq G$ is a
subgroup of index 2, that is, such that there are precisely two (say, left-) cosets
of H in G. Prove that H is normal in G. [9.11, IV.1.16]

8.3. Prove that every finite group is finitely presented. 


³⁸ The intersection of any family of normal subgroups is normal, as the reader may readily
check, so this subgroup exists.

8.4. Prove that $(a, b \mid a^2, b^2, (ab)^n)$ is a presentation of the dihedral group $D_{2n}$.
(Hint: With respect to the generators defined in Exercise 2.5, set a = x and
b = xy; prove you can get the relations given here from the ones you obtained
in Exercise 2.5, and conversely.)

8.5. Let a, b be distinct elements of order 2 in a group G, and assume that ab has
finite order n > 3. Prove that the subgroup generated by a and b in G is isomorphic
to the dihedral group $D_{2n}$. (Use the previous exercise.)

8.6. ¬ Let G be a group, and let A be a set of generators for G; assume A is
finite. The corresponding Cayley graph³⁹ is a directed graph whose set of vertices
is in one-to-one correspondence with G, and two vertices $g_1$, $g_2$ are connected by
an edge if $g_2 = g_1 a$ for an $a \in A$; this edge may be labeled a and oriented from $g_1$
to $g_2$. For example, the graph drawn in Example 5.3 for the free group $F(\{x, y\})$
on two generators x, y is the corresponding Cayley graph (with the convention that
horizontal edges are labeled x and point to the right and vertical edges are labeled y
and point up).

Prove that if a Cayley graph of a group is a tree, then the group is free.
Conversely, prove that free groups admit Cayley graphs that are trees. [§5.3, 9.15]

8.7. ▷ Let $(A \mid \mathscr{R})$, resp., $(A' \mid \mathscr{R}')$, be a presentation for a group G, resp., G' (cf. §8.2);
we may assume that A, A' are disjoint. Prove that the group $G * G'$ presented by

\[ (A \cup A' \mid \mathscr{R} \cup \mathscr{R}') \]

satisfies the universal property for the coproduct of G and G' in Grp. (Use the
universal properties of both free groups and quotients to construct natural
homomorphisms $G \to G * G'$, $G' \to G * G'$.) [§3.4, §8.2, 9.14]

8.8. ¬ (If you know about matrices (cf. Exercise 6.1).) Prove that $SL_n(\mathbb{R})$ is
a normal subgroup of $GL_n(\mathbb{R})$, and ‘compute’ $GL_n(\mathbb{R})/SL_n(\mathbb{R})$ as a well-known
group. [VI.3.3]

8.9. ¬ (Ditto.) Prove that $SO_3(\mathbb{R}) \cong SU_2(\mathbb{C})/\{\pm I_2\}$, where $I_2$ is the identity
matrix. (Hint: It so happens that every matrix in $SO_3(\mathbb{R})$ can be written in the
form

\[ \begin{pmatrix} a^2 + b^2 - c^2 - d^2 & 2(bc - ad) & 2(ac + bd) \\ 2(ad + bc) & a^2 - b^2 + c^2 - d^2 & 2(cd - ab) \\ 2(bd - ac) & 2(ab + cd) & a^2 - b^2 - c^2 + d^2 \end{pmatrix} \]

where $a, b, c, d \in \mathbb{R}$ and $a^2 + b^2 + c^2 + d^2 = 1$. Proving this fact is not hard, but at this
stage you will probably find it computationally demanding. Feel free to assume this,
and use Exercise 6.3 to construct a surjective homomorphism $SU_2(\mathbb{C}) \to SO_3(\mathbb{R})$;
compute the kernel of this homomorphism.)

If you know a little topology, you can now conclude that the fundamental
group⁴⁰ of $SO_3(\mathbb{R})$ is $C_2$. [9.1, VI.1.3]

³⁹ Warning: This is one of several alternative conventions.

⁴⁰ If you really want to believe this fact, remember that $SO_3(\mathbb{R})$ parametrizes rotations in $\mathbb{R}^3$.
Hold a tray with a glass of water on top of your extended right hand. You should be able to rotate
the tray clockwise by a full 360° without spilling the water, and your muscles will tell you that
the corresponding loop in $SO_3(\mathbb{R})$ is not trivial. But then you will be able to rotate the tray again

8.10. View $\mathbb{Z} \times \mathbb{Z}$ as a subgroup of $\mathbb{R} \times \mathbb{R}$. Describe the quotient

\[ \frac{\mathbb{R} \times \mathbb{R}}{\mathbb{Z} \times \mathbb{Z}} \]

in terms analogous to those used in Example 8.7. (Can you ‘draw a picture’ of this
group? Cf. Exercise I.1.6.)

8.11. (Notation as in Proposition 8.10.) Prove ‘by hand’ (that is, without invoking 
universal properties) that N is normal in G if and only if N/H is normal in G/H. 

8.12. (Notation as in Proposition 8.11.) Prove ‘by hand’ (that is, by using Propo- 
sition 6.2) that HK is a subgroup of G if H is normal. 

8.13. ¬ Let G be a finite commutative group, and assume |G| is odd. Prove that
every element of G is a square. [8.14]

8.14. Generalize the result of Exercise 8.13: if G is a group of order n and k is an
integer relatively prime to n, then the function $g \mapsto g^k$ is surjective.

8.15. Let a, n be positive integers. Prove that n divides $\phi(a^n - 1)$, where $\phi$ is
Euler’s $\phi$-function; see Exercise 6.14. (Hint: Example 8.15.)

8.16. Generalize Fermat’s little theorem to congruences modulo arbitrary (that is,
possibly nonprime) integers. Note that it is not true that $a^n \equiv a \mod n$ for all
a and n: for example, $2^4$ is not congruent to 2 modulo 4. What is true? (This
generalization is known as Euler’s theorem.)

8.17. ▷ Assume G is a finite abelian group, and let p be a prime divisor of |G|.
Prove that there exists an element in G of order p. (Hint: Let g be any element
of G, and consider the subgroup $\langle g \rangle$; use the fact that this subgroup is cyclic to
show that there is an element $h \in \langle g \rangle$ in G of prime order q. If q = p, you are done;
otherwise, use the quotient $G/\langle h \rangle$ and induction.) [§8.5, 8.18, 8.20, §IV.2.1]

8.18. Let G be an abelian group of order 2n, where n is odd. Prove that G has
exactly one element of order 2. (It has at least one, for example by Exercise 8.17.
Use Lagrange’s theorem to establish that it cannot have more than one.) Does the
same conclusion hold if G is not necessarily commutative?

a full 360° clockwise without spilling any water, taking it back to the original position. Thus, the 
square of the loop is (homotopically) trivial, as it should be if the fundamental group is cyclic of 
order 2. 
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8.19. Let G be a finite group, and let d be a proper divisor of |G|. Is it necessarily 
true that there exists an element of G of order d! Give a proof or a counterexample. 


8.20. ▷ Assume G is a finite abelian group, and let d be a divisor of $|G|$. Prove
that there exists a subgroup $H \subseteq G$ of order d. (Hint: induction; use Exercise 8.17.)
[§IV.2.2]


8.21. ▷ Let H, K be subgroups of a group G. Construct a bijection between the
set of cosets hK with $h \in H$ and the set of left-cosets of $H \cap K$ in H. If H and K
are finite, prove that

$$|HK| = \frac{|H| \cdot |K|}{|H \cap K|}.$$

[§8.5]


8.22. ▷ Let $\varphi : G \to G'$ be a group homomorphism, and let N be the smallest
normal subgroup containing $\operatorname{im}\varphi$. Prove that $G'/N$ satisfies the universal property
of $\operatorname{coker}\varphi$ in Grp. [§8.6]


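8.23. ▷ Consider the subgroup

$$H = \left\{ \begin{pmatrix} 1 & 2 & 3 \\ 1 & 2 & 3 \end{pmatrix}, \begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix} \right\}$$

of $S_3$. Show that the cokernel of the inclusion $H \hookrightarrow S_3$ is trivial, although $H \hookrightarrow S_3$
is not surjective. [§8.6]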

8.24. ▷ Show that epimorphisms in Grp do not necessarily have right-inverses.
[§I.4.2]


8.25. Let H be a commutative normal subgroup of G. Construct an interesting
homomorphism from G/H to $\mathrm{Aut}(H)$. (Cf. Exercise 7.10.)


9. Group actions 

9.1. Actions. As mentioned in §4.1, an action of a group G on an object A of a
category C is simply a homomorphism

$$\sigma : G \to \mathrm{Aut}_C(A).$$

The way to interpret this is that every element $g \in G$ determines a ‘transformation
of A into itself’, i.e., an isomorphism of A in C, and this happens compatibly with
the operation of G and composition in C.

In a rather strong sense, we really only care about groups because they act on 
things: knowing that G acts on A tells us something about A; group actions are 
one key tool in the study of geometric and algebraic entities. 

In fact, group actions are one key tool in the study of groups themselves: one
of the best ways to ‘understand’ a group is to let it act on an object A, hoping
that the corresponding homomorphism $\sigma$ is an isomorphism, or at least a
monomorphism. For example, we were lucky with $D_6$ in §2.2: we let $D_6$ act on a
set with three elements (the vertices of an equilateral triangle) and observed that
the resulting $\sigma$ is an isomorphism. Thus $D_6 \cong S_3$. We would be almost as lucky
by letting $D_8$ act on the vertices of a square: then $\sigma$ would at least realize $D_8$ as
an explicit subgroup of $S_4$, which simplifies its analysis.
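
The remark about the square can be checked by brute force. The following is only a sketch in Python, not part of the text: the vertex labelling 0, 1, 2, 3 and the helper names `compose` and `generate` are assumptions made for the illustration. It closes a rotation and a reflection of the square under composition and confirms that they generate a subgroup of order 8 inside $S_4$.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens):
    """Close a list of permutations of a finite set under composition."""
    identity = tuple(range(len(gens[0])))
    group = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

# Vertices of the square are labelled 0, 1, 2, 3 in cyclic order (an assumption).
rotation = (1, 2, 3, 0)      # vertex i goes to vertex i+1 mod 4
reflection = (0, 3, 2, 1)    # flip about the diagonal through vertices 0 and 2

d8 = generate([rotation, reflection])
s4 = set(permutations(range(4)))
print(len(d8), len(s4))      # 8 24
```

Since a finite set of permutations closed under composition is automatically a subgroup, the eight permutations found this way form the image of $D_8$ in $S_4$ under the labelling chosen above.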

Definition 9.1. An action of a group G on an object A of a category C is faithful
(or effective) if the corresponding $\sigma : G \to \mathrm{Aut}_C(A)$ is injective. ⌟

The case C = Set is already very rich, and we focus on it in this chapter. 

9.2. Actions on sets. Spelling out our definition of action in case A is a set, so
that $\mathrm{Aut}_C(A)$ is the symmetric group $S_A$, we get the following:

Definition 9.2. An action of a group G on a set A is a set-function

$$\rho : G \times A \to A$$

such that $\rho(e_G, a) = a$ for all $a \in A$ and

$$(\forall g, h \in G), (\forall a \in A) : \quad \rho(gh, a) = \rho(g, \rho(h, a)).$$ ⌟

Indeed, given a function $\rho$ satisfying these conditions, we can define
$\sigma : G \to \mathrm{Hom}_{\mathrm{Set}}(A, A)$ by $\sigma(g)(a) = \rho(g, a)$. (This defines $\sigma(g)$ as a set-function $A \to A$, as
needed.) This function preserves the operation, because

$$\sigma(gh)(a) = \rho(gh, a) = \rho(g, \rho(h, a)) = \sigma(g)(\rho(h, a)) = \sigma(g)(\sigma(h)(a)) = \sigma(g) \circ \sigma(h)(a).$$

In particular, this verifies that $\sigma(g^{-1})$ acts as the inverse of $\sigma(g)$: because $(\forall a \in A)$

$$\sigma(g^{-1}) \circ \sigma(g)(a) = \sigma(g^{-1}g)(a) = \sigma(e_G)(a) = \rho(e_G, a) = a.$$

Thus the image of $\sigma$ consists of invertible set-functions; $\sigma$ is acting as a function

$$\sigma : G \to S_A,$$

and we have verified that this is a homomorphism, as needed.

Conversely, given a homomorphism $\sigma : G \to S_A$, define $\rho : G \times A \to A$ by
$\rho(g, a) = \sigma(g)(a)$; the same argument (read backwards) shows that $\rho$ satisfies the
needed properties.

It is unpleasant to carry $\rho$ along. In practice, one just writes $ga$ for $\rho(g, a)$; the
requirements in Definition 9.2 amount then to $e_G a = a$ for all $a \in A$ and

$$(\forall g, h \in G), (\forall a \in A) : \quad (gh)a = g(ha),$$

‘as if’ $\rho$ defined an associative operation.

If G acts on A, then $e_G a = a$ for all $a \in A$; the action of a group G on a set A
is faithful if and only if the identity $e_G$ is the only element g of G such that $ga = a$
for all $a \in A$, that is, ‘fixing’ every element of A. An action is free if the identity
$e_G$ is the only element fixing any element of A.
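
To make the two requirements of Definition 9.2 and the notions of faithful and free action concrete, here is a small Python sketch (an illustration under stated assumptions, not part of the text): $S_3$ is modelled as tuples `g` with `g[i]` the image of `i`, and the action on the set {0, 1, 2} is simply evaluation. The names `rho`, `compose`, `faithful`, and `free` are chosen for this example only.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

A = range(3)
G = list(permutations(A))      # S_3; g[i] is the image of i under g
e = tuple(A)                   # the identity of G

def rho(g, a):
    """The action rho : G x A -> A, here rho(g, a) = g(a)."""
    return g[a]

# The two requirements of Definition 9.2.
identity_ok = all(rho(e, a) == a for a in A)
compat_ok = all(rho(compose(g, h), a) == rho(g, rho(h, a))
                for g in G for h in G for a in A)

# Faithful: only e fixes every element.  Free: only e fixes any element.
faithful = all(g == e for g in G if all(rho(g, a) == a for a in A))
free = all(g == e for g in G if any(rho(g, a) == a for a in A))
print(identity_ok, compat_ok, faithful, free)   # True True True False
```

The action is faithful but not free: a transposition such as (1, 0, 2) is not the identity, yet it fixes the element 2.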

Example 9.3. Every group G acts in a natural way on the underlying set G. The
function $\rho : G \times G \to G$ is simply the operation in the group:

$$(\forall g, a \in G) : \quad \rho(g, a) = ga.$$

In this case the defining property really is associativity. This is referred to as the
action by left-multiplication$^{41}$ of G on itself. There is (at least) another very natural
way to act with G on itself, by conjugation: define $\rho : G \times G \to G$ by

$$\rho(g, h) = g h g^{-1}.$$

This is indeed an action: $\forall g, h, k \in G$,

$$\rho(g, \rho(h, k)) = g\,\rho(h, k)\,g^{-1} = g(h k h^{-1})g^{-1} = (gh)k(gh)^{-1} = \rho(gh, k).$$ ⌟

Example 9.4. More generally, G acts by left-multiplication on the set G/H of
left-cosets (cf. §8.5) of any subgroup H: act by $g \in G$ on $aH \in G/H$ by sending it
to $(ga)H$. ⌟

These examples of actions are extremely useful in studying groups, as we will see
in Chapter IV. For instance, an immediate consequence is the following counterpart
to §8.2:

Theorem 9.5 (Cayley’s theorem). Every group acts faithfully on some set. That 
is, every group may be realized as a subgroup of a permutation group. 

Proof. Indeed, simply observe that the left-multiplication action of G on itself is 
manifestly faithful. □ 
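
As a sanity check of Cayley’s theorem in one small case, the following Python sketch (illustrative only; the labelling `index` and the function name `sigma` are assumptions of the example) realizes $S_3$ as a group of permutations of its own six elements via left-multiplication and verifies that the resulting map is an injective homomorphism.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

G = list(permutations(range(3)))             # S_3, of order 6
index = {g: k for k, g in enumerate(G)}      # label the elements of G by 0..5

def sigma(g):
    """Left-multiplication by g, recorded as a permutation of the labels 0..5."""
    return tuple(index[compose(g, a)] for a in G)

images = {g: sigma(g) for g in G}
is_homomorphism = all(images[compose(g, h)] == compose(images[g], images[h])
                      for g in G for h in G)
is_injective = len(set(images.values())) == len(G)
print(is_homomorphism, is_injective)         # True True
```

Here `sigma` embeds a group of order 6 into the symmetric group on 6 labels, which has order 720; the embedding given by the action on the triangle's vertices is far more economical, which is why finding a good set to act on matters.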

The notion defined in Definition 9.2 is, for the sake of precision, called a
left-action. A right-action would associate to each pair $(g, a)$ with $g \in G$ and $a \in A$ an
element $ag \in A$; our make-believe associativity would now say

$$a(gh) = (ag)h$$

for all $a \in A$ and $g, h \in G$. This is a different requirement than the one given in
Definition 9.2; multiplication on the right in a group G gives a prototypical example
of a right-action of G (on itself).

Every right-action may be turned into a left-action with due care (cf.
Exercise 9.3). Therefore it is not restrictive to just consider left-actions; from now on,
an ‘action’ will be understood to be a left-action, unless stated otherwise.

9.3. Transitive actions and the category G-Set.

Definition 9.6. An action of a group G on a set A is transitive if $\forall a, b \in A$ $\exists g \in G$
such that $b = ga$. ⌟

For example, the left-multiplication action of a group on itself is transitive. 
Transitive actions are the basic ingredients making up every action; this is seen by 
means of the following important concepts. 

Definition 9.7. The orbit of $a \in A$ under an action of a group G is the set

$$O_G(a) := \{ga \mid g \in G\}.$$ ⌟


41 This is left-multiplication in the sense that the ‘acting’ element g of G is placed to the left
of the element a ‘acted upon’.

Definition 9.8. Let G act on a set A, and let $a \in A$. The stabilizer subgroup of
a consists of the elements of G which fix a:

$$\mathrm{Stab}_G(a) := \{g \in G \mid ga = a\}.$$ ⌟

Orbits of an action of a group G on a set A form a partition of A ; and we 
have an induced, transitive action of G on each orbit. Therefore we can, in a 
sense, ‘understand’ all actions if we understand transitive actions. This will be 
accomplished in a moment, by studying actions related to stabilizers. 

For any group G, sets endowed with a (left) G-action form in a natural way
a category G-Set: objects are pairs $(\rho, A)$, where $\rho : G \times A \to A$ is an action (as
in Definition 9.2) and morphisms between two objects are set-functions which are
compatible with the actions. That is, a morphism

$$(\rho, A) \to (\rho', A')$$

in G-Set amounts to a set-function $\varphi : A \to A'$ such that the diagram

$$\begin{array}{ccc}
G \times A & \xrightarrow{\ \mathrm{id}_G \times \varphi\ } & G \times A' \\
{\scriptstyle \rho}\downarrow & & \downarrow{\scriptstyle \rho'} \\
A & \xrightarrow{\quad \varphi \quad} & A'
\end{array}$$

commutes. In the usual shorthand notation omitting the $\rho$’s, this means that
$\forall g \in G$, $\forall a \in A$,

$$g\varphi(a) = \varphi(ga);$$

that is, the action ‘commutes’ with $\varphi$. Such functions are called (G-)equivariant.

We therefore have a notion of isomorphism of G-sets (defined as in §I.4.1); the
reader should expect (and should verify) that these are nothing but the equivariant
bijections.

Among G-sets we single out the sets G/H of left-cosets of subgroups H of G; 
as noted in Example 9.4, G acts on G/H by left-multiplication. 

Proposition 9.9. Every transitive left-action of G on a set A is isomorphic to the
left-multiplication of G on G/H, for H = the stabilizer of any $a \in A$.

Proof. Let G act transitively on a set A, let $a \in A$ be any element, and let
$H = \mathrm{Stab}_G(a)$. We claim that there is an equivariant bijection

$$\varphi : G/H \to A$$

defined by

$$\varphi(gH) := ga$$

for all $g \in G$.

Indeed, first of all $\varphi$ is well-defined: if $g_1 H = g_2 H$, then $g_1^{-1} g_2 \in H$, hence
$(g_1^{-1} g_2)a = a$, and it follows that $g_1 a = g_2 a$ as needed. To verify that $\varphi$ is bijective,
define a function $\psi : A \to G/H$ by sending an element $ga$ of A to $gH$ (every element
of A has this form, since the action is transitive); $\psi$ is well-defined
because if $g_1 a = g_2 a$, then $g_1^{-1}(g_2 a) = a$, so $g_1^{-1} g_2 \in H$ and $g_1 H = g_2 H$. It
is clear that $\varphi$ and $\psi$ are inverses of each other; hence $\varphi$ is a bijection.

Equivariance is immediate: $\varphi(g'(gH)) = g'ga = g'\varphi(gH)$. □
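
The three steps of this proof (well-definedness, bijectivity, equivariance) can be tested mechanically in a small case. The Python sketch below is only an illustration: it takes G = $S_3$ acting on {0, 1, 2}, picks a = 0, and represents each left-coset as a `frozenset`; the names `coset` and `phi` are assumptions of the example.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

A = range(3)
G = list(permutations(A))                 # S_3 acting on {0, 1, 2} by g . i = g[i]
a = 0
H = [g for g in G if g[a] == a]           # the stabilizer of a

def coset(g):
    """The left-coset gH, as a frozenset so that cosets can be compared."""
    return frozenset(compose(g, h) for h in H)

cosets = {coset(g) for g in G}            # the set G/H

# phi(gH) := ga is well-defined if all representatives of a coset agree on a.
well_defined = all(len({g[a] for g in c}) == 1 for c in cosets)
phi = {c: next(iter(c))[a] for c in cosets}

bijective = sorted(phi.values()) == list(A)
equivariant = all(phi[coset(compose(k, g))] == k[phi[coset(g)]]
                  for k in G for g in G)
print(len(H), len(cosets), well_defined, bijective, equivariant)
# 2 3 True True True
```

Choosing a different base point `a` changes `H` to a conjugate subgroup, but the same checks go through.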





Corollary 9.10. If O is an orbit of the action of a finite group G on a set A, then
O is a finite set and

$$|O| \text{ divides } |G|.$$

Proof. By Proposition 9.9 there is a bijection between O and $G/\mathrm{Stab}_G(a)$ for any
element $a \in O$; thus

$$|O| \cdot |\mathrm{Stab}_G(a)| = |G|$$

by Corollary 8.14. □


Corollary 9.10 upgrades Lagrange’s theorem to orbits of any action; it is ex- 
tremely useful, as it provides a very strong constraint on group actions. 
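
The counting identity behind Corollary 9.10 is easy to observe experimentally. As a hedged illustration (a Python sketch with helper names `compose`, `inverse`, and `act` chosen for this example), let $S_3$ act on itself by conjugation and compare orbit and stabilizer sizes for every element.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

G = list(permutations(range(3)))          # S_3

def act(g, h):
    """Conjugation action of G on itself: g . h = g h g^{-1}."""
    return compose(compose(g, h), inverse(g))

for h in G:
    orbit = {act(g, h) for g in G}
    stabilizer = [g for g in G if act(g, h) == h]
    assert len(orbit) * len(stabilizer) == len(G)

orbit_sizes = sorted(len({act(g, h) for g in G}) for h in G)
print(orbit_sizes)                        # [1, 2, 2, 3, 3, 3]
```

Every orbit size (1, 2, or 3) divides 6, as the corollary predicts.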

Example 9.11. There are no transitive actions of $S_3$ on a set with 5 elements.
Indeed, 5 does not divide 6. ⌟

Ultimately, almost everything we will prove in Chapter IV on the structure of 
finite groups will be a consequence of ‘counting arguments’ stemming from applica- 
tions of Corollary 9.10 to actions of a group by conjugation or left-multiplication. 

There may seem to be an element of arbitrariness in the statement of
Proposition 9.9: what if we change the element a of which we are taking the stabilizer?
The stabilizer may change, but it does so in a controlled way:

Proposition 9.12. Suppose a group G acts on a set A, and let $a \in A$, $g \in G$,
$b = ga$. Then

$$\mathrm{Stab}_G(b) = g\,\mathrm{Stab}_G(a)\,g^{-1}.$$

Proof. Indeed, assume $h \in \mathrm{Stab}_G(a)$; then

$$(g h g^{-1})(b) = gh(g^{-1}g)a = gha = ga = b :$$

thus $g h g^{-1} \in \mathrm{Stab}_G(b)$. This proves the $\supseteq$ inclusion; $\subseteq$ follows by the same
argument, noting that $a = g^{-1}b$. □
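
A brute-force confirmation of Proposition 9.12 in one example, offered as a sketch rather than as part of the text: the Python code below lets $S_4$ act on {0, 1, 2, 3} and compares the stabilizer of $ga$ with the conjugate of the stabilizer of $a$. The helper names are assumptions of the illustration.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

A = range(4)
G = list(permutations(A))                 # S_4 acting on {0, 1, 2, 3} by g . a = g[a]

def stab(a):
    """The stabilizer of a, as a set of permutations."""
    return {g for g in G if g[a] == a}

# Stab(ga) should equal g Stab(a) g^{-1} for every g and a.
conjugation_ok = all(
    stab(g[a]) == {compose(compose(g, h), inverse(g)) for h in stab(a)}
    for g in G for a in A
)
print(conjugation_ok, len(stab(0)))       # True 6
```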


For example, if $\mathrm{Stab}_G(a)$ happens to be normal, then it is really independent
of a (in any given orbit). In any case, there is an isomorphism of G-sets between
$G/H$ and $G/(gHg^{-1})$, as follows from these considerations (and as the reader will
independently check in Exercise 9.13).

Exercises


9.1. (Once more, if you are already familiar with a little linear algebra. . .) The
matrix groups listed in Exercise 6.1 all come with evident actions on a vector
space: if M is an $n \times n$ matrix with (say) real entries, multiplication to the right by
a column n-vector v returns a column n-vector Mv, and this defines a left-action
on $\mathbb{R}^n$ viewed as the space of column n-vectors.

• Prove that, through this action, matrices $M \in \mathrm{O}_n(\mathbb{R})$ preserve lengths and angles
in $\mathbb{R}^n$.

• Find an interesting action of $\mathrm{SU}_2(\mathbb{C})$ on $\mathbb{R}^3$. (Hint: Exercise 8.9.)

9.2. The effect of the matrices

$$\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$$

on the plane is to respectively flip the plane about the y-axis and to rotate it 90°
clockwise about the origin. With this in mind, construct an action of $D_8$ on $\mathbb{R}^2$.

9.3. If $G = (G, \cdot)$ is a group, we can define an ‘opposite’ group $G^\circ = (G, \bullet)$
supported on the same set G, by prescribing

$$(\forall g, h \in G) : \quad g \bullet h := h \cdot g.$$

• Verify that $G^\circ$ is indeed a group.

• Show that the ‘identity’: $G^\circ \to G$, $g \mapsto g$ is an isomorphism if and only if G is
commutative.

• Show that $G^\circ \cong G$ (even if G is not commutative!).

• Show that giving a right-action of G on a set A is the same as giving a
homomorphism $G^\circ \to S_A$, that is, a left-action of $G^\circ$ on A.

• Show that the notions of left- and right-actions coincide ‘on the nose’ for
commutative groups. (That is, if $(g, a) \mapsto ag$ defines a right-action of a commutative
group G on a set A, then setting $ga = ag$ defines a left-action.)

• For any group G, explain how to turn a right-action of G into a left-action
of G. (Note that the simple ‘flip’ $ga = ag$ does not work in general if G is not
commutative.)

9.4. As mentioned in the text, right-multiplication defines a right-action of a group
on itself. Find another natural right-action of a group on itself.

9.5. Prove that the action by left-multiplication of a group on itself is free. 

9.6. Let O be an orbit of an action of a group G on a set. Prove that the induced 
action of G on O is transitive. 

9.7. Prove that stabilizers are indeed subgroups. 

9.8. For G a group, verify that G-Set is indeed a category, and verify that the
isomorphisms in G-Set are precisely the equivariant bijections.

9.9. Prove that G-Set has products and coproducts and that every finite object of
G-Set is a coproduct of objects of the type G/H = {left-cosets of H}, where H is
a subgroup of G and G acts on G/H by left-multiplication.

9.10. Let H be any subgroup of a group G. Prove that there is a bijection between
the set $G/H$ of left-cosets of H and the set $H \backslash G$ of right-cosets of H in G. (Hint:
G acts on the right on the set of right-cosets; use Exercise 9.3 and Proposition 9.9.)

9.11. ¬ Let G be a finite group, and let H be a subgroup of index p, where p is
the smallest prime dividing $|G|$. Prove that H is normal in G, as follows:

• Interpret the action of G on G/H by left-multiplication as a homomorphism
$\sigma : G \to S_p$.

• Then $G/\ker\sigma$ is (isomorphic to) a subgroup of $S_p$. What does this say about
the index of $\ker\sigma$ in G?

• Show that $\ker\sigma \subseteq H$.

• Conclude that $H = \ker\sigma$, by index considerations.

Thus H is a kernel, proving that it is normal. (This exercise generalizes the result
of Exercise 8.2.) [9.12]

9.12. ¬ Generalize the result of Exercise 9.11, as follows. Let G be a group, and
let $H \subseteq G$ be a subgroup of index n. Prove that H contains a subgroup K that is
normal in G and such that $[G : K]$ divides the gcd of $|G|$ and $n!$. (In particular,
$[G : K] \le n!$.) [IV.2.23]

9.13. ▷ Prove ‘by hand’ that for all subgroups H of a group G and $\forall g \in G$, $G/H$ and
$G/(gHg^{-1})$ (endowed with the action of G by left-multiplication) are isomorphic
in G-Set. [§9.3]

9.14. ¬ Prove that the modular group $\mathrm{PSL}_2(\mathbb{Z})$ is isomorphic to the coproduct
$C_2 * C_3$. (Recall that the modular group $\mathrm{PSL}_2(\mathbb{Z})$ is generated by
$x = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $y = \begin{pmatrix} 1 & -1 \\ 1 & 0 \end{pmatrix}$,
satisfying the relations $x^2 = y^3 = e$ in $\mathrm{PSL}_2(\mathbb{Z})$ (Exercise 7.5). The
task is to prove that x and y satisfy no other relation: this will show that $\mathrm{PSL}_2(\mathbb{Z})$
is presented by $(x, y \mid x^2, y^3)$, and we have agreed that this is a presentation for
$C_2 * C_3$ (Exercise 3.8 or 8.7). Reduce this to verifying that no products

$$(y^{\pm 1}x)(y^{\pm 1}x)\cdots(y^{\pm 1}x) \quad\text{or}\quad (y^{\pm 1}x)(y^{\pm 1}x)\cdots(y^{\pm 1}x)\,y^{\pm 1}$$

with one or more factors can equal the identity. This latter verification is
traditionally carried out by cleverly exploiting an action$^{42}$. Let the modular group act
on the set of irrational real numbers by

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix}(r) = \frac{ar + b}{cr + d}.$$

Check that this does define an action of $\mathrm{PSL}_2(\mathbb{Z})$, and note that

$$y(r) = 1 - \frac{1}{r}, \quad y^{-1}(r) = \frac{1}{1 - r}, \quad yx(r) = 1 + r, \quad y^{-1}x(r) = \frac{r}{1 + r}.$$

Now complete the verification with a case-by-case analysis. For example, a product
$(y^{\pm 1}x)(y^{\pm 1}x)\cdots(y^{\pm 1}x)\,y$ cannot equal the identity in $\mathrm{PSL}_2(\mathbb{Z})$ because if it did, it
would act as the identity on $\mathbb{R} \setminus \mathbb{Q}$, while if $r < 0$, then $y(r) > 0$, and both $yx$ and
$y^{-1}x$ send positive irrationals to positive irrationals.) [3.8]

42 The modular group acts on $\mathbb{C} \cup \{\infty\}$ by Möbius transformations. The observation that it
suffices to act on $\mathbb{R} \setminus \mathbb{Q}$ for the purpose of this verification is due to Roger Alperin.
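
The relations and the four formulas quoted in Exercise 9.14 can be double-checked numerically before attempting the case analysis. The Python sketch below is an aid, not a solution: `matmul` and `mobius` are helper names chosen here, and the formulas are tested at a few rational sample points purely as algebraic identities (the exercise itself acts on irrational numbers).

```python
from fractions import Fraction

def matmul(m, n):
    """Product of two 2x2 integer matrices given as nested tuples."""
    (a, b), (c, d) = m
    (p, q), (r, s) = n
    return ((a*p + b*r, a*q + b*s), (c*p + d*r, c*q + d*s))

def mobius(m, r):
    """Apply the matrix ((a, b), (c, d)) to r by r -> (a r + b) / (c r + d)."""
    (a, b), (c, d) = m
    return (a*r + b) / (c*r + d)

x = ((0, -1), (1, 0))
y = ((1, -1), (1, 0))
minus_identity = ((-1, 0), (0, -1))       # represents the identity of PSL_2(Z)

x_squared = matmul(x, x)
y_cubed = matmul(matmul(y, y), y)
y_inv = matmul(y, y)                      # y^3 = -I, so y^2 represents y^{-1}

# Sample points chosen to avoid the poles r = 0, 1, -1.
samples = [Fraction(5, 7), Fraction(-3, 2), Fraction(9, 4)]
formulas_ok = all(
    mobius(y, r) == 1 - 1/r
    and mobius(y_inv, r) == 1/(1 - r)
    and mobius(matmul(y, x), r) == 1 + r
    and mobius(matmul(y_inv, x), r) == r/(1 + r)
    for r in samples
)
print(x_squared == minus_identity, y_cubed == minus_identity, formulas_ok)
# True True True
```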

9.15. ¬ Prove that every (finitely generated) group G acts freely on any
corresponding Cayley graph. (Cf. Exercise 8.6. Actions on a directed graph are defined
as actions on the set of vertices preserving incidence: if the vertices $v_1$, $v_2$ are
connected by an edge, then so must be $gv_1$, $gv_2$ for every $g \in G$.) In particular,
conclude that every free group acts freely on a tree. [9.16]

9.16. ▷ The converse of the last statement in Exercise 9.15 is also true: only free
groups can act freely on a tree. Assuming this, prove that every subgroup of a free
group (on a finite set) is free. [§6.4]

9.17. ▷ Consider G as a G-set, by acting with left-multiplication. Prove that
$\mathrm{Aut}_{G\text{-}\mathrm{Set}}(G) \cong G$. [§2.1]

9.18. Show how to construct a groupoid carrying the information of the action of 
a group G on a set A. (Hint: A will be the set of objects of the groupoid. What 
will be the morphisms?) 


10. Group objects in categories 

10.1. Categorical viewpoint. The definition of group (Definition 1.2) is firmly
grounded on the category Set: A group is a set G endowed with a binary
operation. . . . However, we have noticed along the way (for example, in §3) that what is
really behind it is a pair of functions:

$$m : G \times G \to G, \qquad \iota : G \to G$$

satisfying certain properties (which translate into associativity, existence of inverses, 
etc.). Much of what we have seen could be expressed exclusively in terms of these 
functions, systematically replacing considerations on ‘elements’ by suitable commu- 
tative diagrams and enforcing universal properties as a means to define key notions 
such as the quotient of a group by a subgroup. For example, homomorphisms may 
be defined purely in terms of the commutativity of a diagram: cf. Definition 3.1. 

This point of view may be transferred easily to categories other than Set, and 
the corresponding notions are very important in modern mathematics. 

Definition 10.1. Let C be a category with (finite) products and with a final 
object 1. 

A group object in C consists of an object G of C and of morphisms 


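$$m : G \times G \to G, \qquad e : 1 \to G, \qquad \iota : G \to G$$

in C such that the diagrams

$$\begin{array}{ccccc}
(G \times G) \times G & \xrightarrow{\ m \times \mathrm{id}_G\ } & G \times G & \xrightarrow{\ m\ } & G \\
{\scriptstyle\cong}\downarrow & & & & \| \\
G \times (G \times G) & \xrightarrow{\ \mathrm{id}_G \times m\ } & G \times G & \xrightarrow{\ m\ } & G
\end{array}$$

$$\begin{array}{ccc}
1 \times G & \xrightarrow{\ e \times \mathrm{id}_G\ } & G \times G \\
 & {\scriptstyle\cong}\searrow & \downarrow{\scriptstyle m} \\
 & & G
\end{array}
\qquad\qquad
\begin{array}{ccc}
G \times 1 & \xrightarrow{\ \mathrm{id}_G \times e\ } & G \times G \\
 & {\scriptstyle\cong}\searrow & \downarrow{\scriptstyle m} \\
 & & G
\end{array}$$

$$\begin{array}{ccccc}
G & \xrightarrow{\ \Delta\ } & G \times G & \xrightarrow{\ \mathrm{id}_G \times \iota\ } & G \times G \\
\downarrow & & & & \downarrow{\scriptstyle m} \\
1 & & \xrightarrow{\quad e \quad} & & G
\end{array}
\qquad\qquad
\begin{array}{ccccc}
G & \xrightarrow{\ \Delta\ } & G \times G & \xrightarrow{\ \iota \times \mathrm{id}_G\ } & G \times G \\
\downarrow & & & & \downarrow{\scriptstyle m} \\
1 & & \xrightarrow{\quad e \quad} & & G
\end{array}$$

commute. ⌟

Comments. The morphism $\Delta = \mathrm{id}_G \times \mathrm{id}_G$ is the ‘diagonal’ morphism
$G \to G \times G$ induced by the universal property for products and the identity map(s)
$G \to G$. Likewise, the other unnamed morphisms in these diagrams are all uniquely
determined by suitable universal properties. For example, there is a unique
morphism $\varepsilon : G \to 1$ because 1 is final. The composition with the projection

$$G \xrightarrow{\ \varepsilon \times \mathrm{id}_G\ } 1 \times G \longrightarrow G$$

is the identity; so is

$$1 \times G \longrightarrow G \xrightarrow{\ \varepsilon \times \mathrm{id}_G\ } 1 \times G$$

(why?); therefore the projection $1 \times G \to G$ is indeed an isomorphism, as indicated.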

The reader will hopefully realize immediately (Exercise 10.2) that our original 
definition of groups given in §1 is precisely equivalent to the definition of group 
object in Set: the commutativity of the given diagrams codifies associativity and 
the existence of two-sided identity and inverses. 
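
For a feel of how the diagrams encode the usual axioms, here is a small Python sketch, offered as an illustration under stated assumptions rather than as a solution to Exercise 10.2: the cyclic group $\mathbb{Z}/5\mathbb{Z}$ is treated as a group object in Set, with a one-element set playing the role of the final object 1. All function names (`m`, `e`, `iota`, `diagonal`, and so on) are chosen for this example.

```python
n = 5
G = range(n)
STAR = "*"                      # the single element of a final object 1 = {*} in Set

def m(pair):                    # m : G x G -> G
    a, b = pair
    return (a + b) % n

def e(star):                    # e : 1 -> G picks out the identity element
    return 0

def iota(g):                    # iota : G -> G
    return (-g) % n

def diagonal(g):                # Delta : G -> G x G
    return (g, g)

def id_times_iota(pair):        # id_G x iota : G x G -> G x G
    return (pair[0], iota(pair[1]))

def iota_times_id(pair):        # iota x id_G : G x G -> G x G
    return (iota(pair[0]), pair[1])

# Each diagram commutes if and only if both paths agree on every element.
associativity = all(m((m((a, b)), c)) == m((a, m((b, c))))
                    for a in G for b in G for c in G)
identity = all(m((e(STAR), g)) == g == m((g, e(STAR))) for g in G)
inverse = all(m(id_times_iota(diagonal(g))) == e(STAR) == m(iota_times_id(diagonal(g)))
              for g in G)
print(associativity, identity, inverse)   # True True True
```

In Set, checking a diagram on elements is legitimate because morphisms are functions; in a general category C the diagrams themselves are the only available formulation.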

Most interesting categories the reader will encounter (not necessarily in this 
book), such as the category of topological spaces, differentiable manifolds, algebraic 
varieties, schemes, etc., will carry ‘their own’ notion of group object. For example, 
a topological group is a group object in the category of topological spaces; a Lie 
group is a group object in the category of differentiable manifolds, etc.

Exercises


10.1. Define all the unnamed maps appearing in the diagrams in the definition of
group object, and prove they are indeed isomorphisms when so indicated. (For the
projection $1 \times G \to G$, what is left to prove is that the composition

$$1 \times G \to G \to 1 \times G$$

is the identity, as mentioned in the text.)

10.2. ▷ Show that groups, as defined in §1.2, are ‘group objects in the category of
sets’. [§10.1]

10.3. Let $(G, \cdot)$ be a group, and suppose $\circ : G \times G \to G$ is a group homomorphism
(w.r.t. $\cdot$) such that $(G, \circ)$ is also a group. Prove that $\circ$ and $\cdot$ coincide. (Hint: First
prove that the identity with respect to the two operations must be the same.)

10.4. Prove that every abelian group has exactly one structure of group object in 
the category Ab. 

10.5. By the previous exercise, a group object in Ab is nothing other than an 
abelian group. What is a group object in Grp? 




Chapter III 


Rings and modules 


1. Definition of ring 

In this chapter we will do for rings and modules what we have done in Chapter II for 
groups: describe them in general terms, with particular attention to distinguished 
subobjects and quotients. More detailed information on these structures will be 
deferred to later chapters; in Chapter V we will look more carefully at several 
interesting classes of rings, and modules (over commutative rings) will take center 
stage in our rapid overview of linear algebra in Chapter VI and following. In this 
chapter we will also include a brief jaunt into homological algebra, a topic that will 
entertain us greatly in Chapter IX. 

1.1. Definition. Rings (and modules) are defined by ‘decorating’ abelian groups
with additional data. As motivation for the introduction of such structures, note
that all number-based examples of groups that we have encountered, such as $\mathbb{Z}$ or $\mathbb{R}$,
are endowed with an operation of multiplication as well as the ‘addition’ making
them into (abelian) groups. The ‘ring axioms’ will reflect closely the properties and
compatibilities of these two operations in such examples.

These examples are, however, very special. A more sophisticated motivation
for the introduction of rings arises by analyzing further the structure of
homomorphisms of abelian groups. Recall (§II.4.4) that if G, H are abelian groups, then
$\mathrm{Hom}_{\mathrm{Ab}}(G, H)$ is also an abelian group. In particular, if G is an abelian group, then
so is the set of endomorphisms $\mathrm{End}_{\mathrm{Ab}}(G) = \mathrm{Hom}_{\mathrm{Ab}}(G, G)$. More is true:
morphisms from an object of a category to itself may be composed with each other (by
definition of category!). Thus, two operations coexist in $\mathrm{End}_{\mathrm{Ab}}(G)$: addition
(inherited from G, making $\mathrm{End}_{\mathrm{Ab}}(G)$ an abelian group), and composition. These two
operations are compatible with each other in a sense captured by the ring axioms:

Definition 1.1. A ring $(R, +, \cdot)$ is an abelian group $(R, +)$ endowed with a second
binary operation $\cdot$, satisfying on its own the requirements of being associative and
having a two-sided identity, i.e.,

• $(\forall r, s, t \in R) : \ (r \cdot s) \cdot t = r \cdot (s \cdot t)$,

• $(\exists 1_R \in R)\,(\forall r \in R) : \ r \cdot 1_R = r = 1_R \cdot r$

(which make $(R, \cdot)$ a monoid), and further interacting with + via the following
distributive properties:

• $(\forall r, s, t \in R) : \ (r + s) \cdot t = r \cdot t + s \cdot t$ and $t \cdot (r + s) = t \cdot r + t \cdot s$. ⌟

The notation • is often omitted in formulas, and we usually refer to rings by 
the name of the underlying set. 

Warning: What we are calling a ‘ring’, others may call a ‘ring with identity’
or a ‘ring with 1’: it is not uncommon to exclude the axiom of existence of a
‘multiplicative identity’ from the list of axioms defining a ring. Rings without
identity are sometimes called rngs, but we are not sure this should be encouraged$^1$.
The reader should check conventions carefully when approaching the literature.
Examples of structures without a multiplicative identity abound: for example, the
set $2\mathbb{Z}$ of even integers, with the usual addition and multiplication, satisfies all the
ring axioms given above with the exception of the existence of 1 (and is therefore
a rng). But in these notes all rings will have 1.

Of course the multiplicative identity is necessarily unique: the argument given
for Proposition II.1.6 works verbatim.

The identity element of the abelian group underlying a ring is denoted $0_R$ (or
simply 0, in context) and is called the ‘additive’ identity. This is a special element
with respect to multiplication:

Lemma 1.2. In a ring R,

$$0 \cdot r = 0 = r \cdot 0$$

for all $r \in R$.

Proof. Indeed, 0 = 0 + 0; hence, applying distributivity,

$$r \cdot 0 = r \cdot (0 + 0) = r \cdot 0 + r \cdot 0,$$

from which $r \cdot 0 = 0$ by cancellation (in the group $(R, +)$). The equality $0 \cdot r = 0$ is
proven similarly. □

It is equally easy to check that multiplication behaves as expected on
‘subtraction’. In fact, if $-1$ denotes the additive inverse of 1, then the additive inverse $-r$
of any $r \in R$ is the result of the multiplication $(-1) \cdot r$: indeed, using distributivity,

$$r + (-1) \cdot r = 1 \cdot r + (-1) \cdot r = (1 - 1) \cdot r = 0 \cdot r = 0$$

(by Lemma 1.2) from which $(-1)r = -r$ follows by (additive) cancellation.


1 The term ‘rng’ was introduced with this meaning by Jacobson; but essentially at the same
time Mac Lane introduced Rng as the name for the category of rings with identity. Hoping to steer
clear of this clash of terminology we have opted to call this category ‘Ring’.

1.2. First examples and special classes of rings.

Example 1.3. We can define a ring structure on a trivial group $\{*\}$ by letting
$* \cdot * = *$ (as well as $* + * = *$); this is often called the zero-ring. Note that 0 = 1
in this ring (cf. Exercise 1.1). ⌟

Example 1.4. More interesting examples are the number-based groups such as $\mathbb{Z}$
or $\mathbb{R}$, with the usual operations. These are very well known to our reader, who will
realize immediately that they satisfy the requirements given in Definition 1.1; but
they are very special. Why? ⌟

To begin with, note that multiplication is commutative in these examples; this 
is not among the requirements we have posed on rings in the official definition given 
above. 

Definition 1.5. A ring R is commutative if

• $(\forall r, s \in R) : \ r \cdot s = s \cdot r$. ⌟

Commutative rings (with identity) form an extremely important class of rings;
commutative algebra is the subfield of algebra studying them. We will focus on
commutative rings in later chapters; in this chapter we will develop some of the
basic theory for the more general case of arbitrary rings (with 1).

Example 1.6. An example of a noncommutative ring that is (likely) familiar to
the readers is the ring of $2 \times 2$ matrices with, say, real entries: matrices can be
added ‘componentwise’, and they can be multiplied as recalled in Example II.1.5;
the two operations satisfy the requirements in Definition 1.1.

Square matrices of any size, and with entries in any ring, form a ring
(Exercise 1.4). ⌟

Example 1.7. The reader is already familiar with a large class of (commutative)
rings: the groups $\mathbb{Z}/n\mathbb{Z}$, endowed with the multiplication defined in §II.2.3 (that is:
$[a]_n \cdot [b]_n := [ab]_n$; this is well-defined (cf. Exercise II.2.14)) satisfy the ring axioms
listed above. ⌟
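
The claims in Example 1.7 can be tested exhaustively for any particular n. The Python sketch below (an illustration; the choice n = 6 and the names `add` and `mul` are assumptions of the example) checks that the product on classes does not depend on representatives and that the multiplicative axioms of Definition 1.1 hold in $\mathbb{Z}/6\mathbb{Z}$.

```python
n = 6
R = range(n)                    # representatives 0..n-1 of the classes in Z/nZ

def add(a, b):
    return (a + b) % n

def mul(a, b):
    return (a * b) % n

# Well-defined: shifting representatives by multiples of n does not change [ab]_n.
well_defined = all(((a + n*j) * (b + n*k)) % n == mul(a, b)
                   for a in R for b in R
                   for j in range(-2, 3) for k in range(-2, 3))

associative = all(mul(mul(r, s), t) == mul(r, mul(s, t))
                  for r in R for s in R for t in R)
has_identity = all(mul(r, 1) == r == mul(1, r) for r in R)
distributive = all(mul(add(r, s), t) == add(mul(r, t), mul(s, t))
                   and mul(t, add(r, s)) == add(mul(t, r), mul(t, s))
                   for r in R for s in R for t in R)
commutative = all(mul(r, s) == mul(s, r) for r in R for s in R)
print(well_defined, associative, has_identity, distributive, commutative)
# True True True True True
```

A finite check of this kind is evidence for one value of n, not a proof for all n; the general argument is the one referred to in Exercise II.2.14.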

The rings $\mathbb{Z}/n\mathbb{Z}$ prompt us to highlight an important point. Another reason
why rings such as $\mathbb{Z}$, $\mathbb{Q}$, $\mathbb{R}$, . . . are special is that multiplicative cancellation by
nonzero elements holds in these rings. Of course additive cancellation is automatic,
since rings are in particular (abelian) groups; and multiplicative cancellation clearly
fails in general since one cannot ‘cancel 0’ (by Lemma 1.2). But even the fact that

$$(\forall a \in R),\ a \neq 0 : \quad a \cdot b = a \cdot c \implies b = c,$$

which holds, for example, in $\mathbb{Z}$, does not follow from the ring axioms.

Indeed, this cancellation property does not hold in all rings; it may well fail in
the rings $\mathbb{Z}/n\mathbb{Z}$. For example,

$$[2]_6 \cdot [4]_6 = [8]_6 = [2]_6 = [2]_6 \cdot [1]_6$$

even though $[4]_6 \neq [1]_6$.

The problem here is that in $\mathbb{Z}/6\mathbb{Z}$ there are elements $a \neq 0$ such that $a \cdot b = 0$
for some $b \neq 0$ (take $a = [2]_6$, $b = [3]_6$).

Definition 1.8. An element a in a ring R is a (left-)zero-divisor if there exist
elements $b \neq 0$ in R for which $ab = 0$. ⌟

The reader will have no difficulty figuring out what a right-zero-divisor should
be. The element 0 is a zero-divisor in all nonzero rings R; the zero ring is the only
ring without zero-divisors(!).
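
The failure of cancellation in $\mathbb{Z}/6\mathbb{Z}$ can be listed completely. In the following Python sketch (illustrative only; the variable names are chosen for this example) classes are represented by the integers 0 through 5.

```python
n = 6

# Pairs of nonzero classes whose product is zero in Z/6Z.
zero_divisor_pairs = [(a, b) for a in range(1, n) for b in range(1, n)
                      if (a * b) % n == 0]
print(zero_divisor_pairs)       # [(2, 3), (3, 2), (3, 4), (4, 3)]

# Triples (a, b, c) with a nonzero, b < c, and a*b = a*c: cancellation by a fails.
cancellation_failures = [(a, b, c) for a in range(1, n)
                         for b in range(n) for c in range(b + 1, n)
                         if (a * b) % n == (a * c) % n]
print((2, 1, 4) in cancellation_failures)   # True
```

The triple (2, 1, 4) is the example from the text: $[2]_6 \cdot [1]_6 = [2]_6 \cdot [4]_6$. The nonzero elements a for which cancellation fails are 2, 3, and 4, which are exactly the nonzero elements appearing in a zero-divisor pair.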

Proposition 1.9. In a ring R,

• $a \in R$ is not a left- (resp., right-) zero-divisor if and only if left (resp., right)
multiplication by a is an injective function $R \to R$.

In other words, a is not a left- (resp., right-) zero-divisor if and only if
multiplicative left- (resp., right-) cancellation by the element a holds in R.

Proof. Let’s verify the ‘left’ statement (the ‘right’ statement is of course entirely
analogous). Assume a is not a left-zero-divisor and $ab = ac$ for $b, c \in R$. Then, by
distributivity,

$$a(b - c) = ab - ac = 0,$$

and this implies $b - c = 0$ since a is not a left-zero-divisor; that is, $b = c$. This
proves that left-multiplication is injective in this case.

Conversely, if a is a left-zero-divisor, then $\exists b \neq 0$ such that $ab = 0 = a \cdot 0$; this
shows that left-multiplication is not injective in this case, concluding the proof. □
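
Proposition 1.9 can be observed directly in a finite commutative example. The Python sketch below is an illustration under the assumption R = $\mathbb{Z}/12\mathbb{Z}$ (so left and right coincide); the function names are chosen for this example. It compares, for each a, whether a is a zero-divisor in the sense of Definition 1.8 with whether multiplication by a is injective.

```python
def is_zero_divisor(a, n):
    """True if a*b = 0 in Z/nZ for some nonzero b (Definition 1.8)."""
    return any((a * b) % n == 0 for b in range(1, n))

def multiplication_is_injective(a, n):
    """True if r -> a*r is an injective function Z/nZ -> Z/nZ."""
    return len({(a * r) % n for r in range(n)}) == n

n = 12
# For every a, exactly one of the two conditions holds.
proposition_ok = all(is_zero_divisor(a, n) != multiplication_is_injective(a, n)
                     for a in range(n))
non_zero_divisors = [a for a in range(n) if not is_zero_divisor(a, n)]
print(proposition_ok, non_zero_divisors)    # True [1, 5, 7, 11]
```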

Rings such as Z, Q, etc., are commutative rings without (nonzero) zero-divisors. 
Such rings are very special, but very important, and they deserve their own termi- 
nology: 

Definition 1.10. An integral domain is a nonzero commutative ring R (with 1)
such that

$$(\forall a, b \in R) : \quad ab = 0 \implies a = 0 \text{ or } b = 0.$$ ⌟

Chapter V will be entirely devoted to integral domains. 

An element which is not a zero-divisor is called a non-zero-divisor. Thus,
integral domains are those nonzero commutative rings in which every nonzero element
is a non-zero-divisor. By Proposition 1.9, multiplicative cancellation by nonzero
elements holds in integral domains. The rings $\mathbb{Z}$, $\mathbb{Q}$, $\mathbb{R}$, $\mathbb{C}$ are all integral domains.
As we have seen, some $\mathbb{Z}/n\mathbb{Z}$ are not integral domains.

Here is one of those places where the reader can do him/herself a great favor by
pausing a moment and figuring something out: answer the question, which $\mathbb{Z}/n\mathbb{Z}$
are integral domains? This is entirely within reach, given what the reader knows
already. Don’t read ahead before figuring this out: this question will be answered
within a few short paragraphs, spoiling all the fun.

There are even subtler reasons why $\mathbb{Z}$ is a very special ring: we will see in due
time that it is a ‘UFD’ (unique factorization domain); in fact, it is a ‘PID’ (principal
ideal domain); in fact, it is more special still, as it is a ‘Euclidean domain’. All of
this will be discussed in Chapter V, particularly §V.2.

However, $\mathbb{Q}$, $\mathbb{R}$, $\mathbb{C}$ are more special than all of that and then some, since they
are fields.

Definition 1.11. An element u of a ring R is a left-unit if $\exists v \in R$ such that $uv = 1$;
it is a right-unit if $\exists v \in R$ such that $vu = 1$. Units are two-sided units. ⌟

Proposition 1.12. In a ring R: 

mi i is a left- (reap., right-) unit if and only if left- (reap., right-) multiplication 
by u is a surjective functions R —* R; 

m if u is a left- (resp., right-) unit, then right- (resp., left-) multiplication by u 
is injective; that is, u is not a right- (resp., left-) zero-divisor; 
m the inverse of a two-sided unit is unique; 
m two-sided units form a group under multiplication. 

Proof. These assertions are all straightforward. For example, denote by $\rho_u : R \to R$
right-multiplication by u, so that $\rho_u(r) = ru$. If u is a right-unit, let $v \in R$ be
such that $vu = 1_R$; then $\forall r \in R$

$\rho_u \circ \rho_v(r) = \rho_u(rv) = (rv)u = r(vu) = r 1_R = r.$

That is, $\rho_v$ is a right-inverse to $\rho_u$, and therefore $\rho_u$ is surjective (Proposition I.2.1).
Conversely, if $\rho_u$ is surjective, then there exists a v such that $1_R = \rho_u(v) = vu$,
so that u is a right-unit.

This checks the first statement, for right-units. 

For the second statement, denote by $\lambda_u : R \to R$ left-multiplication by u:
$\lambda_u(r) = ur$. Assume u is a right-unit, and let v be such that $vu = 1_R$; then $\forall r \in R$

$\lambda_v \circ \lambda_u(r) = \lambda_v(ur) = v(ur) = (vu)r = 1_R r = r.$

That is, $\lambda_v$ is a left-inverse to $\lambda_u$, so $\lambda_u$ is injective (Proposition I.2.1 again).

The rest of the proof is left to the reader (Exercise 1.9). □ 

Since the inverse of a two-sided unit u is unique, we can give it a name; of
course we denote it by $u^{-1}$. The reader should keep in mind that inverses of left- or
right-units are not unique in general, so the ‘inverse notation’ is not appropriate
for them.

Definition 1.13. A division ring is a ring in which every nonzero element is a 
two-sided unit.

We will mostly be concerned with the commutative case, which has its own 
name: 

Definition 1.14. A field is a nonzero commutative ring R (with 1) in which every
nonzero element is a unit.

The whole of Chapter VII will be devoted to studying fields. 

By Proposition 1.12 (second part), every field is an integral domain, but not 
conversely: indeed, Z is an integral domain, but it is not a field. Remember: 

field $\Rightarrow$ integral domain,
integral domain $\nRightarrow$ field.

There is a situation, however, in which the two notions coincide: 

Proposition 1.15. Assume R is a finite commutative ring; then R is an integral 
domain if and only if it is a field. 

Proof. One implication holds for all rings, as pointed out above; thus we only have 
to verify that if R is a finite integral domain, then it is a field. This amounts to 
verifying that if a is a non-zero-divisor in a finite (commutative) ring R, then it is 
a unit in R. 

Now, if a is a non-zero-divisor, then multiplication by a in R is injective
(Proposition 1.9); hence it is surjective, as the ring is finite, by the pigeon-hole
principle; hence a is a unit, by Proposition 1.12. □

Remark 1.16. A little surprisingly, the hypothesis of commutativity in Proposi-
tion 1.15 is actually superfluous: a theorem known as Wedderburn’s little theorem
shows that finite division rings are necessarily commutative. The reader will prove
this fact in a distant future (Exercise VII.5.14).

Example 1.17. The group of units in the ring Z/nZ is precisely the group $(Z/nZ)^*$
introduced in §II.2.3: indeed, a class $[m]_n$ is a unit if and only if (right-)
multiplication by $[m]_n$ is surjective (by Proposition 1.12), if and only if the map
$a \mapsto a[m]_n$ is surjective, if and only if $[m]_n$ generates Z/nZ, if and only if
gcd(m, n) = 1 (Corollary II.2.5), if and only if $[m]_n \in (Z/nZ)^*$.

In particular, those n for which all nonzero elements of Z/nZ are units (that is,
for which Z/nZ is a field) are precisely those $n \in Z$ for which gcd(m, n) = 1 for all
m that are not multiples of n; this is the case if and only if n is prime. Putting this
together with Proposition 1.15, we get the pretty classification (for integers $p \neq 0$)

Z/pZ integral domain $\iff$ Z/pZ field $\iff$ p prime,

which the reader is well advised to remember firmly.
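
The classification can be checked by brute force for small moduli. The following Python sketch is an illustration only and is not part of the text: the function names `units`, `coprime_classes`, `is_field`, and `is_prime` are made up here, classes in Z/nZ are represented by the integers 0, ..., n − 1, and only moduli n ≥ 2 are considered.

```python
from math import gcd

def units(n):
    """Classes m in Z/nZ having a multiplicative inverse, found by brute force."""
    return [m for m in range(n) if any((m * k) % n == 1 for k in range(n))]

def coprime_classes(n):
    """Classes m in Z/nZ with gcd(m, n) = 1."""
    return [m for m in range(n) if gcd(m, n) == 1]

def is_field(n):
    """Z/nZ is a field when it is nonzero and every nonzero class is a unit."""
    return n > 1 and len(units(n)) == n - 1

def is_prime(p):
    """Trial division; adequate for the small moduli used here."""
    return p > 1 and all(p % d != 0 for d in range(2, int(p ** 0.5) + 1))

# Units are exactly the classes coprime to n, and Z/nZ is a field iff n is prime.
for n in range(2, 30):
    assert units(n) == coprime_classes(n)
    assert is_field(n) == is_prime(n)

print(units(12))  # [1, 5, 7, 11]
```

A finite check of this kind proves nothing, of course, but it is a cheap way to catch a misremembered statement.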

Example 1.18. The rings Z/pZ, with p prime, are not the only finite fields. In 
fact, for every prime p and every integer r > 0 there is a (unique, in a suitable 
sense) multiplication on the product group 

$\underbrace{Z/pZ \times \cdots \times Z/pZ}_{r \text{ times}}$

making it into a field. A discussion of these fields will have to wait until we have
accumulated much more material (cf. §VII.5.1), but the reader could already try
to construct small examples ‘by hand’ (cf. Exercise 1.11).

1.3. Polynomial rings. We will study polynomial rings in some depth, especially 
over fields; they are another class of examples that is to some extent already familiar 
to our reader. I will capitalize on this familiarity and avoid a truly formal (and 
truly tedious) definition.

Definition 1.19. Let R be a ring. A polynomial f(x) in the indeterminate x and
with coefficients in R is a finite linear combination of nonnegative ‘powers’ of x
with coefficients in R:

$f(x) = \sum_{i \ge 0} a_i x^i = a_0 + a_1 x + a_2 x^2 + \cdots,$

where all $a_i$ are elements of R (the coefficients) and we require $a_i = 0$ for $i \gg 0$.
Two polynomials are taken to be equal if all the coefficients are equal:

$\sum_{i \ge 0} a_i x^i = \sum_{i \ge 0} b_i x^i \iff (\forall i \ge 0) : a_i = b_i.$

The set of polynomials in x over R is denoted by R[x]. Since all but finitely
many $a_i$ are assumed to be 0, one usually employs the notation

$f(x) = a_0 + a_1 x + \cdots + a_n x^n$

for $\sum_{i \ge 0} a_i x^i$ if $a_i = 0$ for i > n.

At this point the reader should just view all of this as a notation: a polynomial
really stands for an element of an infinite direct sum of the group (R, +). The
‘polynomial’ notation is more suggestive as it hints at what operations we are going
to impose on R[x]: if

$f(x) = \sum_{i \ge 0} a_i x^i$ and $g(x) = \sum_{i \ge 0} b_i x^i$,

then we define

$f(x) + g(x) := \sum_{i \ge 0} (a_i + b_i) x^i$

and

$f(x) \cdot g(x) := \sum_{k \ge 0} \sum_{i+j=k} a_i b_j x^{i+j}.$

To clarify this latter definition, see how it works for small k: $f(x) \cdot g(x)$ equals

$a_0 b_0 + (a_0 b_1 + a_1 b_0) x + (a_0 b_2 + a_1 b_1 + a_2 b_0) x^2 + (a_0 b_3 + a_1 b_2 + a_2 b_1 + a_3 b_0) x^3 + \cdots,$

that is, business as usual.
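
The two operations translate directly into code. In the following Python sketch (an illustration, not part of the text) a polynomial is stored as the list of its coefficients `[a_0, a_1, ..., a_n]`, with the empty list standing for 0; the coefficient ring is taken to be Z, and the names `poly_add` and `poly_mul` are hypothetical.

```python
def poly_add(f, g):
    """Coefficientwise sum of two coefficient lists [a_0, a_1, ...]."""
    n = max(len(f), len(g))
    f = f + [0] * (n - len(f))  # pad the shorter list with zero coefficients
    g = g + [0] * (n - len(g))
    return [a + b for a, b in zip(f, g)]

def poly_mul(f, g):
    """Product: the coefficient of x^k is the sum of a_i * b_j over i + j = k."""
    if not f or not g:
        return []
    h = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            h[i + j] += a * b  # the order a * b is kept, as in the definition
    return h

print(poly_mul([1, 1], [1, 1]))     # [1, 2, 1], that is, (1 + x)^2 = 1 + 2x + x^2
print(poly_mul([1, 2, 3], [4, 5]))  # [4, 13, 22, 15]
```

The double loop is exactly the double sum in the definition of the product, grouped by k = i + j.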

It is essentially straightforward (Exercise 1.13) to check that R[x], with these
operations, is a ring; the identity 1 of R is the identity of R[x], when viewed as a
polynomial (that is, $1_{R[x]} = 1_R + 0x + 0x^2 + \cdots$).

The degree of a nonzero polynomial $f(x) = \sum_{i \ge 0} a_i x^i$, denoted deg f(x), is the
largest integer d for which $a_d \neq 0$. This notion is very useful, but really behaves
well (Exercise 1.14) only if R is an integral domain: for example, note that over
R = Z/6Z

deg([1] + [2]x) = 1, deg([1] + [3]x) = 1, but
$\deg(([1] + [2]x) \cdot ([1] + [3]x)) = \deg([1] + [5]x) = 1 \neq 1 + 1.$
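
The failure of additivity of degrees over Z/6Z is easy to reproduce. The Python sketch below is illustrative only: polynomials over Z/nZ are stored as lists of representatives `[a_0, a_1, ...]`, and the helper names `poly_mul_mod` and `degree` are not from the text.

```python
def poly_mul_mod(f, g, n):
    """Product of two nonzero polynomials over Z/nZ, given as coefficient lists."""
    h = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            h[i + j] = (h[i + j] + a * b) % n
    return h

def degree(f):
    """Largest d with f[d] != 0, or None for the zero polynomial."""
    nonzero = [d for d, a in enumerate(f) if a != 0]
    return max(nonzero) if nonzero else None

f, g = [1, 2], [1, 3]            # 1 + 2x and 1 + 3x
h6 = poly_mul_mod(f, g, 6)       # over Z/6Z the x^2 coefficient 2 * 3 vanishes
h7 = poly_mul_mod(f, g, 7)       # over the integral domain Z/7Z it does not
print(h6, degree(h6))            # [1, 5, 0] 1
print(h7, degree(h7))            # [1, 5, 6] 2
```

The leading coefficients 2 and 3 are zero-divisors in Z/6Z, which is precisely what lets the top term disappear.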

Polynomials of degree 0 (together with 0) are called constants; they form a ‘copy’
of R in R[x], since the operations +, · on constant polynomials are nothing but the
original operations in R, up to this identification. It is sometimes convenient to
assign to the polynomial 0 the degree $-\infty$.

Polynomial rings in more indeterminates may be obtained by iterating this
construction:

R[x, y, z] := R[x][y][z];

elements of this ring may be written as ‘ordinary’ polynomials in three indetermi-
nates and are manipulated as usual. It can be checked easily that the construction
does not really depend on the order in which the indeterminates are listed, in the
sense that different orderings lead to isomorphic rings (in the sense soon to be de-
fined officially). Different indeterminates commute with each other; constructions
analogous to polynomial rings, but with noncommuting indeterminates, are also
very important, but we will not develop them in this book (we will glance at one
such notion in Example VIII.4.17).

We will occasionally consider a polynomial ring in infinitely many indetermi-
nates: for example, we will denote by $R[x_1, x_2, \dots]$ the case of countably many
indeterminates. Keep in mind, however, that polynomials are finite linear combi-
nations of finite products of the indeterminates; in particular, every given element
of $R[x_1, x_2, \dots]$ only involves finitely many indeterminates. An honest definition of
this ring involves direct limits, which await us in §VIII.1.4.

Rings of power series may be defined and are very useful; the ring of series
$\sum_{i \ge 0} a_i x^i = a_0 + a_1 x + a_2 x^2 + \cdots$ in x with coefficients in R, and evident operations,
is denoted R[[x]]. Regrettably, we will only occasionally encounter these rings in
this book.

The ring R[x] is (clearly) commutative if R is commutative; it is an integral
domain if R is an integral domain (Exercise 1.15); but it has no chances of being
a field even if R is a field, since x has no inverse in R[x]. The question of which
properties of R are ‘inherited’ by R[x] is subtle and important, and we will give it
a great deal of attention in later sections.

1.4. Monoid rings. The polynomial ring is an instance of a rather general con- 
struction, which is occasionally very useful. A semigroup is a set endowed with an 
associative operation; a monoid is a semigroup with an identity element. Thus a 
group is a monoid in which every element has an inverse; positive integers with 
ordinary addition form a semigroup, while the set N of natural numbers (that is, 
nonnegative integers 2 ) is a monoid under addition. 

Given a monoid (M, ·) and a ring R, we can obtain a new ring as follows.
Elements of R[M] are formal linear combinations

$\sum_{m \in M} a_m \cdot m$

where the ‘coefficients’ $a_m$ are elements of R and $a_m \neq 0$ for at most finitely many
summands (hence, as in §1.3, as an abelian group R[M] is nothing but the direct
sum $R^{\oplus M}$). Operations in R[M] are defined by

$\Big(\sum_{m \in M} a_m \cdot m\Big) + \Big(\sum_{m \in M} b_m \cdot m\Big) = \sum_{m \in M} (a_m + b_m) \cdot m,$

$\Big(\sum_{m \in M} a_m \cdot m\Big) \cdot \Big(\sum_{m \in M} b_m \cdot m\Big) = \sum_{m \in M} \sum_{m_1 m_2 = m} (a_{m_1} b_{m_2}) \cdot m.$

The identity in R[M] is $1_R \cdot 1_M$, viewed as a formal sum in which all other summands
have 0 as coefficient.

2 Some disagree, and insist that N should not include 0.
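
A finitely supported formal sum is naturally stored as a dictionary sending each monoid element to its nonzero coefficient. The Python sketch below illustrates the product under that assumption; the function name `monoid_ring_mul`, the choice of Z as coefficient ring, and the sample monoids are illustrative and not taken from the text.

```python
from collections import defaultdict

def monoid_ring_mul(f, g, op):
    """Multiply two elements of R[M] stored as {m: coefficient} dictionaries.

    `op` is the operation of the monoid M; coefficients are Python integers,
    so R = Z in this sketch.
    """
    h = defaultdict(int)
    for m1, a in f.items():
        for m2, b in g.items():
            h[op(m1, m2)] += a * b  # collect all pairs with m1 * m2 = m
    return {m: c for m, c in h.items() if c != 0}  # keep finite support only

plus = lambda a, b: a + b

# M = N under addition: the exponent k stands for x^k, so this is (1 + x)(1 - x).
print(monoid_ring_mul({0: 1, 1: 1}, {0: 1, 1: -1}, plus))  # {0: 1, 2: -1}

# M = Z under addition allows negative exponents: x^(-1) * x = 1.
print(monoid_ring_mul({-1: 1}, {1: 1}, plus))              # {0: 1}
```

The same function works for a noncommutative monoid, for instance strings under concatenation, since the order `op(m1, m2)` is respected.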

The reader will hopefully see the similarity with the construction of the poly-
nomial ring R[x] in §1.3; in fact (Exercise 1.17) the polynomial ring R[x] may be
interpreted as R[N].

Group rings are the result of this construction when M is in fact a group. The
group ring R[Z] is a ring of ‘Laurent polynomials’ $R[x, x^{-1}]$, allowing for negative
as well as positive exponents.


Exercises 


1.1. ▷ Prove that if 0 = 1 in a ring R, then R is a zero-ring. [§1.2]

1.2. ¬ Let S be a set, and define operations on the power set $\mathscr{P}(S)$ of S by setting
$\forall A, B \in \mathscr{P}(S)$

$A + B := (A \cup B) \smallsetminus (A \cap B), \quad A \cdot B := A \cap B$

[Figure: Venn diagrams of A + B and A · B]

(where the solid black contour indicates the set included in the operation). Prove
that $(\mathscr{P}(S), +, \cdot)$ is a commutative ring. [2.3, 3.15]

1.3. ¬ Let R be a ring, and let S be any set. Explain how to endow the set $R^S$ of
set-functions $S \to R$ with two operations +, · so as to make $R^S$ into a ring, such that
$R^S$ is just a copy of R if S is a singleton. [2.3]

1.4. ▷ The set of n × n matrices with entries in a ring R is denoted $M_n(R)$. Prove
that componentwise addition and matrix multiplication make $M_n(R)$ into a ring,
for any ring R. The notation $\mathfrak{gl}_n(R)$ is also commonly used, especially for $R = \mathbb{R}$
or $\mathbb{C}$ (although this indicates one is considering them as Lie algebras) in parallel with
the analogous notation for the corresponding groups of units; cf. Exercise II.6.1. In
fact, the parallel continues with the definition of the following sets of matrices:

• $\mathfrak{sl}_n(\mathbb{R}) = \{M \in \mathfrak{gl}_n(\mathbb{R}) \mid \mathrm{tr}(M) = 0\}$;

• $\mathfrak{sl}_n(\mathbb{C}) = \{M \in \mathfrak{gl}_n(\mathbb{C}) \mid \mathrm{tr}(M) = 0\}$;

• $\mathfrak{so}_n(\mathbb{R}) = \{M \in \mathfrak{gl}_n(\mathbb{R}) \mid M + M^t = 0\}$;

• $\mathfrak{su}_n(\mathbb{C}) = \{M \in \mathfrak{gl}_n(\mathbb{C}) \mid M + M^\dagger = 0\}$.

Here tr M is the trace of M, that is, the sum of its diagonal entries. The other
notation matches the notation used in Exercise II.6.1. Can we make rings of these sets
by endowing them with ordinary addition and multiplication of matrices? (These
sets are all Lie algebras; cf. Exercise VI.1.4.) [§1.2, 2.4, 5.9, VI.1.2, VI.1.4]

1.5. Let R be a ring. If a, b are zero-divisors in R, is a + b necessarily a zero-divisor?

1.6. ¬ An element a of a ring R is nilpotent if $a^n = 0$ for some n.

• Prove that if a and b are nilpotent in R and ab = ba, then a + b is also nilpotent.

• Is the hypothesis ab = ba in the previous statement necessary for its conclusion
to hold?

[3.12]

1.7. Prove that [m] is nilpotent in Z/nZ if and only if m is divisible by all prime
factors of n.

1.8. Prove that x = ±1 are the only solutions to the equation $x^2 = 1$ in an integral
domain. Find a ring in which the equation $x^2 = 1$ has more than 2 solutions.

1.9. ▷ Prove Proposition 1.12. [§1.2]

1.10. Let R be a ring. Prove that if $a \in R$ is a right-unit and has two or more
left-inverses, then a is not a left-zero-divisor and is a right-zero-divisor.

1.11. ▷ Construct a field with 4 elements: as mentioned in the text, the underlying
abelian group will have to be Z/2Z × Z/2Z; (0, 0) will be the zero element, and (1, 1)
will be the multiplicative identity. The question is what (0, 1) · (0, 1), (0, 1) · (1, 0),
(1, 0) · (1, 0) must be, in order to get a field. [§1.2, §V.5.1]

1.12. ▷ Just as complex numbers may be viewed as combinations a + bi, where
$a, b \in \mathbb{R}$ and i satisfies the relation $i^2 = -1$ (and commutes with $\mathbb{R}$), we may
construct a ring 3 $\mathbb{H}$ by considering linear combinations a + bi + cj + dk where
$a, b, c, d \in \mathbb{R}$ and i, j, k commute with $\mathbb{R}$ and satisfy the following relations:

$i^2 = j^2 = k^2 = -1, \quad ij = -ji = k, \quad jk = -kj = i, \quad ki = -ik = j.$

Addition in $\mathbb{H}$ is defined componentwise, while multiplication is defined by imposing
distributivity and applying the relations. For example,

$(1 + i + j) \cdot (2 + k) = 1 \cdot 2 + i \cdot 2 + j \cdot 2 + 1 \cdot k + i \cdot k + j \cdot k = 2 + 2i + 2j + k - j + i = 2 + 3i + j + k.$

(i) Verify that this prescription does indeed define a ring.

(ii) Compute (a + bi + cj + dk)(a − bi − cj − dk), where $a, b, c, d \in \mathbb{R}$.

(iii) Prove that $\mathbb{H}$ is a division ring.

Elements of $\mathbb{H}$ are called quaternions. Note that $Q_8 := \{\pm 1, \pm i, \pm j, \pm k\}$ forms a
subgroup of the group of units of $\mathbb{H}$; it is a noncommutative group of order 8, called
the quaternionic group.

(iv) List all subgroups of $Q_8$, and prove that they are all normal.

(v) Prove that $Q_8$, $D_8$ are not isomorphic.

(vi) Prove that $Q_8$ admits the presentation $(x, y \mid x^2 y^{-2}, y^4, xyx^{-1}y)$.

[§II.7.1, 2.4, IV.1.12, IV.5.16, IV.5.17, V.6.19]

3 The letter H is chosen in honor of William Rowan Hamilton.

1.13. ▷ Verify that the multiplication defined in R[x] is associative. [§1.3]

1.14. ▷ Let R be a ring, and let $f(x), g(x) \in R[x]$ be nonzero polynomials. Prove
that

$\deg(f(x) + g(x)) \le \max(\deg(f(x)), \deg(g(x))).$

Assuming that R is an integral domain, prove that

$\deg(f(x) \cdot g(x)) = \deg(f(x)) + \deg(g(x)).$

[§1.3]

1.15. ▷ Prove that R[x] is an integral domain if and only if R is an integral domain.
[§1.3]

1.16. Let R be a ring, and consider the ring of power series R[[x]] (cf. §1.3).

(i) Prove that a power series $a_0 + a_1 x + a_2 x^2 + \cdots$ is a unit in R[[x]] if and only
if $a_0$ is a unit in R. What is the inverse of 1 − x in R[[x]]?

(ii) Prove that R[[x]] is an integral domain if and only if R is.

1.17. ▷ Explain in what sense R[x] agrees with the monoid ring R[N]. [§1.4]


2. The category Ring 

2.1. Ring homomorphisms. Ring homomorphisms are defined in the natural 
way: if R, S are rings, a function $\varphi : R \to S$ is a ring homomorphism if it preserves
both operations and the identity element. That is, $\varphi$ must be a homomorphism of
the underlying abelian groups,

$(\forall a, b \in R) : \varphi(a + b) = \varphi(a) + \varphi(b),$

it must preserve the operation of multiplication,

$(\forall a, b \in R) : \varphi(ab) = \varphi(a)\varphi(b),$

and finally

$\varphi(1_R) = 1_S.$

It is evident that rings form a category, with ring homomorphisms as mor- 
phisms. We will denote this category by Ring. 

The zero-ring is clearly final in Ring. However, note that it is not initial: 
because of the requirement that ring homomorphisms send 1 to 1, the only rings 
to which zero-rings map homomorphically are the zero-rings (Exercise 2.1). 

The category Ring does have initial objects: the ring of integers Z (with the
usual operations +, ·) is initial in Ring. Indeed, for every ring R we can define a
group homomorphism $\varphi : Z \to R$ by

$(\forall n \in Z) : \varphi(n) = n \cdot 1_R,$

that is, as the ‘exponential map’ $\epsilon_{1_R}$ corresponding to $1_R \in R$; cf. §II.4.1. But $\varphi$ is
in fact a ring homomorphism, since $\varphi(1) = 1_R$, and

$\varphi(mn) = (mn) 1_R = m(n 1_R) \overset{!}{=} (m 1_R) \cdot (n 1_R) = \varphi(m) \cdot \varphi(n),$

where the equality $\overset{!}{=}$ holds by the distributivity axiom 4. This ring homomorphism
is unique, since it is determined by the requirement that $\varphi(1) = 1_R$ and by the fact
that $\varphi$ preserves addition. Thus for every ring R there exists one and only one ring
homomorphism $Z \to R$, showing that Z is initial in Ring.
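
To see the map $n \mapsto n \cdot 1_R$ at work in a ring that is not commutative, one may take R to be the ring of 2 × 2 integer matrices. The Python sketch below is an illustration under that assumption; matrices are stored as nested tuples and the names `mat_add`, `mat_mul`, and `phi` are hypothetical. Note that `phi` uses only addition in R, so its compatibility with multiplication is something to be checked rather than built in.

```python
def mat_add(A, B):
    """Sum of two 2x2 matrices given as nested tuples."""
    return tuple(tuple(A[i][j] + B[i][j] for j in range(2)) for i in range(2))

def mat_mul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

ZERO = ((0, 0), (0, 0))
ONE = ((1, 0), (0, 1))
MINUS_ONE = ((-1, 0), (0, -1))

def phi(n):
    """The multiple n * 1_R, computed by adding 1_R (or -1_R) to itself |n| times."""
    step = ONE if n >= 0 else MINUS_ONE
    result = ZERO
    for _ in range(abs(n)):
        result = mat_add(result, step)
    return result

# phi preserves both operations and sends 1 to 1_R on a small range of integers.
for m in range(-4, 5):
    for n in range(-4, 5):
        assert phi(m + n) == mat_add(phi(m), phi(n))
        assert phi(m * n) == mat_mul(phi(m), phi(n))
assert phi(1) == ONE
```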

This should already convince the reader that Z is a very special ring. There is 
in fact nothing arbitrary about Z: knowledge of the group Z determines the ring
structure of Z, as will be underscored below. This is one of the main reasons why 
we included the ‘identity’ axiom in the list in Definition 1.1: there are many noniso- 
morphic structures of ‘ring without identity’ on the group (Z, +) (cf. Exercise 2.15), 
but only one structure of ring with identity (Exercise 2.16). 

Ring homomorphisms preserve units: that is, if u is a (left-, resp., right-) unit
in R and $\varphi : R \to S$ is a ring homomorphism, then $\varphi(u)$ is a (left-, resp., right-)
unit. Indeed, if v is a (right-, say) inverse of u, then

$\varphi(u)\varphi(v) = \varphi(uv) = \varphi(1_R) = 1_S,$

so that $\varphi(v)$ is a (right-) inverse of $\varphi(u)$.

On the other hand, the image of a non-zero-divisor by a ring homomorphism
may well be a zero-divisor: for example, the canonical projection $\pi : Z \to Z/6Z$
is a ring homomorphism, and 2 is a non-zero-divisor in Z, yet $\pi(2) = [2]_6$ is a
zero-divisor.

2.2. Universal property of polynomial rings. Polynomial rings satisfy a uni-
versal property not unlike the one for free groups explored in §II.5.2. The simplest
(and possibly most useful) case is for the polynomial rings $Z[x_1, \dots, x_n]$, and with
respect to commutative rings; we will leave the reader the pleasure of stating and
proving fancier notions.

Let $A = \{a_1, \dots, a_n\}$ be a set of order n. Consider the category $\mathscr{R}_A$ whose
objects are pairs (j, R), where R is a commutative ring 5 and

$j : A \to R$

is a set-function (cf. §II.5.2!); morphisms

$(j_1, R_1) \to (j_2, R_2)$

are commutative diagrams

[Diagram: commutative triangle with $j_1 : A \to R_1$, $j_2 : A \to R_2$, and $\varphi : R_1 \to R_2$, so that $\varphi \circ j_1 = j_2$]

in which $\varphi$ is a ring homomorphism.

For example, $(i, Z[x_1, \dots, x_n])$ is an object of $\mathscr{R}_A$, where $i : A \to Z[x_1, \dots, x_n]$
sends $a_k$ to $x_k$.

4 The reader should parse this display carefully, as there is a potentially confusing mix of two
operations: multiples (such as $m 1_R$) and multiplication in R (explicitly denoted here by ·).

5 For the reader interested in generalizations: only the requirement that $j(a_1), \dots, j(a_n)$
commute with every element of R is needed here.

Proposition 2.1. $(i, Z[x_1, \dots, x_n])$ is initial in $\mathscr{R}_A$.

Proof. Let (j, R) be an arbitrary object of $\mathscr{R}_A$; we have to show that there is a
unique morphism $(i, Z[x_1, \dots, x_n]) \to (j, R)$, that is, there exists exactly one ring
homomorphism $\varphi : Z[x_1, \dots, x_n] \to R$ such that

[Diagram: commutative triangle with $i : A \to Z[x_1, \dots, x_n]$, $j : A \to R$, and $\varphi : Z[x_1, \dots, x_n] \to R$, so that $\varphi \circ i = j$]

commutes.

As usual in these verifications, the key point is that the requirements posed on
$\varphi$ force its definition. The postulated commutativity of the diagram means that
$\varphi(x_k) = j(a_k)$ for $k = 1, \dots, n$. Then, since $\varphi$ must be a ring homomorphism,
necessarily

$\varphi\Big(\sum m_{i_1 \dots i_n} x_1^{i_1} \cdots x_n^{i_n}\Big) = \sum \varphi(m_{i_1 \dots i_n})\, \varphi(x_1)^{i_1} \cdots \varphi(x_n)^{i_n}$

$= \sum \iota(m_{i_1 \dots i_n})\, j(a_1)^{i_1} \cdots j(a_n)^{i_n},$

where $\iota : Z \to R$ is the unique ring homomorphism (as Z is initial in Ring).

Thus, if $\varphi$ exists, then it is unique. On the other hand, the formula we just
obtained clearly 6 preserves the operations and sends 1 to 1, so it does define a ring
homomorphism, concluding the proof. □

Example 2.2. For n = 1, Proposition 2.1 says that if s is any element of a ring S,
then there is a unique ring homomorphism $Z[x] \to S$ sending x to s and ‘extending’
the unique ring homomorphism $\iota : Z \to S$. In this case the commutativity of S is
immaterial (why?).

Example 2.3. More generally, let $\alpha : R \to S$ be a fixed ring homomorphism, and
let $s \in S$ be an element commuting with $\alpha(r)$ for all $r \in R$. Then there is a unique
ring homomorphism $\bar\alpha : R[x] \to S$ extending $\alpha$ and sending x to s (Exercise 2.6).

In particular, we get an ‘evaluation map’ for polynomials over commutative
rings as follows. Given a polynomial $f(x) = \sum_{i \ge 0} a_i x^i \in R[x]$, every $r \in R$
determines an element

$f(r) = \sum_{i \ge 0} a_i r^i$:

this may be viewed as $\bar\alpha(f(x))$, where $\bar\alpha$ is obtained as above with $\alpha = \mathrm{id}_R : R \to R$
and s = r.

Thus, every polynomial f(x) determines a polynomial function $f : R \to R$,
defined by $r \mapsto f(r)$. It is a good idea to keep the two concepts of ‘polynomial’ and
‘polynomial function’ well distinct (cf. Exercise 2.7).
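
The distinction between a polynomial and its polynomial function is easy to observe over a finite ring. In the Python sketch below (illustrative only; the names `evaluate` and `polynomial_function`, the use of Horner's rule, and the choice R = Z/3Z are assumptions of this example) a polynomial over Z/nZ is a coefficient list `[a_0, a_1, ...]`, while a polynomial function is the tuple of its values at 0, ..., n − 1.

```python
def evaluate(f, r, n):
    """Value f(r) in Z/nZ of the polynomial f = [a_0, a_1, ...], by Horner's rule."""
    value = 0
    for a in reversed(f):
        value = (value * r + a) % n
    return value

def polynomial_function(f, n):
    """The function Z/nZ -> Z/nZ determined by f, stored as its tuple of values."""
    return tuple(evaluate(f, r, n) for r in range(n))

f = [0, 2, 0, 1]                    # x^3 + 2x over Z/3Z: a nonzero polynomial ...
print(polynomial_function(f, 3))    # (0, 0, 0): ... whose function vanishes identically
print(polynomial_function([0], 3))  # (0, 0, 0): the same function as the zero polynomial
```

Two coefficient lists that differ as polynomials can therefore determine the same tuple of values.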


6 Well, it is really clear that it preserves addition; multiplication requires a bit of work, which
the reader would be well advised to perform. This is where the commutativity of R is used.

2.3. Monomorphisms and epimorphisms. The kernel of a homomorphism
$\varphi : R \to S$ of rings is

$\ker \varphi := \{r \in R \mid \varphi(r) = 0\}$:

that is, it is nothing but the kernel of $\varphi$ when the latter is viewed as a homomor-
phism of groups. As such, $\ker \varphi$ is a subgroup of (R, +); it satisfies an even stronger
requirement 7, which we will explore at great length in §3.

The reader may hope that an analog to Proposition II.6.12 holds in Ring, and 
this is indeed the case: 

Proposition 2.4. For a ring homomorphism $\varphi : R \to S$, the following are equiva-
lent:

(a) $\varphi$ is a monomorphism;

(b) $\ker \varphi = \{0\}$;

(c) $\varphi$ is injective (as a set-function).

Proof. We prove (a) $\implies$ (b), leaving the rest to the reader. Assume $\varphi : R \to S$ is
a monomorphism and $r \in \ker \varphi$. Applying the extension property of Example 2.2,
we obtain unique ring homomorphisms $\mathrm{ev}_r : Z[x] \to R$ such that $\mathrm{ev}_r(x) = r$ and
$\mathrm{ev}_0 : Z[x] \to R$ such that $\mathrm{ev}_0(x) = 0$. Consider the parallel ring homomorphisms:

$Z[x] \underset{\mathrm{ev}_0}{\overset{\mathrm{ev}_r}{\rightrightarrows}} R \xrightarrow{\ \varphi\ } S$:

since $\varphi(r) = 0 = \varphi(0)$, the two compositions $\varphi \circ \mathrm{ev}_r$, $\varphi \circ \mathrm{ev}_0$ agree (because they
agree on Z and they agree on x); hence $\mathrm{ev}_r = \mathrm{ev}_0$ since $\varphi$ is a monomorphism.
Therefore

$r = \mathrm{ev}_r(x) = \mathrm{ev}_0(x) = 0.$

This proves $r \in \ker \varphi \implies r = 0$, that is, (b). □

By Proposition 2.4, if $S \to R$ is a monomorphism, then S may be identified
with a subset of R; the following definition formalizes this situation.

Definition 2.5. A subring S of a ring R is a ring whose underlying set is a subset
of R and such that the inclusion function $S \hookrightarrow R$ is a ring homomorphism.

Equivalently, a subring of R is a subset $S \subseteq R$ which contains $1_R$ and satisfies
the ring axioms with respect to the operations +, · induced from R. It is immedi-
ately checked that $S \subseteq R$ is a subring if it is a subgroup of (R, +), it is closed with
respect to ·, and it contains $1_R$. As a nonexample, the zero-ring is not a subring of
a nonzero ring R (because it does not contain $1_R$).

Proposition 2.4 may induce the reader to believe that Ring and Grp are rather 
similar from this general categorical standpoint. But a new phenomenon occurs 
concerning epimorphisms. A surjective map of rings is certainly an epimorphism
in Ring, since it is already an epimorphism in Set (we have run into this argument
earlier, e.g., ‘(c) $\implies$ (a)’ in Proposition II.8.18); but unlike as in Set, Grp, and Ab,

epimorphisms need not be surjective 8 in Ring. Indeed, consider the inclusion
homomorphism of rings

$\iota : Z \hookrightarrow Q$:

$\iota$ is not surjective, and hence it is not an epimorphism in Set or Ab (can the reader
‘describe’ coker $\iota$? Exercise 2.12); but it is an epimorphism of rings. Indeed, if $\alpha_1$,
$\alpha_2$ are parallel ring homomorphisms

$Z \overset{\iota}{\hookrightarrow} Q \underset{\alpha_2}{\overset{\alpha_1}{\rightrightarrows}} R$

and $\alpha_1$, $\alpha_2$ agree 9 on Z, say $\alpha = \alpha_1|_Z = \alpha_2|_Z$, then they must agree on Q: because
for $p, q \in Z$, $q \neq 0$,

$\alpha_i\left(\frac{p}{q}\right) = \alpha_i(p)\,\alpha_i(q^{-1}) = \alpha(p)\,\alpha(q)^{-1} \quad (i = 1, 2)$

is the same for both 10. Thus, $\iota$ satisfies the categorical requirement for epimor-
phisms (cf. §I.2.6).

7 Note that $\ker \varphi$ cannot be a subring in any reasonable sense, according to our definition of
ring, since it does not contain a multiplicative identity in all but very pathological situations. It
is a subring if the identity requirement is omitted.

Warning: Thus, in Ring, a homomorphism may well be both a monomorphism 
and an epimorphism without being an isomorphism! 

2.4. Products. Products exist in Ring; if $R_1$, $R_2$ are rings, then $R_1 \times R_2$ may
be defined by endowing the direct product of groups $R_1 \times R_2$ (cf. §II.3.4) with
componentwise multiplication. Thus, both operations on $R_1 \times R_2$ are defined
componentwise: $\forall (a_1, b_1), (a_2, b_2) \in R_1 \times R_2$,

$(a_1, b_1) + (a_2, b_2) := (a_1 + a_2, b_1 + b_2),$

$(a_1, b_1) \cdot (a_2, b_2) := (a_1 \cdot a_2, b_1 \cdot b_2).$

The identity in $R_1 \times R_2$ is $(1_{R_1}, 1_{R_2})$. The reader will check (Exercise 2.13) that
$R_1 \times R_2$ is indeed a categorical product in Ring.
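
For a concrete instance of the componentwise operations, the Python sketch below builds the product of two rings of the form Z/nZ. It is an illustration only: the function name `make_product_ring` and the choice of factors are assumptions of this example, and elements of the product are stored as pairs of representatives.

```python
def make_product_ring(n1, n2):
    """Return (add, mul, one) for the product ring Z/n1Z x Z/n2Z."""
    def add(x, y):
        return ((x[0] + y[0]) % n1, (x[1] + y[1]) % n2)

    def mul(x, y):
        return ((x[0] * y[0]) % n1, (x[1] * y[1]) % n2)

    one = (1 % n1, 1 % n2)  # the pair of the two identities
    return add, mul, one

add, mul, one = make_product_ring(4, 6)
print(add((3, 5), (2, 4)))  # (1, 3)
print(mul((3, 5), (2, 4)))  # (2, 2)
print(mul(one, (3, 5)))     # (3, 5)
```

Since both operations act on each coordinate separately, each projection to a factor preserves +, · and the identity, which is the starting point for the universal property.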

The reader should keep in mind that in general this is not the only ring struc- 
ture one can define on the direct product of underlying groups. For example, as 
mentioned in §1.2, one can define a field whose underlying group is Z/pZ x Z/pZ, 
while the product ring Z/pZ x Z/pZ is very far from being a field (why?). 

The situation with coproducts is more unfortunate— dealing with this in any 
generality (even for the case of commutative rings) requires tensor products, and 
by the time we develop tensor products (§VIII.2), we will have almost forgotten 
this question. But the universal property reviewed in §2.2 suffices to deal with 
simple examples, and the reader should work out one template case now, for fun 
(Exercise 2.14). 

Incidentally, with the case of abelian groups in mind (§II.3.5), the reader may
be tempted to consider a ‘direct sum of rings’, agreeing with the product in the finite
case and maybe giving something interesting in general. This is less promising than
it looks: it does not satisfy the universal property of coproducts in Ring (why?),
and, further, the ‘infinite version’ is not a ring because it is missing the identity.

8 Unfortunately, some references define ring epimorphisms as ‘surjective ring
homomorphisms’; this should be discouraged.

9 As they must, since Z is initial.

10 Note that $\alpha(q)$ must have a double-sided inverse in R since q has one in Q, namely 1/q.
In particular, $\alpha_1(q^{-1})$, $\alpha_2(q^{-1})$ must agree, since they must equal this unique inverse of $\alpha(q)$:
cf. Proposition 1.12.

2.5. $\mathrm{End}_{Ab}(G)$. This is as good a place as any to expand a little on one of the main
examples of rings, and one that is not quite as special as the examples reviewed in
§1.2.

For every abelian group G, the group $\mathrm{End}_{Ab}(G) := \mathrm{Hom}_{Ab}(G, G)$ of endo-
morphisms of G is a ring, under the operations of addition and composition (as
discussed in the paragraph preceding Definition 1.1). Indeed, associativity follows
from the category axioms, and distributivity is immediately checked.

It is instructive to examine carefully the endomorphism ring of the abelian 
group (Z,+): 

Proposition 2.6. $\mathrm{End}_{Ab}(Z) \cong Z$ as rings.

Proof. Consider the function

$\varphi : \mathrm{End}_{Ab}(Z) \to Z$

defined by

$\varphi(\alpha) = \alpha(1)$

for all group homomorphisms $\alpha : Z \to Z$. Then $\varphi$ is a group homomorphism: the
addition in $\mathrm{End}_{Ab}(Z)$ is defined so that $\forall n \in Z$

$(\alpha + \beta)(n) = \alpha(n) + \beta(n)$

(cf. §II.4.4); in particular

$\varphi(\alpha + \beta) = (\alpha + \beta)(1) = \alpha(1) + \beta(1) = \varphi(\alpha) + \varphi(\beta).$

Further, $\varphi$ is a ring homomorphism. Indeed, for $\alpha$, $\beta$ in $\mathrm{End}_{Ab}(Z)$ denote $\alpha(1)$
by a; then

$\alpha(n) = n\alpha(1) = na = an$

for all $n \in Z$; in particular,

$\alpha(\beta(1)) = a\beta(1) = \alpha(1)\beta(1).$

Therefore,

$\varphi(\alpha \circ \beta) = (\alpha \circ \beta)(1) = \alpha(\beta(1)) = \alpha(1)\beta(1) = \varphi(\alpha)\varphi(\beta)$

as needed. Also, $\varphi(\mathrm{id}_Z) = \mathrm{id}_Z(1) = 1$.

Finally, $\varphi$ has an inverse: for $a \in Z$, let $\psi(a)$ be the homomorphism $\alpha : Z \to Z$
defined by

$(\forall n \in Z) : \alpha(n) = an$;

the reader will easily check that $\psi$ is a ring homomorphism and inverse to $\varphi$.
Therefore $\varphi$ is a ring isomorphism, verifying the statement. □

Proposition 2.6 gives a sense in which the ring structure on Z is truly ‘natu-
ral’: this structure arises naturally from categorical considerations, there is nothing
‘arbitrary’ about it.
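
The correspondence between endomorphisms of (Z, +) and integers can be mimicked with Python functions. The sketch below is only an illustration of the proof: `psi`, `phi`, `compose`, and `add` are hypothetical names, endomorphisms are modeled as Python callables, and the checks run over a small range of integers rather than proving anything.

```python
def psi(a):
    """The endomorphism n -> a * n of (Z, +) attached to the integer a."""
    return lambda n: a * n

def phi(alpha):
    """Evaluation at 1, sending an endomorphism alpha to the integer alpha(1)."""
    return alpha(1)

def compose(alpha, beta):
    """Composition alpha o beta, the multiplication in the endomorphism ring."""
    return lambda n: alpha(beta(n))

def add(alpha, beta):
    """Pointwise sum alpha + beta, the addition in the endomorphism ring."""
    return lambda n: alpha(n) + beta(n)

for a in range(-5, 6):
    assert phi(psi(a)) == a                            # phi o psi is the identity
    for b in range(-5, 6):
        assert phi(add(psi(a), psi(b))) == a + b       # addition corresponds to +
        assert phi(compose(psi(a), psi(b))) == a * b   # composition corresponds to ·
```

Composition of the maps "multiply by a" and "multiply by b" is "multiply by ab", which is the content of the multiplicativity of $\varphi$.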

More generally, the rings $\mathrm{End}_{Ab}(G)$ and other rings arising as endomorphism
rings of other structures (e.g., vector spaces) are arguably the most important class
of examples of rings. The entire theory of modules is based on the observation that
$\mathrm{End}_{Ab}(G)$ is a ring for every abelian group G.

Some of the notions reviewed in §1.2 are expressed very concretely in these
endomorphism rings; for example, the group of units in $\mathrm{End}_{Ab}(G)$ is nothing but
$\mathrm{Aut}_{Ab}(G)$.

Every ring R interacts with the ring of endomorphisms of the underlying abelian
group (R, +). For $r \in R$, define left- and right-multiplication by r by $\lambda_r$, $\mu_r$,
respectively. That is, $\forall a \in R$

$\lambda_r(a) = ra, \quad \mu_r(a) = ar.$

The following observation is nothing more than easy notation juggling, but it is
useful; it is a ‘ring analog’ of Cayley’s theorem (Theorem II.9.5):

Proposition 2.7. Let R be a ring. Then the function $r \mapsto \lambda_r$ is an injective ring
homomorphism

$\lambda : R \to \mathrm{End}_{Ab}(R).$

Proof. For any $r \in R$ and for all $a, b \in R$, distributivity gives

$\lambda_r(a + b) = r(a + b) = ra + rb = \lambda_r(a) + \lambda_r(b)$:

this shows that $\lambda_r$ is indeed an endomorphism of the group (R, +), that is,
$\lambda_r \in \mathrm{End}_{Ab}(R)$.

The function $\lambda : R \to \mathrm{End}_{Ab}(R)$ defined by the assignment $r \mapsto \lambda_r$ is clearly
injective, since if $r \neq s$, then

$\lambda_r(1) = r \neq s = \lambda_s(1),$

so that $\lambda_r \neq \lambda_s$.

We have to verify that $\lambda$ is in fact a homomorphism of rings. Recall that the
addition in $\mathrm{End}_{Ab}(R)$ is inherited from R (cf. §II.4.4): for all $r, s \in R$, $\lambda_r + \lambda_s$ is
defined by

$(\forall a \in R) : (\lambda_r + \lambda_s)(a) = \lambda_r(a) + \lambda_s(a).$

Thus, the fact that $\lambda$ preserves addition is an immediate consequence of distribu-
tivity: for all $r, s, a \in R$,

$\lambda_{r+s}(a) = (r + s)a = ra + sa = \lambda_r(a) + \lambda_s(a).$

As for multiplication, it is associativity’s turn to do its job:

$\lambda_{rs}(a) = (rs)a = r(sa) = r\lambda_s(a) = \lambda_r(\lambda_s(a)) = (\lambda_r \circ \lambda_s)(a).$

Of course $\lambda_1$ is the identity, completing the verification. □
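
For a finite ring the map $\lambda$ can be tabulated completely. The Python sketch below does this for R = Z/6Z, as an illustration only: each endomorphism of (R, +) is stored as the tuple of its values at 0, ..., n − 1, and the names `lam`, `compose`, and `add` are hypothetical.

```python
def lam(r, n):
    """Left-multiplication by r on Z/nZ, stored as the tuple of its values."""
    return tuple((r * a) % n for a in range(n))

def compose(f, g):
    """The composition f o g of two functions stored as tuples of values."""
    return tuple(f[g[a]] for a in range(len(g)))

def add(f, g, n):
    """The pointwise sum f + g in Z/nZ."""
    return tuple((f[a] + g[a]) % n for a in range(n))

n = 6
for r in range(n):
    for s in range(n):
        assert lam((r + s) % n, n) == add(lam(r, n), lam(s, n), n)   # additivity
        assert lam((r * s) % n, n) == compose(lam(r, n), lam(s, n))  # multiplicativity
assert lam(1, n) == tuple(range(n))                 # lambda_1 is the identity
assert len({lam(r, n) for r in range(n)}) == n      # lambda is injective
```

Injectivity is visible in the table because the entry at position 1 of `lam(r, n)` is r itself, just as in the proof.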






The function $\mu : R \to \mathrm{End}_{Ab}(R)$ defined by $r \mapsto \mu_r$ is ‘almost’ a ring homo-
morphism: the reader will check (Exercise 2.18) that

$\mu_{r+s} = \mu_r + \mu_s,$

$\mu_{rs} = \mu_s \circ \mu_r,$

and $\mu_1 = \mathrm{id}_R$. That is, $\mu$ would in fact be a ring homomorphism if we ‘reversed’
multiplication 11 in R. Of course this issue disappears if R happens to be commu-
tative (since then $\lambda = \mu$).
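
The reversal of order is only visible in a noncommutative ring. The Python sketch below uses the 16-element ring of 2 × 2 matrices over Z/2Z as a test case; this choice, the encoding of a matrix [[a, b], [c, d]] as the tuple (a, b, c, d), and the names `mul`, `mu`, and `compose` are assumptions of the example and not part of the text.

```python
from itertools import product

# All 2x2 matrices over Z/2Z, stored as 4-tuples (a, b, c, d) for [[a, b], [c, d]].
R = list(product(range(2), repeat=4))

def mul(x, y):
    """Matrix product over Z/2Z."""
    a, b, c, d = x
    e, f, g, h = y
    return ((a * e + b * g) % 2, (a * f + b * h) % 2,
            (c * e + d * g) % 2, (c * f + d * h) % 2)

def mu(r):
    """Right-multiplication by r, stored as the dictionary a -> a * r."""
    return {a: mul(a, r) for a in R}

def compose(f, g):
    """The composition f o g of two functions R -> R stored as dictionaries."""
    return {a: f[g[a]] for a in R}

# mu reverses the order of products: mu_{rs} = mu_s o mu_r for all r, s.
assert all(mu(mul(r, s)) == compose(mu(s), mu(r)) for r in R for s in R)

# It is not multiplicative in the usual order: here r * s differs from s * r.
r, s = (0, 1, 0, 0), (0, 0, 1, 0)
assert mu(mul(r, s)) != compose(mu(r), mu(s))
```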


Exercises 


2.1. ▷ Prove that if there is a homomorphism from a zero-ring to a ring R, then R
is a zero-ring. [§2.1]

2.2. Let R and S be rings, and let $\varphi : R \to S$ be a function preserving both
operations +, ·.

• Prove that if $\varphi$ is surjective, then necessarily $\varphi(1_R) = 1_S$.

• Prove that if $\varphi \neq 0$ and S is an integral domain, then $\varphi(1_R) = 1_S$.

(Therefore, in both cases $\varphi$ is in fact a ring homomorphism.)

2.3. Let S be a set, and consider the power set ring $\mathscr{P}(S)$ (Exercise 1.2) and
the ring $(Z/2Z)^S$ you constructed in Exercise 1.3. Prove that these two rings are
isomorphic. (Cf. Exercise I.2.11.)


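2.4. Define functions $\mathbb{H} \to \mathfrak{gl}_4(\mathbb{R})$ and $\mathbb{H} \to \mathfrak{gl}_2(\mathbb{C})$ (cf. Exercises 1.4 and 1.12) by

$a + bi + cj + dk \mapsto \begin{pmatrix} a & b & c & d \\ -b & a & -d & c \\ -c & d & a & -b \\ -d & -c & b & a \end{pmatrix},$

$a + bi + cj + dk \mapsto \begin{pmatrix} a + bi & c + di \\ -c + di & a - bi \end{pmatrix}$

for all $a, b, c, d \in \mathbb{R}$. Prove that both functions are injective ring homomorphisms.
Thus, quaternions may be viewed as real or complex matrices.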


2.5. ¬ The norm of a quaternion $w = a + bi + cj + dk$, with $a, b, c, d \in \mathbb{R}$, is the
real number $N(w) = a^2 + b^2 + c^2 + d^2$.

Prove that the function from the multiplicative group $\mathbb{H}^*$ of nonzero quaternions
to the multiplicative group $\mathbb{R}^+$ of positive real numbers, defined by assigning to
each nonzero quaternion its norm, is a homomorphism. Prove that the kernel of
this homomorphism is isomorphic to $\mathrm{SU}_2(\mathbb{C})$ (cf. Exercise II.6.3). [4.10, IV.5.17,
V.6.19]


11 This issue is entirely analogous to the business of right-actions vs. left-actions; cf. §II.9.3,
especially Exercise II.9.3.

2.6. ▷ Verify the ‘extension property’ of polynomial rings, stated in Example 2.3.
[§2.2]

2.7. ▷ Let $R = \mathbb{Z}/2\mathbb{Z}$, and let $f(x) = x^2 - x$; note $f(x) \neq 0$. What is the polynomial
function $R \to R$ determined by $f(x)$? [§2.2, §V.4.2, §V.5.1]

2.8. Prove that every subring of a field is an integral domain.

2.9. ¬ The center of a ring $R$ consists of the elements $a$ such that $ar = ra$ for all
$r \in R$. Prove that the center is a subring of $R$.

Prove that the center of a division ring is a field. [2.11, IV.2.17, VII.5.14,
VII.5.16]

2.10. ¬ The centralizer of an element $a$ of a ring $R$ consists of the elements $r \in R$
such that $ar = ra$. Prove that the centralizer of $a$ is a subring of $R$, for every $a \in R$.

Prove that the center of $R$ is the intersection of all its centralizers.

Prove that every centralizer in a division ring is a division ring. [2.11, IV.2.17,
VII.5.16]

2.11. ¬ Let $R$ be a division ring consisting of $p^2$ elements, where $p$ is a prime.
Prove that $R$ is commutative, as follows:

• If $R$ is not commutative, then its center $C$ (Exercise 2.9) is a proper subring of
$R$. Prove that $C$ would then consist of $p$ elements.

• Let $r \in R$, $r \notin C$. Prove that the centralizer of $r$ (Exercise 2.10) contains both
$r$ and $C$.

• Deduce that the centralizer of $r$ is the whole of $R$.

• Derive a contradiction, and conclude that $R$ had to be commutative (hence, a
field).

This is a particular case of Wedderburn’s theorem: every finite division ring is a
field. [IV.2.17, VII.5.16]

2.12. ▷ Consider the inclusion map $\iota : \mathbb{Z} \hookrightarrow \mathbb{Q}$. Describe the cokernel of $\iota$ in Ab
and its cokernel in Ring (as defined by the appropriate universal property in the
style of the one given in §II.8.6). [§2.3, §5]

2.13. ▷ Verify that the ‘componentwise’ product $R_1 \times R_2$ of two rings satisfies the
universal property for products in a category, given in §I.5.4. [§2.4]

2.14. ▷ Verify that $\mathbb{Z}[x_1, x_2]$ (along with the evident morphisms) satisfies the
universal property for the coproduct of two copies of $\mathbb{Z}[x]$ in the category of
commutative rings. Explain why it does not satisfy it in Ring. [§2.4]

2.15. ▷ For $m > 1$, the abelian groups $(\mathbb{Z}, +)$ and $(m\mathbb{Z}, +)$ are manifestly
isomorphic: the function $\varphi : \mathbb{Z} \to m\mathbb{Z}$, $n \mapsto mn$ is a group isomorphism. Use this
isomorphism to transfer the structure of ‘ring without identity’ $(m\mathbb{Z}, +, \cdot)$ back
onto $\mathbb{Z}$: give an explicit formula for the ‘multiplication’ $\bullet$ this defines on $\mathbb{Z}$ (that
is, such that $\varphi(a \bullet b) = \varphi(a) \cdot \varphi(b)$). Explain why structures induced by different
positive integers $m$ are nonisomorphic as ‘rings without 1’.

(This shows that there are many different ways to give a structure of ring
without identity to the group $(\mathbb{Z}, +)$. Compare this observation with Exercise 2.16.)
[§2.1]

2.16. ▷ Prove that there is (up to isomorphism) only one structure of ring with
identity on the abelian group $(\mathbb{Z}, +)$. (Hint: Let $R$ be a ring whose underlying
group is $\mathbb{Z}$. By Proposition 2.7, there is an injective ring homomorphism
$\lambda : R \to \mathrm{End}_{\mathsf{Ab}}(R)$, and the latter is isomorphic to $\mathbb{Z}$ by Proposition 2.6. Prove that
$\lambda$ is surjective.) [§2.1, 2.15]

2.17. ¬ Let $R$ be a ring, and let $E = \mathrm{End}_{\mathsf{Ab}}(R)$ be the ring of endomorphisms of
the underlying abelian group $(R, +)$. Prove that the center of $E$ is isomorphic to
the center of $R$. (Prove that if $\alpha \in E$ commutes with all right-multiplications by
elements of $R$, then $\alpha$ is left-multiplication by an element of $R$; then use
Proposition 2.7.) In particular, this shows that if $R$ is commutative, then the center of
$\mathrm{End}_{\mathsf{Ab}}(R)$ is isomorphic to $R$. [VIII.3.15]

2.18. ▷ Verify the statements made about right-multiplication $\rho$, following
Proposition 2.7. [§2.5]

2.19. Prove that for $n \in \mathbb{Z}$ a positive integer, $\mathrm{End}_{\mathsf{Ab}}(\mathbb{Z}/n\mathbb{Z})$ is isomorphic to $\mathbb{Z}/n\mathbb{Z}$
as a ring.


3. Ideals and quotient rings 

3.1. Ideals. In both Set and Grp we have been able to ‘classify’ surjective mor- 
phisms: in both cases, a surjective morphism is, up to natural identifications, a 
quotient by a (suitable) equivalence relation. In Grp, we have seen that such equiv- 
alence relations arise in fact from certain substructures: normal subgroups. 

The situation in Ring is analogous. We will establish a canonical decomposition 
for rings, modeled after Theorems I.2.7 and II.8.1; the corresponding version of the
‘first isomorphism theorem’ (Corollary II.8.2) will identify every surjective ring 
homomorphism with a quotient by a suitable substructure. The role of normal 
subgroups will be played by ideals. 

Definition 3.1. Let $R$ be a ring. A subgroup $I$ of $(R, +)$ is a left-ideal of $R$ if
$rI \subseteq I$ for all $r \in R$; that is,

$$(\forall r \in R)(\forall a \in I) : \quad ra \in I;$$

it is a right-ideal if $Ir \subseteq I$ for all $r \in R$; that is,

$$(\forall r \in R)(\forall a \in I) : \quad ar \in I.$$

A two-sided ideal is a subgroup $I$ which is both a left- and a right-ideal. ⌟

Of course in a commutative ring there is no distinction between left- and
right-ideals. Even in the general setting, we will almost exclusively be concerned with
two-sided ideals; thus I will omit qualifiers, and an ideal of a ring will implicitly be
two-sided.

Remark 3.2. As seen in §2.3, a subring of $R$ is a subset $S \subseteq R$ which contains $1_R$
and satisfies the ring axioms with respect to the operations $+$, $\cdot$ induced from $R$.

Ideals are close to being subrings: they are subgroups, and they are closed
with respect to multiplication. But the only ideal of a ring $R$ containing $1_R$ is
$R$ itself: this is an immediate consequence of the ‘absorption properties’ stated in
Definition 3.1. Thus ideals are in general not subrings; they are ‘rngs’. ⌟

Ideals are considerably more important than subrings in the development of
the theory of rings¹². Of course the image of a ring homomorphism is necessarily a
subring of the target; but a lesson learned from §II.8.1 and following is that kernels
really capture the structure of a homomorphism, and kernels are ideals:

Example 3.3. Let $\varphi : R \to S$ be any ring homomorphism. Then $\ker\varphi$ is an ideal
of $R$.

Indeed, we know already that $\ker\varphi$ is a subgroup; we have to verify the
absorption properties. These are an immediate consequence of Lemma 1.2: for all $r \in R$,
all $a \in \ker\varphi$, we have

$$\varphi(ra) = \varphi(r)\varphi(a) = \varphi(r) \cdot 0 = 0,$$

$$\varphi(ar) = \varphi(a)\varphi(r) = 0 \cdot \varphi(r) = 0.$$

More generally, it is easy to verify that the inverse image of an ideal is an ideal
(Exercise 3.2) and $\{0_S\}$ is clearly an ideal of $S$.

Similarly to the situation with normal subgroups in the context of groups, we
will soon see that ‘kernels of ring homomorphisms’ and ‘ideals’ are in fact equivalent
concepts. ⌟

3.2. Quotients. Let $I$ be a subgroup of the abelian group $(R, +)$ of a ring $R$.
Subgroups of abelian groups are automatically normal, so we have a quotient group
$R/I$, whose elements are the cosets of $I$:

$$r + I$$

(written, of course, in additive notation). Further, we have a surjective group
homomorphism

$$\pi : R \to R/I, \qquad r \mapsto r + I.$$

As we have explored in great detail for groups, this construction satisfies a
suitable universal property with respect to group homomorphisms (Theorem II.7.12).
Of course we are now going to ask under what circumstances this construction can
be performed in Ring, satisfying the analogous universal property with respect to
ring homomorphisms.

That is, what should we ask of $I$, in order to have a ring structure on $R/I$,
so that $\pi$ becomes a ring homomorphism? Go figure this out for yourself, before
reading ahead!


12 Arguably, the reason is that ideals are precisely the submodules of a ring R.

As so often is the case, the requirement is precisely what tells us the answer.
Since $\pi$ will have to be a ring homomorphism, the multiplication in $R/I$ must be
as follows: for all $(a + I)$, $(b + I)$ in $R/I$,

$$(a + I) \cdot (b + I) = \pi(a) \cdot \pi(b) \overset{!}{=} \pi(ab) = ab + I,$$

where the equality $\overset{!}{=}$ is forced if $\pi$ is to be a ring homomorphism.

This says that there is only one sensible ring structure on $R/I$, given by

$$(\forall a, b \in R) : \quad (a + I)(b + I) := ab + I.$$

The reader should realize right away that if this operation is well-defined, then it
does make $R/I$ into a ring: associativity will be inherited from the associativity in
$R$, and the identity will simply be the coset $1 + I$ of $1$.

So all is well, if the proposed operation is well-defined. But is it going to be 
well-defined? 

Example 3.4. It need not be, if $I$ is an arbitrary subgroup of $R$. For example,
take $\mathbb{Z}$ as a subgroup of $\mathbb{Q}$; then

$$0 + \mathbb{Z} = 1 + \mathbb{Z}$$

($= \mathbb{Z}$) as elements of the group $\mathbb{Q}/\mathbb{Z}$, and $\frac{1}{2} + \mathbb{Z}$ is another coset, yet

$$0 \cdot \tfrac{1}{2} + \mathbb{Z} = \mathbb{Z} \neq \tfrac{1}{2} + \mathbb{Z} = 1 \cdot \tfrac{1}{2} + \mathbb{Z}.$$ ⌟

Assume then that the operation is well-defined, so that $R/I$ is a ring and
$\pi : R \to R/I$ is a ring homomorphism. What does this say about $I$?

Answer: $I$ is the kernel of $R \to R/I$, so necessarily $I$ must be an ideal, as seen
in Example 3.3!

Conversely, let us assume $I$ is an ideal of $R$, and verify that the proposed
prescription for the operation in $R/I$ is well-defined. For this, suppose

$$a' + I = a'' + I \quad\text{and}\quad b' + I = b'' + I;$$

recall that this means that $a'' - a' \in I$, $b'' - b' \in I$; then

$$a''b'' - a'b' = a''b'' - a''b' + a''b' - a'b' = a''(b'' - b') + (a'' - a')b' \in I,$$

using both the left-absorption and right-absorption properties of Definition 3.1.
This says precisely that

$$a'b' + I = a''b'' + I,$$

proving that the operation is well-defined.

Summarizing, we have verified that $R/I$ is a ring, in such a way that the
canonical projection $\pi : R \to R/I$ is a ring homomorphism, if and only if $I$ is an
ideal of $R$.
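The contrast between Example 3.4 and the verification above can be illustrated numerically. This is a sketch only: the helper names are illustrative, and the second check samples finitely many representatives of each coset rather than proving anything.

```python
from fractions import Fraction

def same_coset_mod_Z(x, y):
    """x + Z == y + Z in Q/Z exactly when x - y is an integer."""
    return (x - y).denominator == 1

# Z is a subgroup of Q but not an ideal: products depend on representatives.
half = Fraction(1, 2)
zero, one = Fraction(0), Fraction(1)
reps_agree = same_coset_mod_Z(zero, one)                     # 0 + Z == 1 + Z
products_agree = same_coset_mod_Z(zero * half, one * half)   # Z  vs  1/2 + Z

def product_well_defined_mod(n, window=3):
    """Test (a + nZ)(b + nZ) := ab + nZ on several representatives of each coset.

    Representatives a + n*k with |k| <= window are tried; nZ is an ideal of Z,
    so the class of the product should never change.
    """
    shifts = [n * k for k in range(-window, window + 1)]
    return all(
        ((a + s) * (b + t) - a * b) % n == 0
        for a in range(n) for b in range(n) for s in shifts for t in shifts)
```

Here `reps_agree` holds while `products_agree` fails, reproducing Example 3.4; by contrast `product_well_defined_mod(n)` succeeds for every positive `n` tried, as the argument with $a''(b'' - b') + (a'' - a')b'$ predicts.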

Definition 3.5. This ring $R/I$ is called the quotient ring of $R$ modulo $I$. ⌟

Example 3.6. We know that all subgroups of $(\mathbb{Z}, +)$ are of the form $n\mathbb{Z}$ for a
nonnegative integer $n$ (Proposition II.6.9). It is immediately verified that all
subgroups of $\mathbb{Z}$ are in fact ideals of the ring $(\mathbb{Z}, +, \cdot)$. The quotients $\mathbb{Z}/n\mathbb{Z}$ are of course
nothing but the rings so-denoted in §1.2 (and earlier).

The fact that $\mathbb{Z}$ is initial in Ring now prompts a natural definition. For a ring $R$,
let $f : \mathbb{Z} \to R$ be the unique ring homomorphism, defined by $a \mapsto a \cdot 1_R$. Then
$\ker f = n\mathbb{Z}$ for a well-defined nonnegative integer $n$ determined by $R$.

Definition 3.7. The characteristic of $R$ is this nonnegative integer $n$. ⌟

Thus, the characteristic of $R$ is $n > 0$ if the order of $1_R$ as an element of $(R, +)$
is a positive integer $n$, while the characteristic is $0$ if the order of $1_R$ is $\infty$. ⌟
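For a finite ring the characteristic can be found by adding $1_R$ to itself until $0$ is reached. The sketch below does this for products $\mathbb{Z}/m_1\mathbb{Z} \times \cdots \times \mathbb{Z}/m_k\mathbb{Z}$ with componentwise operations; this family of rings and the function name are choices made for illustration. Characteristic $0$ cannot be detected this way, since the loop would not terminate.

```python
def characteristic_of_product(moduli):
    """Characteristic of Z/m1 x ... x Z/mk: the additive order of (1, ..., 1).

    Computed naively by repeatedly adding the identity to itself.
    """
    one = tuple(1 % m for m in moduli)
    zero = tuple(0 for _ in moduli)
    current, n = one, 1
    while current != zero:
        current = tuple((c + o) % m for c, o, m in zip(current, one, moduli))
        n += 1
    return n
```

For instance, `characteristic_of_product((5,))` returns 5, and `characteristic_of_product((4, 6))` returns 12, the least common multiple of 4 and 6.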

We have now fulfilled our promise to identify the notions of ideal and kernel of
ring homomorphisms: every kernel is an ideal (cf. Example 3.3); and on the other
hand every ideal $I$ is the kernel of the ring homomorphism $R \to R/I$. We have
the slogan:

$$\text{kernel} \iff \text{ideal}$$

in the context of ring theory.

The key universal property holds in this context as it does for groups; cf.
Theorem II.7.12. We reproduce the statement here for reference, but the reader should
realize that very little needs to be proven at this point: the needed (group)
homomorphism exists and is unique by Theorem II.7.12, and verifying it is a ring
homomorphism is immediate.

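Theorem 3.8. Let $I$ be a two-sided ideal of a ring $R$. Then for every ring
homomorphism $\varphi : R \to S$ such that $I \subseteq \ker\varphi$ there exists a unique ring homomorphism
$\tilde\varphi : R/I \to S$ so that the diagram

$$R \xrightarrow{\ \pi\ } R/I \xrightarrow{\ \exists!\,\tilde\varphi\ } S, \qquad \tilde\varphi \circ \pi = \varphi$$

commutes.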

As a reminder to the lazy reader, $\tilde\varphi$ is defined by

$$\tilde\varphi(r + I) := \varphi(r);$$

(part of) the content of the theorem is that this function is well-defined (if
$I \subseteq \ker\varphi$), and it is a ring homomorphism.
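A concrete instance may help. Take $R = \mathbb{Z}$, $I = n\mathbb{Z}$, and $\varphi : \mathbb{Z} \to \mathbb{Z}/m\mathbb{Z}$ the reduction modulo $m$, so that $\ker\varphi = m\mathbb{Z}$ and the hypothesis $I \subseteq \ker\varphi$ says $m \mid n$. The sketch below (illustrative names; a finite sample of representatives, not a proof) tests whether $r + n\mathbb{Z} \mapsto r + m\mathbb{Z}$ is well-defined and whether the induced map respects the operations.

```python
def induced_map_is_well_defined(n, m, window=3):
    """Is r + nZ -> r + mZ independent of the representative r?

    Tries the representatives r + n*k for |k| <= window.
    """
    return all((r + n * k) % m == r % m
               for r in range(n) for k in range(-window, window + 1))

def induced_map(n, m):
    """The induced map Z/nZ -> Z/mZ as a dict on 0..n-1 (assumes m divides n)."""
    return {r: r % m for r in range(n)}

# The case n = 12, m = 4: 12Z is contained in 4Z.
phi_tilde = induced_map(12, 4)
is_ring_hom = phi_tilde[1] == 1 and all(
    phi_tilde[(a + b) % 12] == (phi_tilde[a] + phi_tilde[b]) % 4 and
    phi_tilde[(a * b) % 12] == (phi_tilde[a] * phi_tilde[b]) % 4
    for a in range(12) for b in range(12))
```

With $n = 12$ the map is well-defined for $m = 4$ but not for $m = 5$, since $12\mathbb{Z} \not\subseteq 5\mathbb{Z}$.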

3.3. Canonical decomposition and consequences. As the reader should now
expect, Theorem 3.8 is the key element in a standard decomposition of every ring
homomorphism. This is entirely analogous to the decomposition of set-functions
studied in §I.2.8 and the decomposition of group homomorphisms obtained in §II.8.1.
Here is the statement:

Theorem 3.9. Every ring homomorphism $\varphi : R \to S$ may be decomposed as
follows:

$$\varphi : \; R \twoheadrightarrow R/\ker\varphi \xrightarrow[\tilde\varphi]{\ \sim\ } \operatorname{im}\varphi \hookrightarrow S$$

where the isomorphism $\tilde\varphi$ in the middle is the homomorphism induced by $\varphi$ (as in
Theorem 3.8).

The reader will realize that this statement requires no proof at this point: the 
decomposition holds at the level of groups (by Theorem II.8.1) and the maps are 
all ring homomorphisms as observed earlier in this section. 

The ‘first isomorphism theorem’ for rings is an immediate corollary: 

Corollary 3.10. Suppose $\varphi : R \to S$ is a surjective ring homomorphism. Then

$$S \cong \frac{R}{\ker\varphi}.$$

As in the group case, the reader should develop the healthy instinctive reaction 
of viewing every surjective homomorphism of rings as a quotient (by an ideal), up 
to a natural identification. 

Equally instinctive should be the realization that ideals of a quotient $R/I$ are in
one-to-one correspondence with ideals of $R$ containing $I$. Once more, not a whole
lot is new here: we already know (cf. §II.8.3) that the function

$$u : \{\text{subgroups } J \text{ of } R \text{ containing } I\} \to \{\text{subgroups of } R/I\}$$

defined by $u(J) = J/I$ is a bijection preserving inclusions; it takes a moment to
check that $J/I$ is an ideal of $R/I$ if and only if $J$ is an ideal of $R$. The main content of
this observation is best packaged in the corresponding ‘third isomorphism theorem’:

Proposition 3.11. Let $I$ be an ideal of a ring $R$, and let $J$ be an ideal of $R$
containing $I$. Then $J/I$ is an ideal of $R/I$, and

$$\frac{R/I}{J/I} \cong \frac{R}{J}.$$

Proof. Since $I \subseteq J = \ker(R \to R/J)$, we have an induced ring homomorphism

$$\varphi : R/I \to R/J$$

by Theorem 3.8. Explicitly, $\varphi(r + I) = r + J$; $\varphi$ is manifestly surjective. Since

$$\ker\varphi = \{r + I \mid \varphi(r + I) = J\} = \{r + I \mid r + J = J\} = \{r + I \mid r \in J\} = J/I,$$

we see that $J/I$ is an ideal (since it is a kernel), and the stated isomorphism follows
from Corollary 3.10. □
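The correspondence between ideals of $R/I$ and ideals of $R$ containing $I$ can be seen by brute force in a small case. The sketch below assumes $R = \mathbb{Z}$ and $I = 12\mathbb{Z}$: it enumerates all ideals of $\mathbb{Z}/12\mathbb{Z}$ directly from Definition 3.1 and compares them with the images of the ideals $d\mathbb{Z} \supseteq 12\mathbb{Z}$, that is, with $d \mid 12$. Function names are illustrative.

```python
from itertools import combinations

def ideals_of_Z_mod(n):
    """Brute-force set of the ideals of Z/nZ, each as a frozenset of residues."""
    residues = range(n)
    ideals = set()
    for size in range(1, n + 1):
        for subset in combinations(residues, size):
            s = set(subset)
            closed_under_addition = all((a + b) % n in s for a in s for b in s)
            absorbs_products = all((r * a) % n in s for r in residues for a in s)
            if 0 in s and closed_under_addition and absorbs_products:
                ideals.add(frozenset(s))
    return ideals

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

# Ideals of Z containing 12Z are dZ with d | 12; these are their images in Z/12Z.
expected = {frozenset(range(0, 12, d)) for d in divisors(12)}

# Kernel of Z/12Z -> Z/4Z, r + 12Z |-> r + 4Z: this is J/I for J = 4Z, I = 12Z.
kernel = frozenset(r for r in range(12) if r % 4 == 0)
```

The enumeration finds exactly the six ideals in `expected`, and `kernel` (the classes of 0, 4, 8) is one of them; it has index 4 in $\mathbb{Z}/12\mathbb{Z}$, in agreement with $(\mathbb{Z}/12\mathbb{Z})/(4\mathbb{Z}/12\mathbb{Z}) \cong \mathbb{Z}/4\mathbb{Z}$.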

What about the ‘second’ isomorphism theorem? This would be a relation
between the ideals

$$\frac{I + J}{I}, \qquad \frac{J}{I \cap J}$$

of the rings $R/I$, $R/(I \cap J)$, respectively, assuming $I$ and $J$ are ideals of $R$ (and
where it is immediately checked that $I + J$ and $I \cap J$ are indeed ideals¹³).

The reader may want to go back to §II.8.4 for the version of this story for
groups. Our feeling is that Ring is not the best place to play this game, since
$(I + J)/I$, $J/(I \cap J)$ are not even rings according to my conventions. Modules will
be a more natural context for this result.

13 The notation $I + J$ should not surprise the reader: $I$, $J$ are subgroups of the abelian
group $R$, so we know what it means to ‘add’ them; cf. §II.7.1.

In any case, the reader will benefit the most from exploring the matter on 
his/her own; cf. Exercise 3.17. 


Exercises 


3.1. Prove that the image of a ring homomorphism $\varphi : R \to S$ is a subring of $S$.
What can you say about $\varphi$ if its image is an ideal of $S$? What can you say about $\varphi$
if its kernel is a subring of $R$?

3.2. ▷ Let $\varphi : R \to S$ be a ring homomorphism, and let $J$ be an ideal of $S$. Prove
that $I = \varphi^{-1}(J)$ is an ideal of $R$. [§3.1]

3.3. ¬ Let $\varphi : R \to S$ be a ring homomorphism, and let $J$ be an ideal of $R$.

• Show that $\varphi(J)$ need not be an ideal of $S$.

• Assume that $\varphi$ is surjective; then prove that $\varphi(J)$ is an ideal of $S$.

• Assume that $\varphi$ is surjective, and let $I = \ker\varphi$; thus we may identify $S$ with $R/I$.
Let $\bar{J} = \varphi(J)$, an ideal of $R/I$ by the previous point. Prove that

$$\frac{R/I}{\bar{J}} \cong \frac{R}{I + J}.$$

(Of course this is just a rehash of Proposition 3.11.) [4.11]

3.4. Let $R$ be a ring such that every subgroup of $(R, +)$ is in fact an ideal of $R$.
Prove that $R \cong \mathbb{Z}/n\mathbb{Z}$, where $n$ is the characteristic of $R$.

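3.5. ¬ Let $J$ be a two-sided ideal of the ring $\mathcal{M}_n(R)$ of $n \times n$ matrices over a ring $R$.
Prove that a matrix $A \in \mathcal{M}_n(R)$ belongs to $J$ if and only if the matrices obtained
by placing any entry of $A$ in any position, and 0 elsewhere, belong to $J$. (Hint:
Carefully contemplate the operation

$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 0 & a & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}.)$$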

3.6. ¬ Let $J$ be a two-sided ideal of the ring $\mathcal{M}_n(R)$ of $n \times n$ matrices over a
ring $R$, and let $I \subseteq R$ be the set of $(1, 1)$ entries of matrices in $J$. Prove that $I$ is
a two-sided ideal of $R$ and $J$ consists precisely of those matrices whose entries all
belong to $I$. (Hint: Exercise 3.5.) [3.9]

3.7. Let $R$ be a ring, and let $a \in R$. Prove that $Ra$ is a left-ideal of $R$ and $aR$ is
a right-ideal of $R$. Prove that $a$ is a left-, resp. right-, unit if and only if $R = aR$,
resp. $R = Ra$.

3.8. ▷ Prove that a ring $R$ is a division ring if and only if its only left-ideals and
right-ideals are $\{0\}$ and $R$.

In particular, a commutative ring $R$ is a field if and only if the only ideals of $R$
are $\{0\}$ and $R$. [3.9, §4.3]

3.9. ¬ Counterpoint to Exercise 3.8: It is not true that a ring $R$ is a division
ring if and only if its only two-sided ideals are $\{0\}$ and $R$. A nonzero ring with this
property is said to be simple; by Exercise 3.8, fields are the only simple commutative
rings.

Prove that $\mathcal{M}_n(\mathbb{R})$ is simple. (Use Exercise 3.6.) [4.20]

3.10. ▷ Let $\varphi : k \to R$ be a ring homomorphism, where $k$ is a field and $R$ is a
nonzero ring. Prove that $\varphi$ is injective. [§V.4.2, §V.5.2]

3.11. Let $R$ be a ring containing $\mathbb{C}$ as a subring. Prove that there are no ring
homomorphisms $R \to \mathbb{R}$.

3.12. ▷ Let $R$ be a commutative ring. Prove that the set of nilpotent elements of
$R$ is an ideal of $R$. (Cf. Exercise 1.6. This ideal is called the nilradical of $R$.)

Find a noncommutative ring in which the set of nilpotent elements is not an
ideal. [3.13, 4.18, V.3.13, §VII.2.3]

3.13. ¬ Let $R$ be a commutative ring, and let $N$ be its nilradical (cf. Exercise 3.12).
Prove that $R/N$ contains no nonzero nilpotent elements. (Such a ring is said to be
reduced.) [4.6, VII.2.8]

3.14. ¬ Prove that the characteristic of an integral domain is either 0 or a prime
integer. Do you know any ring of characteristic 1? [V.4.17]

3.15. ¬ A ring $R$ is¹⁴ Boolean if $a^2 = a$ for all $a \in R$. Prove that $\mathscr{P}(S)$ is Boolean,
for every set $S$ (cf. Exercise 1.2). Prove that every Boolean ring is commutative, and
has characteristic 2. Prove that if an integral domain $R$ is Boolean, then $R \cong \mathbb{Z}/2\mathbb{Z}$.
[4.23, V.6.3]

3.16. ¬ Let $S$ be a set and $T \subseteq S$ a subset. Prove that the subsets of $S$ contained
in $T$ form an ideal of the power set ring $\mathscr{P}(S)$. Prove that if $S$ is finite, then every
ideal of $\mathscr{P}(S)$ is of this form. For $S$ infinite, find an ideal of $\mathscr{P}(S)$ that is not of
this form. [V.1.5]

3.17. ▷ Let $I$, $J$ be ideals of a ring $R$. State and prove a precise result relating the
ideals $(I + J)/I$ of $R/I$ and $J/(I \cap J)$ of $R/(I \cap J)$. [§3.3]


4. Ideals and quotients: Remarks and examples. 

Prime and maximal ideals 

4.1. Basic operations. It is often convenient to define ideals in terms of a set of 
generators. 

Let $a \in R$ be any element of a ring. Then the subset $I = Ra$ of $R$ is a left-ideal
of $R$. Indeed, for all $r \in R$ we have

$$rI = rRa \subseteq Ra$$

as needed. Similarly, $aR$ is a right-ideal.


14 After George Boole.

In the commutative case, these two subsets coincide and are denoted $(a)$. This
is the principal ideal generated by $a$. For example, the zero-ideal $\{0\} = (0)$ and the
whole ring $R = (1)$ are both principal ideals.

In general (Exercise 4.1), if $\{I_\alpha\}_{\alpha \in A}$ is a family of ideals of a ring $R$, then the
sum $\sum_\alpha I_\alpha$ is an ideal of $R$. If $a_\alpha$ is any collection of elements of a commutative
ring $R$, then

$$(a_\alpha)_{\alpha \in A} := \sum_{\alpha \in A} (a_\alpha)$$

is the ideal generated by the elements $a_\alpha$. In particular,

$$(a_1, \dots, a_n) = (a_1) + \cdots + (a_n)$$

is the smallest ideal of $R$ containing $a_1, \dots, a_n$; this ideal consists of the elements
of $R$ that may be written as

$$r_1 a_1 + \cdots + r_n a_n$$

for $r_1, \dots, r_n \in R$. An ideal $I$ of a commutative ring $R$ is finitely generated if
$I = (a_1, \dots, a_n)$ for some $a_1, \dots, a_n \in R$.


Example 4.1. It is a good idea to get used to a bit of ‘calculus’ of ideals and
quotients in terms of generators; judicious use of the isomorphism theorems yields
convenient statements. For example, let $R$ be a commutative ring, and let $a, b \in R$;
denote by $\bar{b}$ the class of $b$ in $R/(a)$. Then

$$(R/(a))/(\bar{b}) \cong R/(a, b).$$

Indeed, this is a particular case of Proposition 3.11, since

$$(\bar{b}) = \frac{(a, b)}{(a)}$$

as ideals of $R/(a)$. ⌟


Note that principal ideals are (very special) finitely generated ideals. These 
notions are so important that we give special names to rings in which they are 
satisfied by every ideal. 

Definition 4.2. A commutative ring $R$ is Noetherian if every ideal of $R$ is finitely
generated. ⌟

Definition 4.3. An integral domain $R$ is a PID (‘Principal Ideal Domain’) if every
ideal of $R$ is principal. ⌟

Thus, PIDs are (very special) Noetherian rings. In due time we will deal at 
length with these classes of rings (cf. Chapter V); Noetherian rings are very impor- 
tant in number theory and algebraic geometry. 

The reader is already familiar with an important PID: 

Proposition 4.4. Z is a PID. 


Proof. Let $I \subseteq \mathbb{Z}$ be an ideal. Since $I$ is a subgroup, $I = n\mathbb{Z}$ for some $n \in \mathbb{Z}$, by
Proposition II.6.9. Since $n\mathbb{Z} = (n)$, this shows that $I$ is principal. □

The fact that $\mathbb{Z}$ is a PID captures precisely ‘why’ greatest common divisors
behave as they do in $\mathbb{Z}$: if $m$, $n$ are integers, then the ideal $(m, n)$ must be principal,
and hence

$$(m, n) = (d)$$

for some (positive) integer $d$. This integer is manifestly the gcd of $m$ and $n$: since
$m \in (d)$ and $n \in (d)$, then $d \mid m$ and $d \mid n$, etc.
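The identification of the generator of $(m, n)$ with the gcd can be tested directly: the positive generator of an ideal of $\mathbb{Z}$ is its smallest positive element, so one may search the combinations $rm + sn$ for the smallest positive value. The sketch below does this with coefficients in a bounded window; the bound is an assumption adequate for small inputs, not part of the mathematics.

```python
from math import gcd

def smallest_positive_combination(m, n, bound=30):
    """Smallest positive element of {r*m + s*n} with |r|, |s| <= bound.

    For nonzero m, n and a large enough bound this is the positive
    generator of the ideal (m, n) of Z.
    """
    values = {r * m + s * n
              for r in range(-bound, bound + 1)
              for s in range(-bound, bound + 1)}
    return min(v for v in values if v > 0)
```

For example, `smallest_positive_combination(12, 18)` returns 6 and `smallest_positive_combination(35, 64)` returns 1, matching `gcd(12, 18)` and `gcd(35, 64)`.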

If $k$ is a field, the ring of polynomials $k[x]$ is also a PID; proving this is easy,
using the ‘division with remainder’ that we will run into very soon (§4.2); the reader
should work this out on his/her own (Exercise 4.4) now. This fact will be absorbed
in the general theory when we review the general notion of ‘Euclidean domain’, in
§V.2.4.

By contrast, the ring $\mathbb{Z}[x]$ is not a PID: indeed, the reader should be able to
verify that the ideal $(2, x)$ cannot be generated by a single element. As we will see
in due time, greatest common divisors make good sense in a ring such as $\mathbb{Z}[x]$, but
the matter is a little more delicate, since this ring is not a PID¹⁵.

There are several more basic operations involving ideals; for now, the following 
two will suffice. 

• Again assume that $\{I_\alpha\}_{\alpha \in A}$ is a collection of ideals of a ring $R$. Then the
intersection $\bigcap_{\alpha \in A} I_\alpha$ is (clearly) an ideal of $R$; it is the largest ideal contained
in all of the ideals $I_\alpha$.

• If $I$, $J$ are ideals of $R$, then $IJ$ denotes the ideal generated by all products
$ij$ with $i \in I$, $j \in J$. More generally, if $I_1, \dots, I_n$ are ideals in $R$, then the
‘product’ $I_1 \cdots I_n$ denotes the ideal generated by all products $i_1 \cdots i_n$ with
$i_k \in I_k$.

The reader should note the clash of notation: in the context of groups (especially
§II.8.4) $IJ$ would mean something else. Watch out!

It is clear that $IJ \subseteq I \cap J$: every element $ij$ with $i \in I$ and $j \in J$ is in $I$ (because
$I$ is a right-ideal) and in $J$ (because $J$ is a left-ideal); therefore $I \cap J$ contains all
products $ij$, and hence it must contain the ideal $IJ$ they generate. Sometimes the
product agrees with the intersection:

$$(4) \cap (3) = (12) = (4) \cdot (3) \quad\text{in } \mathbb{Z};$$

and sometimes it does not:

$$(4) \cap (6) = (12) \neq (24) = (4) \cdot (6).$$

The matter of whether $IJ = I \cap J$ is often subtle; a prototype situation in which
this equality holds is given in Exercise 4.5.
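The two computations in $\mathbb{Z}$ can be reproduced mechanically. The sketch below relies on two standard facts about $\mathbb{Z}$ that are not proved in the text at this point: for positive $a$, $b$ the intersection $(a) \cap (b)$ is generated by the least common multiple, and the product $(a)(b)$ is generated by $ab$. The helper `multiples` lists an ideal within a finite window so that the intersection can also be checked directly.

```python
from math import gcd

def intersection_generator(a, b):
    """Positive generator of (a) intersect (b) in Z: the lcm of a and b."""
    return a * b // gcd(a, b)

def product_generator(a, b):
    """Positive generator of the product ideal (a)(b) in Z."""
    return a * b

def multiples(d, limit):
    """The elements of the ideal (d) lying in [-limit, limit]."""
    return {k for k in range(-limit, limit + 1) if k % d == 0}
```

For $(4)$ and $(3)$ both generators equal 12; for $(4)$ and $(6)$ they are 12 and 24, and within any window the multiples of 24 form a proper subset of the multiples of 12, in agreement with $IJ \subseteq I \cap J$.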

4.2. Quotients of polynomial rings. We have already observed that the quo- 
tient Z/nZ is our familiar ring of congruence classes modulo n. Quotients of poly- 
nomial rings by principal ideals are a good source of ‘concrete’, but maybe less 
familiar, examples. 

15 It is, however, a ‘UFD’, that is, a ‘unique factorization domain’. This suffices for a good
notion of gcd; cf. §V.2.1.

Let $R$ be a (nonzero) ring, and let

$$f(x) = x^d + a_{d-1}x^{d-1} + \cdots + a_1 x + a_0 \in R[x]$$

be a polynomial; for convenience, we are assuming that $f(x)$ is monic, that is, its
leading coefficient (the coefficient of the highest power of $x$ appearing in $f(x)$) is 1.
In terms of ideals, this is not a serious requirement if the coefficient ring $R$ is a
field (Exercise 4.7), but it may be substantial otherwise: for example, $(2x) \subseteq \mathbb{Z}[x]$
cannot be generated by a monic polynomial. Also note that a monic polynomial
is necessarily a non-zero-divisor (Exercise 4.8) and that if $f(x)$ is monic, then
$\deg(f(x)q(x)) = \deg f(x) + \deg q(x)$ for all polynomials $q(x)$.

It is convenient to assume that $f(x)$ is monic because we can then divide by
$f(x)$, with remainder. That is, if $g(x) \in R[x]$ is another polynomial, then there
exist polynomials $q(x), r(x) \in R[x]$ such that

$$g(x) = f(x)q(x) + r(x)$$

and¹⁶ $\deg r(x) < \deg f(x)$. This is simply the process of ‘long division’ of
polynomials, which is surely familiar to the reader, and can be performed over any ring
when dividing by monic polynomials¹⁷.
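The long division just described uses only addition and multiplication of coefficients, which is why it works over any ring when the divisor is monic. Here is a minimal sketch over $\mathbb{Z}$, with polynomials stored as coefficient lists, lowest degree first; this representation and the function names are choices made for illustration.

```python
def divide_by_monic(g, f):
    """Divide g by a monic f; coefficient lists, lowest degree first.

    Returns (q, r) with g = f*q + r and len(r) <= deg f, i.e. deg r < deg f.
    Only ring operations on coefficients are used (no division).
    """
    if f[-1] != 1:
        raise ValueError("f must be monic")
    d = len(f) - 1
    r = list(g)
    q = [0] * max(len(g) - d, 1)
    # Cancel the leading term a*x^n of the running remainder with a*x^(n-d)*f(x).
    for n in range(len(r) - 1, d - 1, -1):
        a = r[n]
        q[n - d] = a
        for i, c in enumerate(f):
            r[n - d + i] -= a * c
    return q, r[:d]

def recombine(f, q, r):
    """Coefficient list of f*q + r, with trailing zeros removed."""
    out = [0] * (len(f) + len(q) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(q):
            out[i + j] += a * b
    for i, c in enumerate(r):
        out[i] += c
    while len(out) > 1 and out[-1] == 0:
        out.pop()
    return out
```

For instance, dividing $g(x) = x^3 + 2x + 5$ by $f(x) = x^2 + 1$ gives quotient $x$ and remainder $x + 5$, that is, `divide_by_monic([5, 2, 0, 1], [1, 0, 1])` returns `([0, 1], [5, 1])`.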

The situation appears then to be similar to the situation in $\mathbb{Z}$, where we also
have division with remainder. Quotients and remainders are uniquely¹⁸ determined
by $g(x)$ and $f(x)$:

Lemma 4.5. Let $f(x)$ be a monic polynomial, and assume

$$f(x)q_1(x) + r_1(x) = f(x)q_2(x) + r_2(x)$$

with both $r_1(x)$ and $r_2(x)$ polynomials of degree $< \deg f(x)$. Then $q_1(x) = q_2(x)$
and $r_1(x) = r_2(x)$.

Proof. Indeed, we have

$$f(x)(q_1(x) - q_2(x)) = r_2(x) - r_1(x);$$

if $r_2(x) \neq r_1(x)$, then $r_2(x) - r_1(x)$ has degree $< \deg f(x)$, while $f(x)(q_1(x) - q_2(x))$
has degree $\geq \deg f(x)$, giving a contradiction. Therefore $r_1(x) = r_2(x)$, and
$q_1(x) = q_2(x)$ follows right away since monic polynomials are non-zero-divisors. □

The preceding considerations may be summarized in a rather efficient way in 
the language of ideals and cosets. We will now restrict ourselves to the commutative 
case, mostly for notational convenience, but also because this will guarantee that 
ideals are two-sided ideals, so that quotients are defined as rings (cf. §3.2). 


16 Note: With the convention that the degree of the polynomial 0 is $-\infty$, the condition
$\deg r(x) < \deg f(x)$ is satisfied by $r(x) = 0$.

17 The key point is that if $n \geq d$, then for all $a \in R$ we have $ax^n = ax^{n-d}f(x) + h(x)$ for some
polynomial $h(x)$ of degree $< n$. Arguing inductively, this shows that we may perform division
by $f(x)$ with remainder for all ‘monomials’ $ax^n$, and hence (by linearity) for all polynomials
$g(x) \in R[x]$.

18 This assertion has to be taken with a grain of salt in the noncommutative case, as different
quotients and remainders may arise if we divide ‘on the left’ rather than ‘on the right’.

Assume then that $R$ is a commutative ring. What we have shown is that, if
$f(x)$ is monic, then for every $g(x) \in R[x]$ there exists a unique polynomial $r(x)$ of
degree $< \deg f(x)$ and such that

$$g(x) + (f(x)) = r(x) + (f(x))$$

as cosets of the principal ideal $(f(x))$ in $R[x]$.

Refining this observation leads to a useful group-theoretic statement. Note that
polynomials of degree $< d$ may be seen as elements of a direct sum

$$R^{\oplus d} = \underbrace{R \oplus \cdots \oplus R}_{d \text{ times}}:$$

indeed, the function $\psi : R^{\oplus d} \to R[x]$ defined by

$$\psi((r_0, r_1, \dots, r_{d-1})) = r_0 + r_1 x + \cdots + r_{d-1}x^{d-1}$$

is clearly an injective homomorphism of abelian groups, hence an isomorphism
onto its image, and this consists precisely of the polynomials of degree $< d$. We will
glibly identify $R^{\oplus d}$ with this set of polynomials for the purpose of the discussion
that follows.

The next result may be seen as a way to concoct many different and interesting
ring structures on the direct sum $R^{\oplus d}$:

Proposition 4.6. Let $R$ be a commutative ring, and let $f(x) \in R[x]$ be a monic
polynomial of degree $d$. Then the function

$$\varphi : R[x] \to R^{\oplus d}$$

defined by sending $g(x) \in R[x]$ to the remainder of the division of $g(x)$ by $f(x)$
induces an isomorphism of abelian groups

$$\frac{R[x]}{(f(x))} \cong R^{\oplus d}.$$

Proof. The given function $\varphi$ is well-defined by Lemma 4.5, and it is surjective
since it has a right inverse (that is, the function $\psi : R^{\oplus d} \to R[x]$ defined above).
We claim that $\varphi$ is a homomorphism of abelian groups. Indeed, if

$$g_1(x) = f(x)q_1(x) + r_1(x) \quad\text{and}\quad g_2(x) = f(x)q_2(x) + r_2(x)$$

with $\deg r_1(x) < d$, $\deg r_2(x) < d$, then

$$g_1(x) + g_2(x) = f(x)(q_1(x) + q_2(x)) + (r_1(x) + r_2(x))$$

and $\deg(r_1(x) + r_2(x)) < d$: this implies (again by Lemma 4.5)

$$\varphi(g_1(x) + g_2(x)) = r_1(x) + r_2(x) = \varphi(g_1(x)) + \varphi(g_2(x)).$$

By the first isomorphism theorem for abelian groups, then, $\varphi$ induces an
isomorphism

$$\frac{R[x]}{\ker\varphi} \cong R^{\oplus d}.$$

On the other hand, $\varphi(g(x)) = 0$ if and only if $g(x) = f(x)q(x)$ for some $q(x) \in R[x]$,
that is, if and only if $g(x)$ is in the principal ideal generated by $f(x)$. This shows
$\ker\varphi = (f(x))$, concluding the proof. □

Example 4.7. Assume $f(x)$ is monic of degree 1: $f(x) = x - a$ for some $a \in R$.
Then the remainder of $g(x)$ after division by $f(x)$ is simply the ‘evaluation’ $g(a)$
(cf. Example 2.3): indeed,

$$g(x) = (x - a)q(x) + r$$

for some $r \in R$ (the remainder must have degree $< 1$; hence it is a constant);
evaluating at $a$ gives

$$g(a) = (a - a)q(a) + r = 0 \cdot q(a) + r = r$$

as claimed. In particular, $g(a) = 0$ if and only if $g(x) \in (x - a)$.

The content of Proposition 4.6 in this case is that the evaluation map

$$R[x] \to R, \qquad g(x) \mapsto g(a)$$

induces an isomorphism

$$\frac{R[x]}{(x - a)} \cong R$$

of abelian groups; the reader will verify (either by hand or by invoking
Corollary 3.10) that this is in fact an isomorphism of rings. ⌟
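The equality ‘remainder $=$ evaluation’ is easy to test. The sketch below divides by $x - a$ using synthetic (Horner) division, a standard shortcut for this special case rather than the procedure spelled out in the text, and compares the remainder with a direct evaluation. Polynomials are nonempty coefficient lists over $\mathbb{Z}$, lowest degree first; the names are illustrative.

```python
def divide_by_x_minus_a(g, a):
    """Synthetic division of g by x - a; coefficients listed lowest degree first.

    Returns (q, r): the quotient (lowest degree first) and the constant remainder.
    """
    partial = []
    acc = 0
    for c in reversed(g):       # start from the leading coefficient
        acc = acc * a + c
        partial.append(acc)
    remainder = partial.pop()   # the last accumulated value is the remainder
    return partial[::-1], remainder

def evaluate(g, a):
    """Direct evaluation of g at a, for an independent comparison."""
    return sum(c * a ** i for i, c in enumerate(g))
```

With $g(x) = x^3 - 6x^2 + 11x - 6$, dividing by $x - 4$ leaves remainder $6 = g(4)$, while dividing by $x - 1$ leaves remainder $0$, reflecting that $g(x) \in (x - 1)$.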


Example 4.8. It is fun to analyze higher-degree examples. For every monic
$f(x) \in R[x]$ of degree $d$, Proposition 4.6 gives a different ring (that is, $R[x]/(f(x))$)
isomorphic to $R^{\oplus d}$ as a group; one can then use this isomorphism to define a new
ring structure onto the group $R^{\oplus d}$.

For $d = 1$ all these structures are isomorphic (as seen in Example 4.7); but
interesting structures already arise in degree 2. For a concrete example, apply this
procedure with $f(x) = x^2 + 1$: Proposition 4.6 gives an isomorphism of groups

$$R \oplus R \cong \frac{R[x]}{(x^2 + 1)};$$

what multiplication does this isomorphism induce on $R \oplus R$? Take two elements
$(a_0, a_1)$, $(b_0, b_1)$ of $R \oplus R$. With the notation used in Proposition 4.6, we have

$$(a_0, a_1) = \varphi(a_0 + a_1 x), \qquad (b_0, b_1) = \varphi(b_0 + b_1 x).$$

Now a bit of high-school algebra gives

$$(a_0 + a_1 x)(b_0 + b_1 x) = a_0 b_0 + (a_0 b_1 + a_1 b_0)x + a_1 b_1 x^2$$

$$= (x^2 + 1)a_1 b_1 + ((a_0 b_0 - a_1 b_1) + (a_0 b_1 + a_1 b_0)x),$$

which shows

$$\varphi((a_0 + a_1 x)(b_0 + b_1 x)) = (a_0 b_0 - a_1 b_1, \; a_0 b_1 + a_1 b_0).$$

Therefore, the multiplication induced on $R \oplus R$ by this procedure is defined by

$$(a_0, a_1) \cdot (b_0, b_1) = (a_0 b_0 - a_1 b_1, \; a_0 b_1 + a_1 b_0).$$

This recipe may seem somewhat arbitrary, but note that upon taking $R = \mathbb{R}$, the
ring of real numbers, and identifying pairs $(x, y) \in \mathbb{R} \oplus \mathbb{R}$ with complex numbers
$x + iy$, the multiplication we obtained on $\mathbb{R} \oplus \mathbb{R}$ matches precisely the ordinary
multiplication in $\mathbb{C}$. Therefore,

$$\frac{\mathbb{R}[x]}{(x^2 + 1)} \cong \mathbb{C}$$

as rings. In other words, this procedure constructs the ring $\mathbb{C}$ ‘from scratch’,
starting from $\mathbb{R}[x]$. ⌟
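The computation in Example 4.8 can be replayed step by step: multiply the two linear representatives, then subtract the appropriate multiple of $x^2 + 1$. The sketch below does this with integer coefficients (so that comparisons are exact) and checks the result against Python's built-in complex multiplication; the function name is illustrative.

```python
def multiply_mod_x2_plus_1(u, v):
    """Multiply a0 + a1*x and b0 + b1*x, then reduce modulo x^2 + 1.

    A pair (a0, a1) stands for the remainder a0 + a1*x, as in Proposition 4.6.
    """
    a0, a1 = u
    b0, b1 = v
    # Product before reduction: c0 + c1*x + c2*x^2.
    c0, c1, c2 = a0 * b0, a0 * b1 + a1 * b0, a1 * b1
    # Subtract c2*(x^2 + 1): the x^2 term disappears and c2 is removed from c0.
    return (c0 - c2, c1)
```

For example, `multiply_mod_x2_plus_1((0, 1), (0, 1))` returns `(-1, 0)`: the class of $x$ squares to $-1$, just as $i$ does in $\mathbb{C}$.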

The point is that the polynomial equation $x^2 + 1 = 0$ has no solutions in $\mathbb{R}$;
the quotient $\mathbb{R}[x]/(x^2 + 1)$ produces a ring containing a copy of $\mathbb{R}$ and in which the
polynomial does have roots (that is, $\pm$ the class of $x$ in the quotient). The fact that
this turns out to be isomorphic to $\mathbb{C}$ may not be too surprising, considering that $\mathbb{C}$
is precisely a ring containing a copy of $\mathbb{R}$ and in which $x^2 + 1$ does have roots (that
is, $\pm i$).

Such constructions are the “algebraist’s way” to solve equations. We will come 
back to them in §V.5.2, and in a sense the whole of Chapter VII will be devoted to 
this topic. 

4.3. Prime and maximal ideals. Other ‘qualities’ of ideals are best expressed 
in terms of quotients. We are still assuming that our rings are commutative — both 
due to unforgivable laziness and because that is the only context in which we will 
use these notions. 

Definition 4.9. Let $I \ne (1)$ be an ideal of a commutative ring $R$.

• $I$ is a prime ideal if $R/I$ is an integral domain.

• $I$ is a maximal ideal if $R/I$ is a field.

Example 4.10. For all $a \in R$, the ideal $(x - a)$ is prime in $R[x]$ if and only if $R$ is an
integral domain; it is maximal if and only if $R$ is a field. Indeed, $R[x]/(x - a) \cong R$,
as we have seen in Example 4.7.

The ideal $(2, x)$ is maximal in $\mathbb{Z}[x]$, since

$$\frac{\mathbb{Z}[x]}{(2, x)} \cong \frac{\mathbb{Z}[x]/(x)}{(2, x)/(x)} \cong \frac{\mathbb{Z}}{(2)} = \mathbb{Z}/2\mathbb{Z}$$

is a field (for the isomorphism $\cong$, cf. Example 4.1).

Of course these notions may be translated into terms not involving quotients 
at all, and it is largely a matter of aesthetic preference whether prime and max- 
imal ideals should be defined as in Definition 4.9 or by the following equivalent 
conditions: 

Proposition 4.11. Let $I \ne (1)$ be an ideal of a commutative ring $R$. Then

• $I$ is prime if and only if for all $a, b \in R$

$$ab \in I \implies (a \in I \text{ or } b \in I);$$

• $I$ is maximal if and only if for all ideals $J$ of $R$

$$I \subseteq J \implies (I = J \text{ or } J = R).$$

Proof. The ring $R/I$ is an integral domain if and only if $\forall \bar a, \bar b \in R/I$

$$\bar a \cdot \bar b = 0 \implies (\bar a = 0 \text{ or } \bar b = 0).$$

This condition translates immediately to the given condition in $R$, with $\bar a = a + I$,
$\bar b = b + I$, since the 0 in $R/I$ is $I$.

As for maximality, the given condition follows from the correspondence between
ideals of $R/I$ and ideals of $R$ containing $I$ (§3.3) and the observation that a commutative
ring is a field if and only if its only ideals are (0) and (1) (Exercise 3.8). □

From the formulation in terms of quotients, it is completely clear that

$$\text{maximal} \implies \text{prime};$$

indeed, fields are integral domains. This fact is of course easy to check in terms of 
the other description, but the argument is a little more cumbersome (Exercise 4.14). 
Prime ideals are not necessarily maximal, but note the following: 

Proposition 4.12. Let $I$ be an ideal of a commutative ring $R$. If $R/I$ is finite,
then $I$ is prime if and only if it is maximal.

Proof. This follows immediately from Proposition 1.15. □ 

For example, let $(n)$ be an ideal of $\mathbb{Z}$, with $n > 0$; then

$$(n) \text{ prime} \iff (n) \text{ maximal} \iff n \text{ is prime as an integer}.$$

Indeed, for nonzero $n$ the ring $\mathbb{Z}/n\mathbb{Z}$ is finite, so Proposition 4.12 applies;
cf. Example 1.17.
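
For small $n$ these equivalences can be confirmed by brute force. The sketch below is a sanity check rather than a proof; the function names are hypothetical, and the ring $\mathbb{Z}/n\mathbb{Z}$ is modeled (by assumption) as the integers $0, \dots, n-1$ with arithmetic modulo $n$, for $n \ge 2$.

```python
def is_integral_domain(n):
    """Z/nZ (n >= 2) has no nonzero zero-divisors."""
    return all((a * b) % n != 0 for a in range(1, n) for b in range(1, n))

def is_field(n):
    """Every nonzero class in Z/nZ (n >= 2) has a multiplicative inverse."""
    return all(any((a * b) % n == 1 for b in range(1, n)) for a in range(1, n))

def is_prime(n):
    """Trial division; adequate for the small values used here."""
    return n >= 2 and all(n % d != 0 for d in range(2, int(n ** 0.5) + 1))

def equivalences_hold(bound):
    """Check '(n) prime <=> (n) maximal <=> n prime' for 2 <= n < bound."""
    return all(
        is_integral_domain(n) == is_field(n) == is_prime(n)
        for n in range(2, bound)
    )
```

Here `is_integral_domain(n)` tests whether $(n)$ is prime and `is_field(n)` tests whether $(n)$ is maximal, following Definition 4.9.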

In general, the set of prime ideals of a commutative ring $R$ is called 19 the
spectrum of $R$, denoted Spec $R$. We can ‘draw’ Spec $\mathbb{Z}$ as follows:



Actually, attributing the fact that nonzero prime ideals of $\mathbb{Z}$ are maximal to
the finiteness of the quotients (as we have just done) is slightly misleading; a better
‘explanation’ is that $\mathbb{Z}$ is a PID, and this phenomenon is common to all PIDs:

Proposition 4.13. Let $R$ be a PID, and let $I$ be a nonzero ideal in $R$. Then $I$ is
prime if and only if it is maximal.

Proof. Maximal ideals are prime in every ring, so we only need to verify that
nonzero prime ideals are maximal in a PID; we will use the characterization of
prime and maximal ideals obtained in Proposition 4.11. Let $I = (a)$ be a prime
ideal in $R$, with $a \ne 0$, and assume $I \subseteq J$ for an ideal $J$ of $R$. As $R$ is a PID, $J = (b)$
for some $b \in R$. Since $I = (a) \subseteq (b) = J$, we have that $a = bc$ for some $c \in R$. But
then $b \in (a)$ or $c \in (a)$, since $I = (a)$ is prime.

If $b \in (a)$, then $(b) \subseteq (a)$; and $I = J$ follows. If $c \in (a)$, then $c = da$ for some
$d \in R$. But then

$$a = bc = bda,$$

from which $bd = 1$ since cancellation by the nonzero $a$ holds in $R$ (since $R$ is an
integral domain). This implies that $b$ is a unit, and hence $J = (b) = R$.

That is, we have shown that if $I \subseteq J$, then either $I = J$ or $J = R$: thus $I$ is
maximal, by Proposition 4.11. □

19 Believe it or not, the term is borrowed from functional analysis.

Example 4.14. Let $k$ be a field. Then nonzero prime ideals in $k[x]$ are maximal,
since $k[x]$ is a PID (as the reader has hopefully checked by now; cf. Exercise 4.4).
Therefore, a picture of Spec $k[x]$ would look pretty much like the picture of Spec $\mathbb{Z}$
shown above: with the maximal ideals at one level, all containing the prime (and
nonmaximal) ideal (0).

This picture is particularly appealing for fields such as $k = \mathbb{C}$, which are
algebraically closed, that is, in which for every nonconstant $f(x) \in k[x]$ there exists $r \in k$ such that
$f(r) = 0$. It would take us too far afield to discuss this notion at any length now;
but the reader should be aware that $\mathbb{C}$ is algebraically closed. We will come back
to this 20. Assuming this fact, it is easy to verify (Exercise 4.21) that the maximal
ideals in $\mathbb{C}[x]$ are all and only the ideals

$$(x - z)$$

where $z$ ranges over all complex numbers. That is, stretching our imagination a
little, we could come up with the following picture for Spec $\mathbb{C}[x]$:


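[Figure: Spec $\mathbb{C}[x]$, showing maximal ideals such as $(x)$, $(x - i)$, $(x - 1)$ at one level, all containing the prime ideal $(0)$.]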



There is a ‘complex line 21 worth’ of maximal ideals: for each $z \in \mathbb{C}$ we have the
maximal $(x - z)$; the prime (0) is contained in all the maximal ideals; and there
are no other prime ideals.

The picture for Spec $\mathbb{C}[x]$ may serve as justification for the fact that, in algebraic
geometry, $\mathbb{C}[x]$ is the ring corresponding to the ‘affine line’ $\mathbb{C}$; it is the ring of an
algebraic curve. It turns out that the fact that there is exactly ‘one level’ of maximal
ideals over (0) in $\mathbb{C}[x]$ reflects precisely the fact that the corresponding geometric
object has dimension 1.

20 A particularly pleasant proof may be given using elementary complex analysis, as a
consequence of Liouville’s theorem, or the maximum modulus principle; cf. §V.5.3.

21 We know: it looks like a plane. But it is a line, as a complex entity.

In general, the (Krull) dimension of a commutative ring R is the length of the 
longest chain of prime ideals in R. Thus, Proposition 4.13 tells us that PIDs, such 
as Z, have ‘dimension 1’. In the lingo of algebraic geometry, they all correspond to 
curves.

Example 4.15. For examples of rings of higher dimension, consider

$$k[x_1, \dots, x_n],$$

where $k$ is a field. Note that there are chains of prime ideals of length $n$

$$(0) \subsetneq (x_1) \subsetneq (x_1, x_2) \subsetneq \cdots \subsetneq (x_1, \dots, x_n)$$

in this ring. (Why are these ideals prime? Cf. Exercise 4.13.) This says that
$k[x_1, \dots, x_n]$ has dimension $\ge n$. One can show that $n$ is in fact the longest length of
a chain of prime ideals in $k[x_1, \dots, x_n]$; that is, the Krull dimension of $k[x_1, \dots, x_n]$
is precisely $n$. In algebraic geometry, the ring $\mathbb{C}[x_1, \dots, x_n]$ corresponds to the
$n$-dimensional complex space $\mathbb{C}^n$.

Dealing precisely with these notions is not so easy, however. Even the seemingly
simple statement that the maximal ideals of $\mathbb{C}[x_1, \dots, x_n]$ are all and only the ideals
$(x_1 - z_1, \dots, x_n - z_n)$, for $(z_1, \dots, z_n) \in \mathbb{C}^n$, requires a rather deep result, known
as Hilbert’s Nullstellensatz.

We will come back to all of this and get a very small taste of algebraic geometry 
in §VII.2, after we develop (much) more machinery. 


Exercises 


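4.1. ▷ Let $R$ be a ring, and let $\{I_\alpha\}_{\alpha \in A}$ be a family of ideals of $R$. We let

$$\sum_{\alpha \in A} I_\alpha := \Bigl\{ \sum_{\alpha \in A} r_\alpha \ \text{such that}\ r_\alpha \in I_\alpha \ \text{and}\ r_\alpha = 0 \ \text{for all but finitely many}\ \alpha \Bigr\}.$$

Prove that $\sum_\alpha I_\alpha$ is an ideal of $R$ and that it is the smallest ideal containing all of
the ideals $I_\alpha$. [§4.1]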

4.2. ▷ Prove that the homomorphic image of a Noetherian ring is Noetherian. That
is, prove that if $\varphi : R \to S$ is a surjective ring homomorphism and $R$ is Noetherian,
then $S$ is Noetherian. [§6.4]

4.3. Prove that the ideal $(2, x)$ of $\mathbb{Z}[x]$ is not principal.

4.4. ▷ Prove that if $k$ is a field, then $k[x]$ is a PID. (Hint: Let $I \subseteq k[x]$ be any ideal.
If $I = (0)$, then $I$ is principal. If $I \ne (0)$, let $f(x)$ be a monic polynomial in $I$ of
minimal degree. Use division with remainder to construct a proof that $I = (f(x))$,
arguing as in the proof of Proposition II.6.9.) [§4.1, §4.3, §V.2.4, §V.4.1, §VI.7.2,
§VII.1.2]

4.5. ▷ Let $I$, $J$ be ideals in a ring $R$, such that $I + J = (1)$. Prove that $IJ = I \cap J$.
[§4.1]

4.6. Let $I$, $J$ be ideals in a ring $R$. Assume that $R/(IJ)$ is reduced (that is, it has
no nonzero nilpotent elements; cf. Exercise 3.13). Prove that $IJ = I \cap J$.

4.7. ▷ Let $R = k$ be a field. Prove that every nonzero (principal) ideal in $k[x]$ is
generated by a unique monic polynomial. [§4.2, §VI.7.2]

4.8. ▷ Let $R$ be a ring and $f(x) \in R[x]$ a monic polynomial. Prove that $f(x)$ is
not a (left- or right-) zero-divisor. [§4.2, 4.9]

4.9. Generalize the result of Exercise 4.8, as follows. Let $R$ be a ring, and let $f(x)$
be a left-zero-divisor in $R[x]$. Prove that $\exists b \in R$, $b \ne 0$, such that $f(x)b = 0$. (Hint:
Let $f(x) = a_d x^d + \cdots + a_0$, and let $g(x) = b_e x^e + \cdots + b_0$ be a nonzero polynomial
of minimal degree $e$ such that $f(x)g(x) = 0$. Deduce that $a_d g(x) = 0$, and then
prove $a_{d-i} g(x) = 0$ for all $i$, by induction. What does this say about $b_e$?)

4.10. ¬ Let $d$ be an integer that is not the square of an integer, and consider the
subset of $\mathbb{C}$ defined by 22

$$\mathbb{Q}(\sqrt{d}) := \{a + b\sqrt{d} \mid a, b \in \mathbb{Q}\}.$$

• Prove that $\mathbb{Q}(\sqrt{d})$ is a subring of $\mathbb{C}$.

• Define a function $N : \mathbb{Q}(\sqrt{d}) \to \mathbb{Q}$ by $N(a + b\sqrt{d}) := a^2 - b^2 d$. Prove that
$N(zw) = N(z)N(w)$ and that $N(z) \ne 0$ if $z \in \mathbb{Q}(\sqrt{d})$, $z \ne 0$.

The function $N$ is a ‘norm’; it is very useful in the study of $\mathbb{Q}(\sqrt{d})$ and of its
subrings. (Cf. also Exercise 2.5.)

• Prove that $\mathbb{Q}(\sqrt{d})$ is a field and in fact the smallest subfield of $\mathbb{C}$ containing both
$\mathbb{Q}$ and $\sqrt{d}$. (Use $N$.)

• Prove that $\mathbb{Q}(\sqrt{d}) \cong \mathbb{Q}[t]/(t^2 - d)$. (Cf. Example 4.8.)

[V.1.17, V.2.18, V.6.13, VII.1.12]

4.11. Let $R$ be a commutative ring, $a \in R$, and $f_1(x), \dots, f_r(x) \in R[x]$.

• Prove the equality of ideals

$$(f_1(x), \dots, f_r(x), x - a) = (f_1(a), \dots, f_r(a), x - a).$$

• Prove the useful substitution trick

$$\frac{R[x]}{(f_1(x), \dots, f_r(x), x - a)} \cong \frac{R}{(f_1(a), \dots, f_r(a))}.$$

(Hint: Exercise 3.3.)

4.12. ▷ Let $R$ be a commutative ring and $a_1, \dots, a_n$ elements of $R$. Prove that

$$\frac{R[x_1, \dots, x_n]}{(x_1 - a_1, \dots, x_n - a_n)} \cong R.$$

[§VII.2.2]

22 Of course there are two ‘square roots of $d$’; but the definition of $\mathbb{Q}(\sqrt{d})$ does not depend
on which one is used.

4.13. ▷ Let $R$ be an integral domain. For all $k = 1, \dots, n$ prove that $(x_1, \dots, x_k)$
is prime in $R[x_1, \dots, x_n]$. [§4.3]

4.14. ▷ Prove ‘by hand’ that maximal ideals are prime, without using quotient
rings. [§4.3]

4.15. Let $\varphi : R \to S$ be a homomorphism of commutative rings, and let $I \subseteq S$ be
an ideal. Prove that if $I$ is a prime ideal in $S$, then $\varphi^{-1}(I)$ is a prime ideal in $R$.
Show that $\varphi^{-1}(I)$ is not necessarily maximal if $I$ is maximal.

4.16. Let $R$ be a commutative ring, and let $P$ be a prime ideal of $R$. Suppose 0 is
the only zero-divisor of $R$ contained in $P$. Prove that $R$ is an integral domain.

4.17. ¬ (If you know a little topology...) Let $K$ be a compact topological space,
and let $R$ be the ring of continuous real-valued functions on $K$, with addition and
multiplication defined pointwise.

(i) For $p \in K$, let $M_p = \{f \in R \mid f(p) = 0\}$. Prove that $M_p$ is a maximal ideal in
$R$.

(ii) Prove that if $f_1, \dots, f_r \in R$ have no common zeros, then $(f_1, \dots, f_r) = (1)$.
(Hint: Consider $f_1^2 + \cdots + f_r^2$.)

(iii) Prove that every maximal ideal $M$ in $R$ is of the form $M_p$ for some $p \in K$.
(Hint: You will use the compactness of $K$ and (ii).)

Conclude that $p \mapsto M_p$ defines a surjection from $K$ to the set of maximal ideals
of $R$. (The set of maximal ideals of a commutative ring $R$ is called the maximal
spectrum of $R$; it is contained in the (prime) spectrum Spec $R$ defined in §4.3.
Relating commutative rings and ‘geometric’ entities such as topological spaces is
the business of algebraic geometry.)

The compactness hypothesis is necessary: cf. Exercise V.3.10. [V.3.10]

4.18. Let $R$ be a commutative ring, and let $N$ be its nilradical (Exercise 3.12).
Prove that $N$ is contained in every prime ideal of $R$. (Later on the reader will
check that the nilradical is in fact the intersection of all prime ideals of $R$:
Exercise V.3.13.)

4.19. Let $R$ be a commutative ring, let $P$ be a prime ideal in $R$, and let $I_j$ be
ideals of $R$.

(i) Assume that $I_1 \cdots I_r \subseteq P$; prove that $I_j \subseteq P$ for some $j$.

(ii) By (i), if $P \supseteq \bigcap_{j=1}^{r} I_j$, then $P$ contains one of the ideals $I_j$. Prove or disprove:
if $P \supseteq \bigcap_{j=1}^{\infty} I_j$, then $P$ contains one of the ideals $I_j$.

4.20. Let $M$ be a two-sided ideal in a (not necessarily commutative) ring $R$. Prove
that $M$ is maximal if and only if $R/M$ is a simple ring (cf. Exercise 3.9).

4.21. ▷ Let $k$ be an algebraically closed field, and let $I \subseteq k[x]$ be an ideal. Prove
that $I$ is maximal if and only if $I = (x - c)$ for some $c \in k$. [§4.3, §V.5.2, §VII.2.1,
§VII.2.2]

4.22. Prove that $(x^2 + 1)$ is maximal in $\mathbb{R}[x]$.

4.23. A ring $R$ has Krull dimension 0 if every prime ideal in $R$ is maximal. Prove
that fields and Boolean rings (Exercise 3.15) have Krull dimension 0.

4.24. Prove that the ring $\mathbb{Z}[x]$ has Krull dimension $\ge 2$. (It is in fact exactly 2;
thus it corresponds to a surface from the point of view of algebraic geometry.)


5. Modules over a ring 

We have emphasized the parallels between the basic theory of groups and the basic 
theory of rings; there are also important differences. In Grp, one takes the quotient 
of a group by a (normal sub)group, and the result is a group: throughout the
process one never leaves Grp. The situation is in a sense even better in Ab, where 
the normality condition is automatic; one simply takes the quotient of an abelian 
group by an abelian (sub)group, obtaining an abelian group. 

The situation in Ring is not nearly as neat. One of the three characters in the 
story is an ideal, which is not a ring according to the axioms listed in Definition 1.1,
in all but the most pathological cases. In other words, the kernel of a homomorphism
of rings is (usually) not a ring. Also, cokernels do not behave as one would hope
(cf. Exercise 2.12). Even if one relaxes the ring definition, giving up the identity 
and making ideals subrings, there is no reasonably large class of examples in which 
the ‘ideal’ condition is automatically satisfied by substructures. In short, Ring is 
not a particularly pleasant category. 

Modules will fix all these problems. If $R$ is a ring and $I \subseteq R$ is a two-sided
ideal, then all three structures $R$, $I$, and $R/I$ are modules over $R$. The category
of $R$-modules is the prime example of a well-behaved category: in this category
kernels and cokernels exist and do precisely what they ought to. The category Ab
is a particular case of this construction, since it is the category of modules over
$\mathbb{Z}$ in disguise (Example 5.4). The category of modules over a ring $R$ will share
many of the excellent properties of Ab; and we will get a brand new, well-behaved
category for each ring $R$. These are all examples 23 of the important notion of
abelian category.

5.1. Definition of (left-)$R$-module. In short, $R$-modules are abelian groups
endowed with an action of $R$. To flesh out this idea, recall that ‘actions’ in general
denote homomorphisms into some kind of endomorphism structure: for example,
we defined group actions in §II.9.1 as group homomorphisms from a fixed group to
the groups of automorphisms of objects of a category.

We can give an analogous definition of the action of a ring on an abelian group.
Indeed, recall that if $M$ is an abelian group, then $\mathrm{End}_{\mathrm{Ab}}(M) := \mathrm{Hom}_{\mathrm{Ab}}(M, M)$ is
a ring in a natural way (cf. §2.5). A left-action of a ring $R$ on $M$ is then simply a
homomorphism of rings

$$\sigma : R \to \mathrm{End}_{\mathrm{Ab}}(M);$$

we will say that $\sigma$ makes $M$ into a left-$R$-module.

23 In fact, it can be shown (‘Freyd-Mitchell’s embedding theorem’) that every small abelian
category is equivalent to a subcategory of the category of left-modules over a ring. So to some
extent we can understand abelian categories by understanding categories of modules well enough.
We will come back to this in Chapter IX.

Similarly to the situation with group actions, it is convenient to spell out this 
definition. 

Claim 5.1. The datum of a homomorphism $\sigma$ as above is precisely the same as the
datum of a function

$$\rho : R \times M \to M$$

satisfying the following requirements: $(\forall r, s \in R)$ $(\forall m, n \in M)$

• $\rho(r, m + n) = \rho(r, m) + \rho(r, n)$;

• $\rho(r + s, m) = \rho(r, m) + \rho(s, m)$;

• $\rho(rs, m) = \rho(r, \rho(s, m))$;

• $\rho(1, m) = m$.

The proof of this claim follows precisely the pattern of the discussion in the
beginning of §II.9.2, so it is left to the reader (Exercise 5.2). Of course the relation
between $\rho$ and $\sigma$ is given by

$$\rho(r, m) = \sigma(r)(m).$$

As always, carrying $\rho$ around is inconvenient, so it is common to omit mention of
it: $\rho(r, m)$ is often just denoted $rm$; this makes the requirements listed above a
little more readable. Summarizing and adopting this shorthand,

Definition 5.2. A left-$R$-module structure on an abelian group $M$ consists of a
map $R \times M \to M$, $(r, m) \mapsto rm$, such that

• $r(m + n) = rm + rn$;

• $(r + s)m = rm + sm$;

• $(rs)m = r(sm)$;

• $1m = m$.
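
The four axioms are easy to test mechanically on small examples. The sketch below is an illustration under stated assumptions: the ring is taken to be $\mathbb{Z}/a\mathbb{Z}$ and the abelian group $\mathbb{Z}/b\mathbb{Z}$, both modeled by integers with modular arithmetic, the candidate action is $(r, m) \mapsto rm \bmod b$, and the name `is_module_action` is hypothetical.

```python
from itertools import product

def is_module_action(ring_n, group_n):
    """Check the axioms of Definition 5.2 for Z/ring_n acting on Z/group_n
    by the candidate rule (r, m) -> r*m mod group_n."""
    R = range(ring_n)
    M = range(group_n)

    def act(r, m):
        return (r * m) % group_n

    for r, s, m, n in product(R, R, M, M):
        # r(m + n) = rm + rn
        if act(r, (m + n) % group_n) != (act(r, m) + act(r, n)) % group_n:
            return False
        # (r + s)m = rm + sm, with r + s computed in the ring
        if act((r + s) % ring_n, m) != (act(r, m) + act(s, m)) % group_n:
            return False
        # (rs)m = r(sm), with rs computed in the ring
        if act((r * s) % ring_n, m) != act(r, act(s, m)):
            return False
    # 1m = m
    return all(act(1 % ring_n, m) == m for m in M)
```

With these conventions `is_module_action(6, 3)` succeeds, while `is_module_action(4, 6)` fails: the rule is not compatible with addition in $\mathbb{Z}/4\mathbb{Z}$, since $3 + 1 = 0$ there but $3m + m = 4m$ need not vanish in $\mathbb{Z}/6\mathbb{Z}$.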

Right-$R$-modules are defined analogously. The reader can glance back at §II.9,
especially Exercise II.9.3, for a reminder on right- vs. left-actions; the issues here are
analogous. Thus, for example, a right-$R$-module structure may be identified with
a left-$R^\circ$-module structure, where $R^\circ$ is the ‘opposite’ ring obtained by reversing
the order of multiplication (Exercise 5.1). However, note that $R$ and $R^\circ$ have no
good reasons to be isomorphic in general (while every group is isomorphic to its
opposite).

These issues become immaterial if $R$ is commutative: then the identity $R \to R^\circ$
is an isomorphism, and left-modules/right-modules are identical concepts. The
reader will not miss much by adopting the blanket assumption that all rings
mentioned in this section are commutative. It is occasionally important to make this
hypothesis explicit (for example in dealing with algebras, cf. Example 5.6), but most
of the material we are going to review works verbatim for, say, left-modules over an
arbitrary ring as for modules over a commutative ring. We will write ‘module’ for
‘left-module’, for convenience; it will be the reader’s responsibility to take care of
appropriate changes, if necessary, to adapt the various concepts to right-modules.

5.2. The category $R$-Mod. The reader should spend some time getting familiar
with the notion of module, by proving simple properties such as

$$(\forall m \in M) : \ 0 \cdot m = 0,$$

$$(\forall m \in M) : \ (-1) \cdot m = -m,$$

where $M$ is a module over some ring (Exercise 5.3). The following fancier-sounding
(but equally trivial) property is also useful in developing a feel for modules:

Proposition 5.3. Every abelian group is a $\mathbb{Z}$-module, in exactly one way.

Proof. Let $G$ be an abelian group. A $\mathbb{Z}$-module structure on $G$ is a ring
homomorphism

$$\mathbb{Z} \to \mathrm{End}_{\mathrm{Ab}}(G).$$

Since $\mathbb{Z}$ is initial in Ring (§2.1), there exists exactly one such homomorphism,
proving the statement. □

Thus, ‘abelian group’ and ‘$\mathbb{Z}$-module’ are one and the same notion. Quite
concretely, the action of $n \in \mathbb{Z}$ on an element $a$ of an abelian group simply yields
the ordinary ‘multiple’ $na$; this operation is trivially compatible with the operations
in $\mathbb{Z}$.
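
To see concretely that the $\mathbb{Z}$-action uses nothing beyond the group structure, one may compute $na$ by repeated addition. The following sketch is only an illustration: the function `z_action` and the sample group $\mathbb{Z}/5\mathbb{Z} \times \mathbb{Z}/7\mathbb{Z}$ (with the helper names `add`, `neg`, `ZERO`) are hypothetical choices, not notation used elsewhere in the text.

```python
def z_action(n, a, add, neg, zero):
    """Return the multiple n*a, using only the group operations."""
    if n < 0:
        # (-n)*a is defined as n*(-a)
        return z_action(-n, neg(a), add, neg, zero)
    result = zero
    for _ in range(n):
        result = add(result, a)
    return result

# Sample abelian group: Z/5 x Z/7 with componentwise addition.
ZERO = (0, 0)

def add(u, v):
    return ((u[0] + v[0]) % 5, (u[1] + v[1]) % 7)

def neg(u):
    return ((-u[0]) % 5, (-u[1]) % 7)
```

For instance, `z_action(3, (2, 3), add, neg, ZERO)` adds $(2, 3)$ to itself three times, and compatibility with addition in $\mathbb{Z}$ amounts to `z_action(m + n, a, ...)` agreeing with `add(z_action(m, a, ...), z_action(n, a, ...))`.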

A homomorphism of $R$-modules is a homomorphism of (abelian) groups which
is compatible with the module structure. That is, if $M$, $N$ are $R$-modules and
$\varphi : M \to N$ is a function, then $\varphi$ is a homomorphism of $R$-modules if and only if

• $(\forall m_1 \in M)(\forall m_2 \in M) : \ \varphi(m_1 + m_2) = \varphi(m_1) + \varphi(m_2)$;

• $(\forall r \in R)(\forall m \in M) : \ \varphi(rm) = r\varphi(m)$.

It is hopefully clear that the composition of two $R$-module homomorphisms is an
$R$-module homomorphism and that the identity is an $R$-module homomorphism:

$R$-modules form a category

which we will denote 24 ‘$R$-Mod’.

Example 5.4. The category $\mathbb{Z}$-Mod of $\mathbb{Z}$-modules is ‘the same as’ the category Ab:
indeed, every abelian group is a $\mathbb{Z}$-module in exactly one way (Proposition 5.3),
and $\mathbb{Z}$-module homomorphisms are simply homomorphisms of abelian groups.

Example 5.5. If $R = k$ is a field, $R$-modules are called $k$-vector spaces. We will
call the category of vector spaces over a field $k$ ‘$k$-Vect’; this is just another name
for $k$-Mod. Morphisms in $k$-Vect are often called 25 linear maps. ‘Linear algebra’ is
the study of $k$-Vect (extended to $R$-Mod when possible); Chapters VI and VIII will
be devoted to this subject.


24 If $R$ is not commutative, we should agree on whether $R$-Mod denotes the category of
left-modules or of right-modules. We will mean ‘left-modules’.

25 This term is also used for homomorphisms of $R$-modules for more general rings $R$, but not
as frequently.

Example 5.6. Any homomorphism of rings $\alpha : R \to S$ may be used to define an
interesting $R$-module: define $\rho : R \times S \to S$ by

$$\rho(r, s) := \alpha(r)s$$

for all $r \in R$ and $s \in S$. The operation on the right is simply multiplication in $S$,
and the axioms of Definition 5.2 are immediate consequences of the ring axioms and
of the fact that $\alpha$ is a homomorphism. For instance, taking $S = R$ and $\alpha = \mathrm{id}_R$
makes $R$ a (left-)module over itself.

It is common to write $rs$ rather than $\alpha(r)s$.

The ring operation in $S$ and the $R$-module structure induced by $\alpha$ are linked
even more tightly if we require $R$ to be commutative and $\alpha$ to map $R$ to the center
of $S$: that is, if we require $\alpha(r)$, $s$ to commute for every $r \in R$, $s \in S$. Indeed, in this
case the left-module structure defined above and the right-module structure defined
analogously would coincide; further, with these requirements the ring operation in $S$

$$(s_1, s_2) \mapsto s_1 s_2$$

is compatible with the $R$-module structure in the sense that 26

$$(r_1 s_1)(r_2 s_2) = \alpha(r_1) s_1 \alpha(r_2) s_2 = \alpha(r_1)\alpha(r_2) s_1 s_2 = (r_1 r_2)(s_1 s_2)$$

$\forall r_1, r_2 \in R$, $\forall s_1, s_2 \in S$: that is, we can ‘move’ the action of $R$ at will through
products in $S$.

Due to their importance, these examples deserve their own official name: 

Definition 5.7. Let $R$ be a commutative ring. An $R$-algebra is a ring
homomorphism $\alpha : R \to S$ such that $\alpha(R)$ is contained in the center of $S$.

The usual abuse of language leads us to refer to an $R$-algebra by the name of
the target $S$ of the homomorphism. Thus, an $R$-algebra ‘is’ an $R$-module $S$ with a
compatible ring structure, or, if you prefer, a ring $S$ with a compatible $R$-module
structure. An $R$-algebra $S$ is a division algebra if $S$ is a division ring.

There is an evident notion of ‘algebra homomorphism’ (preserving both the
ring and module structure), and we thus get a category $R$-Alg. The situation
simplifies substantially if $S$ is itself commutative, in which case the condition on
the center is unnecessary. ‘Commutative $R$-algebras’ form a category, which the
attentive reader will recognize as a coslice category (Example I.3.7) in the category
of commutative rings.

Also, note that Z-Alg is just another name for Ring (why?). 

The polynomial rings $R[x_1, \dots, x_n]$, as well as all their quotients, are
commutative $R$-algebras. This is a particularly important class of examples; for $R = k$
an algebraically closed field, these are the rings used in ‘classical’ affine algebraic
geometry (cf. §VII.2.3).

The trivial group 0 has a unique module structure over any ring $R$ and is a
zero-object in $R$-Mod, that is, it is both initial and final. As in the other main categories
we have encountered, a bijective homomorphism of $R$-modules is automatically an
isomorphism in $R$-Mod (Exercise 5.12). In these and many other respects, the
category $R$-Mod (for any commutative ring $R$) and Ab are similar.

26 More generally, this choice makes the multiplication in $S$ ‘$R$-bilinear’.

If $R$ is commutative, the similarity goes further: just as in the category Ab, each
set $\mathrm{Hom}_{R\text{-Mod}}(M, N)$ may itself be seen as an object of the category (cf. §II.4.4) 27.
Indeed, let $M$ and $N$ be $R$-modules. Since homomorphisms of $R$-modules are in
particular homomorphisms of abelian groups,

$$\mathrm{Hom}_{R\text{-Mod}}(M, N) \subseteq \mathrm{Hom}_{\mathrm{Ab}}(M, N)$$

as sets (up to natural identifications). The operation making $\mathrm{Hom}_{\mathrm{Ab}}(M, N)$ into an
abelian group, as in §II.4.4, clearly preserves $\mathrm{Hom}_{R\text{-Mod}}(M, N)$; it follows that the
latter is an abelian group. For $r \in R$ and $\varphi \in \mathrm{Hom}_{R\text{-Mod}}(M, N)$, the prescription

$$(\forall m \in M) : \ (r\varphi)(m) := r\varphi(m)$$

defines a function 28 $r\varphi : M \to N$. This function is an $R$-module homomorphism if
$R$ is commutative, because $(\forall a \in R)$, $(\forall m \in M)$

$$(r\varphi)(am) = r\varphi(am) = (ra)\varphi(m) = (ar)\varphi(m) = a(r\varphi(m)).$$

Thus, we have a natural action of $R$ on the abelian group $\mathrm{Hom}_{R\text{-Mod}}(M, N)$, and it
is immediate to verify that this makes $\mathrm{Hom}_{R\text{-Mod}}(M, N)$ into an $R$-module.

Watch out: if $R$ is not commutative, then in general $\mathrm{Hom}_{R\text{-Mod}}(M, N)$ is ‘just’
an abelian group. More structure is available if $M$, $N$ are bimodules; cf. §VIII.3.2.

5.3. Submodules and quotients. Since R-modules are an ‘enriched’ version of 
abelian groups, we can progress through the usual constructions very quickly, by 
just pointing out that the analogous constructions in Ab are preserved by the R- 
module structure. 

A submodule $N$ of an $R$-module $M$ is a subgroup preserved by the action of $R$.
That is, for all $r \in R$ and $n \in N$, the element $rn$ (defined by the $R$-module structure
of $M$) is in fact in $N$. Put otherwise, and perhaps more transparently, $N$ is itself
an $R$-module, and the inclusion $N \subseteq M$ is an $R$-module homomorphism.

Example 5.8. We can view $R$ itself as a (left-)$R$-module (cf. Example 5.6); the
submodules of $R$ are then precisely the (left-)ideals of $R$.

Example 5.9. Both the kernel and the image of a homomorphism $\varphi : M \to M'$
of $R$-modules are submodules (of $M$, $M'$, respectively).

Example 5.10. If $r \in R$ and $M$ is an $R$-module, $rM = \{rm \mid m \in M\}$ is a
submodule of $M$. If $I$ is any ideal of $R$, then $IM = \{\sum_i r_i m_i \mid r_i \in I, m_i \in M\}$
(the finite sums of products $rm$ with $r \in I$, $m \in M$) is a submodule of $M$.

If $N$ is a submodule of $M$, then it is in particular a (normal) subgroup of the
abelian group $(M, +)$; thus we may define the quotient $M/N$ as an abelian group.

27 Notational convention: One often writes $\mathrm{Hom}_R(M, N)$ for $\mathrm{Hom}_{R\text{-Mod}}(M, N)$. We will not
adopt this convention here but we will use it freely in later chapters.

28 Parse the notation carefully: $r\varphi$ is the name of a function on the left, while $r\varphi(m)$ on the
right is the action of the element $r \in R$ on the element $\varphi(m) \in N$, defined by the $R$-module
structure on $N$.






Of course it would be desirable to see this as a module, and as usual there is only
one reasonable way to do so: we will want the canonical projection

$$\pi : M \to M/N$$

to be an $R$-module homomorphism, and this forces

$$r(m + N) = r\pi(m) = \pi(rm) = rm + N$$

for all $m \in M$. That is, we are led to define the action of $R$ on $M/N$ by

$$r(m + N) := rm + N.$$

Claim 5.11. For all submodules $N$, this prescription does define a structure of
$R$-module on $M/N$.

The proof of this claim is immediate and is left to the reader. The $R$-module
$M/N$ is (of course) called the quotient of $M$ by $N$.

Example 5.12. If $R$ is a ring and $I$ is a two-sided ideal of $R$, then all three of $I$,
$R$, and the quotient ring $R/I$ are $R$-modules: $I$ is a submodule of $R$, and the rings
$R$ and $R/I$ are in fact $R$-algebras if $R$ is commutative (cf. Example 5.6).

Example 5.13. If $R$ is not commutative and $I$ is just a (say) left-ideal, then
the quotient $R/I$ is not defined as a ring, but it is defined as a left-module (the
quotient of the module $R$ by the submodule $I$). The action of $R$ on $R/I$ is given
by left-multiplication: $r(a + I) = ra + I$.

The reader should now expect a universal property for quotients, and here it is: 

Theorem 5.14. Let $N$ be a submodule of an $R$-module $M$. Then for every
homomorphism of $R$-modules $\varphi : M \to P$ such that $N \subseteq \ker \varphi$ there exists a unique
homomorphism of $R$-modules $\tilde\varphi : M/N \to P$ so that the diagram

$$\begin{array}{ccc}
M & \xrightarrow{\ \varphi\ } & P \\
{\scriptstyle \pi}\downarrow & \nearrow{\scriptstyle \tilde\varphi} & \\
M/N & &
\end{array}$$

commutes.

As in previous appearances of such statements, this is an immediate
consequence of the set-theoretic version (§I.5.3) and of easy notation matching and
compatibility checks. For an even faster proof, one can just apply Theorem II.7.12 and
verify that $\tilde\varphi$ is an $R$-module homomorphism.

Since every submodule $N$ is then the kernel of the canonical projection $M \to
M/N$, our recurring slogan becomes, in the context of $R$-Mod,

$$\text{kernel} \iff \text{submodule}:$$

unlike as in Grp or Ring, being a kernel poses no restriction on the relevant
substructures. Put otherwise, ‘every monomorphism in $R$-Mod is a kernel’; this is one
of the distinguishing features of an abelian category.

5.4. Canonical decomposition and isomorphism theorems. The discussion
now proceeds along the same lines as for (abelian) groups; the statements of the key
facts and a few comments should suffice, as the proofs are nothing but a rehashing
of the proofs of analogous statements we have encountered previously. Of course
the reader should take the following statements as assignments and provide all the
needed details.


In the context of modules, the canonical decomposition takes the following 
form: 

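Theorem 5.15. Every $R$-module homomorphism $\varphi : M \to M'$ may be decomposed
as follows:

$$\varphi : \ M \twoheadrightarrow \frac{M}{\ker \varphi} \xrightarrow{\ \sim\ } \operatorname{im} \varphi \hookrightarrow M',$$

where the isomorphism in the middle is the homomorphism induced by $\varphi$ (as in
Theorem 5.14).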


The ‘first isomorphism theorem’ is the following consequence: 


Corollary 5.16. Suppose $\varphi : M \to M'$ is a surjective $R$-module homomorphism.
Then

$$M' \cong \frac{M}{\ker \varphi}.$$



If $M$ is an $R$-module and $N$ is a submodule of $M$, then there is a bijection
(cf. §II.8.3)

$$u : \{\text{submodules } P \text{ of } M \text{ containing } N\} \to \{\text{submodules of } M/N\}$$

preserving inclusions, and the ‘third isomorphism theorem’ holds:

Proposition 5.17. Let $N$ be a submodule of an $R$-module $M$, and let $P$ be a
submodule of $M$ containing $N$. Then $P/N$ is a submodule of $M/N$, and

$$\frac{M/N}{P/N} \cong \frac{M}{P}.$$

We also have a version of the ‘second isomorphism theorem’ (cf.
Proposition II.8.11), with simplifications due to the fact that normality is not an issue
in the theory of modules:

Proposition 5.18. Let $N$, $P$ be submodules of an $R$-module $M$. Then

• $N + P$ is a submodule of $M$;

• $N \cap P$ is a submodule of $P$, and

$$\frac{N + P}{N} \cong \frac{P}{N \cap P}.$$

More generally, it is hopefully clear that the sum $\sum_\alpha N_\alpha$ and intersection
$\bigcap_\alpha N_\alpha$ of any family $\{N_\alpha\}_\alpha$ of submodules of an $R$-module $M$ (which are defined
as subgroups of the abelian group $M$; cf. for example Lemma II.6.3) are submodules
of $M$.

Exercises


5.1. ▷ Let $R$ be a ring. The opposite ring $R^\circ$ is obtained from $R$ by reversing the
multiplication: that is, the product $a \bullet b$ in $R^\circ$ is defined to be $ba \in R$. Prove that
the identity map $R \to R^\circ$ is an isomorphism if and only if $R$ is commutative. Prove
that $M_n(\mathbb{R})$ is isomorphic to its opposite (not via the identity map!). Explain
how to turn right-$R$-modules into left-$R$-modules and conversely, if $R \cong R^\circ$. [§5.1,
VIII.5.19]

5.2. ▷ Prove Claim 5.1. [§5.1]

5.3. ▷ Let $M$ be a module over a ring $R$. Prove that $0 \cdot m = 0$ and that $(-1) \cdot m =
-m$, for all $m \in M$. [§5.2]

5.4. ¬ Let $R$ be a ring. A nonzero $R$-module $M$ is simple (or irreducible) if its only
submodules are $\{0\}$ and $M$. Let $M$, $N$ be simple modules, and let $\varphi : M \to N$ be
a homomorphism of $R$-modules. Prove that either $\varphi = 0$ or $\varphi$ is an isomorphism.
(This rather innocent statement is known as Schur’s lemma.) [5.10, 6.16, VI.1.16]

5.5. Let $R$ be a ring, viewed as an $R$-module over itself, and let $M$ be an $R$-module.
Prove that $\mathrm{Hom}_{R\text{-Mod}}(R, M) \cong M$ as $R$-modules.

5.6. Let G be an abelian group. Prove that if G has a structure of Q-vector space, 
then it has only one such structure. (Hint: First prove that every element of G has 
necessarily infinite order.) 

5.7. Let K be a field, and let $k \subseteq K$ be a subfield of K. Show that K is a vector
space over k (and in fact a k-algebra) in a natural way. In this situation, we say
that K is an extension of k.

5.8. What is the initial object of the category R-Alg?

5.9. ¬ Let R be a commutative ring, and let M be an R-module. Prove that
the operation of composition on the R-module $\mathrm{End}_{R\text{-Mod}}(M)$ makes the latter an
R-algebra in a natural way.

Prove that $\mathcal{M}_n(R)$ (cf. Exercise 1.4) is an R-algebra, in a natural way. [VI.1.12,
VI.2.3]

5.10. Let R be a commutative ring, and let M be a simple R-module (cf. Exercise
5.4). Prove that $\mathrm{End}_{R\text{-Mod}}(M)$ is a division R-algebra.

5.11. ▷ Let R be a commutative ring, and let M be an R-module. Prove that there
is a bijection between the set of R[x]-module structures on M and $\mathrm{End}_{R\text{-Mod}}(M)$.
[§VI.7.1]

5.12. ▷ Let R be a ring. Let M, N be R-modules, and let $\varphi : M \to N$ be a
homomorphism of R-modules. Assume $\varphi$ is a bijection, so that it has an inverse $\varphi^{-1}$
as a set-function. Prove that $\varphi^{-1}$ is a homomorphism of R-modules. Conclude
that a bijective R-module homomorphism is an isomorphism of R-modules. [§5.2,
§VI.2.1, §IX.1.3]
5.13. Let R be an integral domain, and let I be a nonzero principal ideal of R.
Prove that I is isomorphic to R as an R-module.

5.14. ▷ Prove Proposition 5.18. [§5.4]

5.15. Let R be a commutative ring, and let I, J be ideals of R. Prove that
$I \cdot (R/J) \cong (I + J)/J$ as R-modules.

5.16. ¬ Let R be a commutative ring, M an R-module, and let $a \in R$ be a nilpotent
element, determining a submodule aM of M. Prove that $M = 0 \iff aM = M$.
(This is a particular case of Nakayama’s lemma, Exercise VI.3.8.) [VI.3.8]

5.17. ▷ Let R be a commutative ring, and let I be an ideal of R. Noting that
$I^j \cdot I^k \subseteq I^{j+k}$, define a ring structure on the direct sum

\[ \mathrm{Rees}_R(I) := \bigoplus_{j \ge 0} I^j = R \oplus I \oplus I^2 \oplus I^3 \oplus \cdots . \]

The homomorphism sending R identically to the first term in this direct sum makes
$\mathrm{Rees}_R(I)$ into an R-algebra, called the Rees algebra of I. Prove that if $a \in R$ is
a non-zero-divisor, then the Rees algebra of (a) is isomorphic to the polynomial
ring R[x] (as an R-algebra). [5.18]

5.18. With notation as in Exercise 5.17, let $a \in R$ be a non-zero-divisor, let I be
any ideal of R, and let J be the ideal aI. Prove that $\mathrm{Rees}_R(J) \cong \mathrm{Rees}_R(I)$.


6. Products, coproducts, etc., in R-Mod 

We have stated several times that categories such as R-Mod are ‘well-behaved’. We
will explore in this section the sense in which this can be formalized at this stage.
The bottom line is that these categories enjoy the same nice properties that we 
have noted along the way for the category Ab. 

We will also include here some general considerations on finitely generated 
modules and algebras. 

As in the previous section, we will write ‘module’ for ‘left-module’; the reader 
should make appropriate adaptations to the case of right-modules. Little will be 
lost by assuming that all rings appearing here are commutative (thereby removing 
the distinction between left- and right-modules). 

6.1. Products and coproducts. As in Ab, products and coproducts exist, and
finite products and coproducts coincide, in R-Mod. Indeed, recall the construction
of the direct sum of two abelian groups (§II.3.5): if M and N are abelian groups,
then $M \oplus N$ denotes their product, with componentwise operation. If M and N are
R-modules, we can give an R-module structure to $M \oplus N$ by prescribing, $\forall r \in R$,

\[ r(m, n) := (rm, rn). \]

This defines the direct sum of M, N, as an R-module. Note that $M \oplus N$ comes
together with several homomorphisms of R-modules:

\[ \pi_M : M \oplus N \to M, \qquad \pi_N : M \oplus N \to N \]

sending (m, n) to m, n, respectively, and

\[ i_M : M \to M \oplus N, \qquad i_N : N \to M \oplus N \]

sending m to (m, 0) and n to (0, n).

Proposition 6.1. The direct sum $M \oplus N$ satisfies the universal properties of both
the product and the coproduct of M and N.

Proof. Product: Let P be an R-module, and let $\varphi_M : P \to M$, $\varphi_N : P \to N$ be
two R-module homomorphisms. The definition of an R-module homomorphism

\[ \varphi_M \times \varphi_N : P \to M \oplus N \]

is forced by the needed commutativity of the diagram

[Diagram: $\varphi_M \times \varphi_N : P \to M \oplus N$ followed by the projections $\pi_M$, $\pi_N$ gives $\varphi_M$, $\varphi_N$.]

That is,

\[ (\forall p \in P) : \quad (\varphi_M \times \varphi_N)(p) := (\varphi_M(p), \varphi_N(p)). \]

This is an R-module homomorphism, and it is the unique one making the diagram
commute; therefore $M \oplus N$ works as a product of M and N.

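Coproduct: View the preceding argument through a mirror! Let P be an R-module,
and let $\psi_M : M \to P$, $\psi_N : N \to P$ be two R-module homomorphisms.
The definition of an R-module homomorphism

\[ \psi_M \oplus \psi_N : M \oplus N \to P \]

is forced by the needed commutativity of the diagram

[Diagram: the inclusions $i_M$, $i_N$ followed by $\psi_M \oplus \psi_N : M \oplus N \to P$ give $\psi_M$, $\psi_N$.]

That is${}^{29}$, $(\forall m \in M)(\forall n \in N)$

\begin{align*}
(\psi_M \oplus \psi_N)(m, n) &= (\psi_M \oplus \psi_N)(m, 0) + (\psi_M \oplus \psi_N)(0, n) \\
&= (\psi_M \oplus \psi_N) \circ i_M(m) + (\psi_M \oplus \psi_N) \circ i_N(n) \\
&= \psi_M(m) + \psi_N(n).
\end{align*}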

This is an R-module homomorphism: it is a homomorphism of abelian groups by
virtue of the commutativity of addition in P, and it clearly preserves the action of R.
Since it is the unique R-module homomorphism making the diagram commute, this
verifies that $M \oplus N$ works as a coproduct of M and N. □

29 This should look familiar; cf. Exercise II.3.3.

It may seem like a good idea to write $M \times N$ rather than $M \oplus N$ when viewing
the latter as the product of M and N; but in due time (§VIII.2) we will encounter
$M \times N$ again, in the context of ‘bilinear maps’, and in that context $M \times N$ is not
viewed as an R-module.

The reader should also work out the fibered versions of these constructions; 
cf. Exercises 6.10 and 6.11. 

The fact that finite products and coproducts agree in R-Mod does not extend
to the infinite case (Exercise 6.7).

6.2. Kernels and cokernels. The facts that R-Mod has a zero-object (the
0-module), its Hom sets are abelian groups (§5.2), and it has (finite) products and
coproducts make R-Mod an additive category. The fact that R-Mod has
well-behaved kernels and cokernels, which we review next, upgrades it to the status of
abelian category. (We will come back to these general definitions in §IX.1.)

In general, monomorphisms and epimorphisms do not automatically satisfy
good properties, even when objects of a given category are realized by adding
structure to sets. For example, we have seen that the precise relationship between
‘surjective morphism’ and ‘epimorphism’ may be rather subtle: epimorphisms are
surjective in Grp, but for complicated reasons (§II.8.6); and there are epimorphisms
that are not surjective in Ring (§2.3).

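The situation in R-Mod is as simple as it can be. Recall that we have identified
universal properties for kernels and cokernels (cf. §II.8.6); in the category R-Mod
these would go as follows: if

\[ \varphi : M \to N \]

is a homomorphism of R-modules, then $\ker \varphi$ is final with respect to the property
of factoring R-module homomorphisms $\alpha : P \to M$ such that $\varphi \circ \alpha = 0$:

[Diagram: $\alpha : P \to M$ with $\varphi \circ \alpha = 0$ factors uniquely through $\ker \varphi \to M$.]

while $\operatorname{coker} \varphi$ is initial with respect to the property of factoring R-module
homomorphisms $\beta : N \to P$ such that $\beta \circ \varphi = 0$:

[Diagram: $\beta : N \to P$ with $\beta \circ \varphi = 0$ factors uniquely through $N \to \operatorname{coker} \varphi$.]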

Proposition 6.2. The following hold in R-Mod:

• kernels and cokernels exist;

• $\varphi$ is a monomorphism $\iff$ $\ker \varphi$ is trivial $\iff$ $\varphi$ is injective as a
set-function;

• $\varphi$ is an epimorphism $\iff$ $\operatorname{coker} \varphi$ is trivial $\iff$ $\varphi$ is surjective as a
set-function.

Further, every monomorphism identifies its source with the kernel of some
morphism, and every epimorphism identifies its target with the cokernel of some
morphism.

This proposition of course simply generalizes to R-Mod facts we know already
from our study of Ab, and a quick review should suffice for the careful reader.
Kernels exist: indeed, the ‘standard’ definition of kernel satisfies the universal
properties spelled out above (same argument as in Proposition II.6.6). Cokernels exist:
indeed, let

\[ \operatorname{coker} \varphi = \frac{N}{\operatorname{im} \varphi}; \]

if $\beta : N \to P$ is such that $\beta \circ \varphi = 0$, then $\operatorname{im} \varphi \subseteq \ker \beta$; so $\beta$ must factor uniquely
through $N/\operatorname{im} \varphi$ by the universal property of quotients, Theorem 5.14. That is,
$N/\operatorname{im} \varphi$ does satisfy the universal property for cokernels.

The proofs of all the implications in the second and third points in Proposition
6.2 follow familiar patterns from (for example) Proposition II.6.12 and
Proposition II.8.18. The last sentence of Proposition 6.2 simply reiterates the slogan
submodule $\iff$ kernel and its mirror statement (which is just as true). Further
details are left to the reader.

By the way, whatever happened to the conditions characterizing monomorphisms
and epimorphisms in Set (Proposition I.2.1)? In Set, a function is a monomorphism
if and only if it has a left-inverse, and it is an epimorphism if and only if it
has a right-inverse. We have learned not to expect any of this to happen in more
general categories. Modules are not an exception: the function $\mathbb{Z} \to \mathbb{Z}$ defined by
‘multiplication by 2’ is a monomorphism without a left-inverse, and the projection
$\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ is an epimorphism without a right-inverse. We will come back to this
point in §7.2.
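
The two failures just mentioned can be checked by hand. The short verification below is a sketch; the names $\psi$, $\sigma$ and $\pi$ are ours, introduced only for this illustration.

```latex
% Hypothetical left-inverse: a Z-module homomorphism \psi : Z -> Z
% with \psi(2n) = n for all n. By linearity,
\[
1 = \psi(2 \cdot 1) = 2\,\psi(1),
\]
% which is impossible, since 1 is not twice an integer.
%
% Hypothetical right-inverse: a homomorphism \sigma : Z/2Z -> Z with
% \pi \circ \sigma = id, where \pi : Z -> Z/2Z is the projection. Then
\[
2\,\sigma(\bar 1) = \sigma(\bar 2) = \sigma(\bar 0) = 0
\;\Longrightarrow\; \sigma(\bar 1) = 0
\;\Longrightarrow\; \pi(\sigma(\bar 1)) = \bar 0 \neq \bar 1 .
\]
```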


6.3. Free modules and free algebras. The universal property of free R-modules
is modeled after the properties defining the other free objects we have encountered:
the goal is to define the R-module containing a given set A ‘in the most efficient
way’. Again, the situation will match the case of abelian groups closely, so the
reader may want to refer back to §II.5.4.

The universal property formalizing the heuristic requirement goes as follows:
given a set A, we seek an R-module $F^R(A)$, called a free R-module on the set
A, together with a set-function $j : A \to F^R(A)$, such that for all R-modules M
and set-functions $f : A \to M$ there exists a unique R-module homomorphism
$\varphi : F^R(A) \to M$ such that the diagram

[Diagram: $j : A \to F^R(A)$ followed by $\varphi : F^R(A) \to M$ equals $f : A \to M$.]

commutes. Abstract nonsense guarantees that such an R-module is unique up to
isomorphism if it exists at all (Proposition I.5.4) and that the function $j : A \to
F^R(A)$ is necessarily injective (cf. Exercise II.5.3). The question is, does it exist?

The answer will not appear to be very exciting, since it generalizes directly the
case of abelian groups (that is, Z-modules). Given any set A, define the (possibly
infinite) direct sum $N^{\oplus A}$ of an R-module N as follows:

\[ N^{\oplus A} := \{\alpha : A \to N \mid \alpha(a) \neq 0 \text{ for only finitely many elements } a \in A\}. \]

Of course this agrees with the definition given for abelian groups in §II.5.4; $N^{\oplus A}$
has an evident R-module structure, obtained by defining, for all $r \in R$ and $a \in A$,

\[ (r\alpha)(a) := r(\alpha(a)). \]

For N = R we may define a function $j : A \to R^{\oplus A}$ by sending $a \in A$ to the
function $j_a : A \to R$:

\[ (\forall x \in A) : \quad j_a(x) := \begin{cases} 1 & \text{if } x = a, \\ 0 & \text{if } x \neq a. \end{cases} \]

Claim 6.3. $F^R(A) \cong R^{\oplus A}$.

The proof of this claim matches precisely the proof of Proposition II.5.6 and
is left to the reader (Exercise 6.1). The key is that every element of $R^{\oplus A}$ may be
written uniquely as a finite sum

\[ \sum_{a \in A} r_a a \]

(shorthand for $\sum_{a \in A} r_a\, j(a)$); incidentally, this is how elements of ‘the free
R-module on A’ are often written; this is legal, by virtue of Claim 6.3.

In particular, for A = {1, . . . , n} a finite set, Claim 6.3 states that the R-module
$R^{\oplus n}$, with $j : A \to R^{\oplus n}$ defined by

\[ j(i) := (0, \dots, 0, \underbrace{1}_{i\text{-th place}}, 0, \dots, 0) \in R^{\oplus n}, \]

satisfies the universal property for $F^R(\{1, \dots, n\})$.
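
To make the universal property concrete in the finite case, the sketch below writes out the homomorphism induced by a set-function $f : \{1, \dots, n\} \to M$. The name $\varphi_f$ is ours, introduced only for this illustration.

```latex
% Hypothetical notation: \varphi_f denotes the homomorphism induced by f.
\[
\varphi_f(r_1, \dots, r_n) := \sum_{i=1}^{n} r_i\, f(i) \in M .
\]
% Compatibility with j: the i-th standard generator goes to f(i).
\[
\varphi_f(j(i)) = \varphi_f(0, \dots, 0, 1, 0, \dots, 0) = 1 \cdot f(i) = f(i).
\]
% Uniqueness: any R-module homomorphism \varphi' with \varphi' \circ j = f satisfies
\[
\varphi'(r_1, \dots, r_n)
= \varphi'\Bigl(\sum_{i=1}^{n} r_i\, j(i)\Bigr)
= \sum_{i=1}^{n} r_i\, \varphi'(j(i))
= \sum_{i=1}^{n} r_i\, f(i).
\]
```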

This is all entirely analogous to the story for Z-modules. The situation becomes
a little more interesting if we switch from R-modules to commutative R-algebras;
the category is different, so we should expect a different answer. The finite case is
essentially the only one we will need in these notes, so assume A = {1, . . . , n} is
a finite set. In this case, we write R[A] for the polynomial ring $R[x_1, \dots, x_n]$; we
have a set-function $j : A \to R[A]$, defined by $j(i) = x_i$.

Proposition 6.4. R[A] is a free commutative R-algebra on the set A.


Proof. The statement translates into the following: for every commutative
R-algebra S and every set-function $f : A \to S$, there exists a unique R-algebra
homomorphism $\varphi : R[A] \to S$ such that the diagram

[Diagram: $j : A \to R[A]$ followed by $\varphi : R[A] \to S$ equals $f : A \to S$.]

commutes. Since S is an R-algebra, we have a fixed homomorphism of rings $\alpha :
R \to S$ (cf. Example 5.6). Then we may construct $\varphi : R[A] = R[x_1, \dots, x_n] \to S$
by applying n times the ‘extension property’ of Example 2.3: extend $\alpha$ to $R[x_1]$ so
as to map $x_1$ to f(1), then to $R[x_1, x_2] = R[x_1][x_2]$ so as to map $x_2$ to f(2), etc.
Note that each extension is uniquely determined by its requirements.

This gives $\varphi$ as a ring homomorphism and shows that it is unique. The reader
will verify that $\varphi$ is also (automatically) an R-module homomorphism, and hence
a homomorphism of R-algebras, concluding the proof. □
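
For n = 2 the homomorphism built by successive extension can be written down explicitly. The sketch below writes $\alpha : R \to S$ for the structure map of the R-algebra S and uses a sample Z-algebra chosen by us for illustration; commutativity of S is what allows f(1) and f(2) to be substituted for commuting variables.

```latex
% General formula for n = 2 (finite sums; c_{ab} \in R):
\[
\varphi\Bigl(\sum_{a,b \ge 0} c_{ab}\, x_1^a x_2^b\Bigr)
= \sum_{a,b \ge 0} \alpha(c_{ab})\, f(1)^a f(2)^b .
\]
% Illustration (our choice): R = \mathbb{Z}, S = \mathbb{C}, f(1) = i, f(2) = 2.
\[
\varphi(x_1^2 + 3 x_1 x_2 + x_2) = i^2 + 3 \cdot i \cdot 2 + 2 = 1 + 6i .
\]
```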

After the fact, the reader may want to revisit §2.2 and recognize that the
‘universal property’ of polynomial rings $\mathbb{Z}[x_1, \dots, x_n]$ given there was really a version of
their role as free objects in the category of commutative rings, a.k.a. commutative
Z-algebras.

It is not difficult to identify free objects in the larger category R-Alg: they
consist of ‘noncommutative polynomial rings’ $R\langle A \rangle$, with variables from the set A
but without any condition relating ab and ba for $a \neq b$ in A. More precisely, $R\langle A \rangle$
is isomorphic to the monoid ring (cf. §1.4) over the free monoid on A, consisting of
all finite strings of elements in A, with operation defined by concatenation${}^{30}$. We
will encounter this ring again, but only in the distant future (Example VIII.4.17).

6.4. Submodule generated by a subset; Noetherian modules. Let M be
an R-module, and let $A \subseteq M$ be a subset of M. By the universal property of free
modules, there is a unique homomorphism of R-modules

\[ \varphi_A : R^{\oplus A} \to M. \]

The image of this homomorphism is a submodule of M, the submodule generated by
A in M, usually denoted $\langle A \rangle$ (or $\langle a_1, \dots, a_n \rangle$ if $A = \{a_1, \dots, a_n\}$ is finite). Thus,

\[ \langle A \rangle = \Bigl\{ \sum_{a \in A} r_a a \;\Big|\; r_a \neq 0 \text{ for only finitely many elements } a \in A \Bigr\}. \]

It is hopefully clear that $\langle A \rangle$ is the smallest submodule of M containing A.

The module M is finitely generated if $M = \langle A \rangle$ for a finite set A, that is, if
and only if there is a surjective homomorphism of R-modules

\[ R^{\oplus n} \twoheadrightarrow M \]

for some n. One of the highlights of Chapter VI will be the classification of finitely
generated modules over PIDs (Theorem VI.5.6). We have already briefly mentioned
the case for Z (recall that Z is a PID and Z-modules are nothing but abelian groups!)
back in §II.6.3.

30 This construction is similar to the free group on A, but without the complication of the
presence of inverses and of possible cancellations.

Finitely generated R-modules are tremendously important, but they are not as
well-behaved as one might hope at first. For example, it may be that a module M
is finitely generated, but some submodule of M is not finitely generated!

Example 6.5. Let $R = \mathbb{Z}[x_1, x_2, \dots]$, a polynomial ring on infinitely many
indeterminates. Then R is finitely generated as an R-module: indeed, 1 generates it.
However, the ideal

\[ (x_1, x_2, \dots) \]

of R generated by all indeterminates is not finitely generated as an R-module
(Exercise 6.14).

Definition 6.6. An R-module M is Noetherian if every submodule of M is finitely
generated as an R-module.

Thus, a ring R is Noetherian in the sense of Definition 4.2 if and only if it is
Noetherian ‘as a module over itself’. The ring in Example 6.5 is not Noetherian.

We will study the Noetherian condition more carefully later on (§V.1.1); but
we can already see one reason why this is a good, ‘solid’ notion.

Proposition 6.7. Let M be an R-module, and let N be a submodule of M. Then
M is Noetherian if and only if both N and M/N are Noetherian.

Proof. If M is Noetherian, then so is M/N (same proof as for Exercise 4.2), and so
is N (because every submodule of N is a submodule of M, so it is finitely generated
because M is Noetherian). This proves the ‘only if’ part of the statement.

For the converse, assume N and M/N are Noetherian, and let P be a submodule
of M; we have to prove that P is finitely generated. Since $P \cap N$ is a submodule
of N and N is Noetherian, $P \cap N$ is finitely generated. By the ‘second isomorphism
theorem’, Proposition 5.18,

\[ \frac{P}{P \cap N} \cong \frac{P + N}{N}, \]

and hence $P/(P \cap N)$ is isomorphic to a submodule of M/N. Since M/N is
Noetherian, this shows that $P/(P \cap N)$ is finitely generated.

It follows that P itself is finitely generated, by Exercise 6.18. □

Corollary 6.8. Let R be a Noetherian ring, and let M be a finitely generated
R-module. Then M is Noetherian (as an R-module).

Proof. Indeed, by hypothesis there is an onto homomorphism $R^{\oplus n} \twoheadrightarrow M$ of
R-modules; hence (by the first isomorphism theorem, Corollary 5.16) M is isomorphic
to a quotient of $R^{\oplus n}$. By Proposition 6.7, it suffices to prove that $R^{\oplus n}$ is Noetherian.

This may be done by induction. The statement is true for n = 1 by hypothesis.
For n > 1, assume we know that $R^{\oplus (n-1)}$ is Noetherian; since $R^{\oplus (n-1)}$ may be
viewed as a submodule of $R^{\oplus n}$, in such a way that

\[ \frac{R^{\oplus n}}{R^{\oplus (n-1)}} \cong R \]

(Exercise 6.4), and R is Noetherian, it follows that $R^{\oplus n}$ is Noetherian, again by
applying Proposition 6.7. □

6.5. Finitely generated vs. finite type. If S is an R-algebra, it may be ‘finitely
generated’ in two very different ways: as an R-module and as an R-algebra. It is
important to keep these two concepts distinct, although unfortunately the
language used to express them is very similar.

The following definitions differ in three small details...

“S is finitely generated as a module
over R if there is an onto homomorphism
of R-modules from the free R-module
on a finite set to S.”

“S is finitely generated as an algebra
over R if there is an onto homomorphism
of R-algebras from the free R-algebra
on a finite set to S.”

The mathematical difference is more substantial than it may appear. As we
have seen in §6.3, the free R-module over a finite set A = {1, . . . , n} is isomorphic
to $R^{\oplus n}$; the free commutative R-algebra over A is isomorphic to $R[x_1, \dots, x_n]$.
Thus, a commutative${}^{31}$ ring S is finitely generated as an R-module if there is an
onto homomorphism of R-modules

\[ R^{\oplus n} \twoheadrightarrow S \]

for some n; it is finitely generated as an R-algebra if there is an onto homomorphism
of R-algebras

\[ R[x_1, \dots, x_n] \twoheadrightarrow S \]

for some n. In other words, S is finitely generated as an R-module if and only if
$S \cong R^{\oplus n}/M$ for some n and a submodule M of $R^{\oplus n}$; it is a finite-type R-algebra
if and only if $S \cong R[x_1, \dots, x_n]/I$ for some n and an ideal I of $R[x_1, \dots, x_n]$.

We say that S is finite in the first case${}^{32}$ and of finite type in the second. It
is clear that ‘finite’ $\implies$ ‘finite type’; it should be just as clear that the converse
does not hold.

Example 6.9. The polynomial ring R[x] is a finite-type R-algebra, but it is not
finite as an R-module.
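
A short degree argument explains why a polynomial ring in one variable is of finite type but not finite over its coefficient ring. The sketch below assumes R is not the zero ring (an assumption we add for the argument); the generators $p_i$ and the bound d are names of our own choosing.

```latex
% Assume R \neq 0. Suppose p_1, \dots, p_n \in R[x] generate R[x] as an
% R-module, and let d \ge 0 be an upper bound for the degrees of the p_i.
\[
\deg\Bigl(\sum_{i=1}^{n} r_i\, p_i\Bigr) \le d
\qquad \text{for all } r_1, \dots, r_n \in R
\text{ (whenever the sum is nonzero)},
\]
% so the nonzero polynomial x^{d+1} is not an R-linear combination of the
% p_i, a contradiction. Finite type, on the other hand, is immediate:
\[
R[x_1] \twoheadrightarrow R[x], \qquad x_1 \mapsto x .
\]
```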

The distinction, while macroscopic in general, may evaporate in special,
important cases. For example, one can prove that if k and K are fields and $k \subseteq K$, then
K is of finite type over k if and only if it is in fact finite as a k-module (that is, it is
a finite-dimensional k-vector space). This is one version of Hilbert’s Nullstellensatz,
a deep result we already mentioned in Example 4.15 and that we will prove (in an
important class of examples) in §VII.2.2.

David Hilbert’s name is associated to another important result concerning
finite-type R-algebras: if R is Noetherian (as a ring, that is, as an R-module)
and S is a finite-type R-algebra, then S is also Noetherian (as a ring, that is, as
an S-module). This is an immediate consequence of the so-called Hilbert’s basis
theorem.

The proof of Hilbert’s basis theorem is completely elementary: it could be given
here as an exercise, with a few key hints; we will see it in §V.1.1.

31 We are mostly interested in the commutative case, so we will make this hypothesis here;
the only change in the general case is typographical: $\langle \cdots \rangle$ rather than $[\cdots]$. Also, note that a
commutative ring is finitely generated as an algebra if and only if it is finitely generated as a
commutative algebra; cf. Exercise 6.15.

32 This is particularly unfortunate, since S may very well be an infinite set.


Exercises 


6.1. ▷ Prove Claim 6.3. [§6.3]

6.2. Prove or disprove that if R is a ring and M is a nonzero R-module, then M is
not isomorphic to $M \oplus M$.

6.3. Let R be a ring, M an R-module, and $p : M \to M$ an R-module
homomorphism such that $p^2 = p$. (Such a map is called a projection.) Prove that
$M \cong \ker p \oplus \operatorname{im} p$.

6.4. ▷ Let R be a ring, and let n > 1. View $R^{\oplus (n-1)}$ as a submodule of $R^{\oplus n}$, via
the injective homomorphism $R^{\oplus (n-1)} \hookrightarrow R^{\oplus n}$ defined by

\[ (r_1, \dots, r_{n-1}) \mapsto (r_1, \dots, r_{n-1}, 0). \]

Give a one-line proof that

\[ \frac{R^{\oplus n}}{R^{\oplus (n-1)}} \cong R. \]

[§6.4]

6.5. ▷ (Notation as in §6.3.) For any ring R and any two sets $A_1$, $A_2$, prove that
$(R^{\oplus A_1})^{\oplus A_2} \cong R^{\oplus (A_1 \times A_2)}$. [§VIII.2.2]

6.6. ¬ Let R be a ring, and let $F = R^{\oplus n}$ be a finitely generated free R-module.
Prove that $\mathrm{Hom}_{R\text{-Mod}}(F, R) \cong F$. On the other hand, find an example of a ring R
and a nonzero R-module M such that $\mathrm{Hom}_{R\text{-Mod}}(M, R) = 0$. [6.8]

6.7. ▷ Let A be any set.

• For any family $\{M_a\}_{a \in A}$ of modules over a ring R, define the product $\prod_{a \in A} M_a$
and coproduct $\bigoplus_{a \in A} M_a$. If $M_a \cong R$ for all $a \in A$, these are denoted $R^A$, $R^{\oplus A}$,
respectively.

• Prove that $\mathbb{Z}^{\mathbb{N}} \not\cong \mathbb{Z}^{\oplus \mathbb{N}}$. (Hint: Cardinality.)

[§6.1, 6.8]

6.8. Let R be a ring. If A is any set, prove that $\mathrm{Hom}_{R\text{-Mod}}(R^{\oplus A}, R)$ satisfies
the universal property for the product of the family $\{R_a\}_{a \in A}$, where $R_a \cong R$ for
all a; thus, $\mathrm{Hom}_{R\text{-Mod}}(R^{\oplus A}, R) \cong R^A$. Conclude that $\mathrm{Hom}_{R\text{-Mod}}(R^{\oplus A}, R)$ is not
isomorphic to $R^{\oplus A}$ in general (cf. Exercises 6.6 and 6.7).

6.9. ¬ Let R be a ring, F a nonzero free R-module, and let $\varphi : M \to N$ be a
homomorphism of R-modules. Prove that $\varphi$ is onto if and only if for all R-module
homomorphisms $\alpha : F \to N$ there exists an R-module homomorphism $\beta : F \to M$
such that $\alpha = \varphi \circ \beta$. (Free modules are projective, as we will see in Chapter VIII.)
[7.8, VI.5.5]

6.10. ▷ (Cf. Exercise I.5.12.) Let M, N, and Z be R-modules, and let $\mu : M \to Z$,
$\nu : N \to Z$ be homomorphisms of R-modules.

Prove that R-Mod has ‘fibered products’: there exists an R-module $M \times_Z N$
with R-module homomorphisms $\pi_M : M \times_Z N \to M$, $\pi_N : M \times_Z N \to N$, such that
$\mu \circ \pi_M = \nu \circ \pi_N$, and which is universal with respect to this requirement. That is, for
every R-module P and R-module homomorphisms $\varphi_M : P \to M$, $\varphi_N : P \to N$ such
that $\mu \circ \varphi_M = \nu \circ \varphi_N$, there exists a unique R-module homomorphism $P \to M \times_Z N$
making the diagram

[Diagram: the unique map $P \to M \times_Z N$ composed with $\pi_M$, $\pi_N$ gives $\varphi_M$, $\varphi_N$.]

commute. The module $M \times_Z N$ may be called the pull-back of M along $\nu$ (or of
N along $\mu$, since the construction is symmetric). ‘Fiber diagrams’

\[
\begin{array}{ccc}
M \times_Z N & \xrightarrow{\ \pi_N\ } & N \\
{\scriptstyle \pi_M}\downarrow & \square & \downarrow{\scriptstyle \nu} \\
M & \xrightarrow{\ \mu\ } & Z
\end{array}
\]

are commutative, but ‘even better’ than commutative; they are often decorated by
a square, as shown here. [§6.1, 6.11, §IX.1.4]

6.11. ▷ Define a notion of fibered coproduct of two R-modules M, N, along an
R-module A, in the style of Exercise 6.10 (and cf. Exercise I.5.12)

\[
\begin{array}{ccc}
A & \xrightarrow{\ \nu\ } & N \\
{\scriptstyle \mu}\downarrow & & \downarrow \\
M & \longrightarrow & M \oplus_A N
\end{array}
\]

Prove that fibered coproducts exist in R-Mod. The fibered coproduct $M \oplus_A N$ is
called the push-out of M along $\nu$ (or of N along $\mu$). [§6.1]

6.12. Prove Proposition 6.2. 

6.13. Prove that every homomorphic image of a finitely generated module is finitely
generated.

6.14. ▷ Prove that the ideal $(x_1, x_2, \dots)$ of the ring $R = \mathbb{Z}[x_1, x_2, \dots]$ is not finitely
generated (as an ideal, i.e., as an R-module). [§6.4]

6.15. ▷ Let R be a commutative ring. Prove that a commutative R-algebra S is
finitely generated as an algebra over R if and only if it is finitely generated as a
commutative algebra over R. (Cf. §6.5.) [§6.5]
6.16. ▷ Let R be a ring. A (left-)R-module M is cyclic if $M = \langle m \rangle$ for some
$m \in M$. Prove that simple modules (cf. Exercise 5.4) are cyclic. Prove that an
R-module M is cyclic if and only if $M \cong R/I$ for some (left-)ideal I. Prove that
every quotient of a cyclic module is cyclic. [6.17, §VI.4.1]

6.17. ¬ Let M be a cyclic R-module, so that $M \cong R/I$ for a (left-)ideal I
(Exercise 6.16), and let N be another R-module.

• Prove that $\mathrm{Hom}_{R\text{-Mod}}(M, N) \cong \{n \in N \mid (\forall a \in I),\ an = 0\}$.

• For $a, b \in \mathbb{Z}$, prove that $\mathrm{Hom}_{\mathbb{Z}\text{-Mod}}(\mathbb{Z}/a\mathbb{Z}, \mathbb{Z}/b\mathbb{Z}) \cong \mathbb{Z}/\gcd(a, b)\mathbb{Z}$.

[7.7]

6.18. ▷ Let M be an R-module, and let N be a submodule of M. Prove that if N
and M/N are both finitely generated, then M is finitely generated. [§6.4]


7. Complexes and homology

In many contexts, modules arise not ‘one at a time’ but in whole series: for example,
a real manifold of dimension d has one ‘homology’ group for each dimension from 0
to d. It is necessary to develop a language capable of dealing with whole sequences
of modules at once. This is the language of homological algebra, of which we will
get a tiny taste in this section, and a slightly heartier course in Chapter IX.

7.1. Complexes and exact sequences. A chain complex of R-modules (or, for
simplicity, a complex) is a sequence of R-modules and R-module homomorphisms

\[ \cdots \xrightarrow{d_{i+2}} M_{i+1} \xrightarrow{d_{i+1}} M_i \xrightarrow{\ d_i\ } M_{i-1} \xrightarrow{d_{i-1}} \cdots \]

such that $(\forall i) : d_i \circ d_{i+1} = 0$.

The notation $(M_\bullet, d_\bullet)$ may be used to denote a complex, or simply $M_\bullet$ for
simplicity (but do not forget that the homomorphisms $d_i$ are part of the information
carried by a complex).

A complex may be infinite in both directions; ‘tails’ of 0’s are (usually) omitted.
Several possible alternative conventions may be used: for example, indices may be
increasing rather than decreasing, giving a cochain complex (whose homology is
called cohomology; this will be our choice in Chapter IX). Such choices are clearly
mathematically immaterial, at least for the simple considerations which follow.

The homomorphisms $d_i$ are called boundaries, or differentials, due to important
examples from geometry. Note that the defining condition

\[ d_i \circ d_{i+1} = 0 \]

is equivalent to the requirement

\[ \operatorname{im} d_{i+1} \subseteq \ker d_i. \]

We carry in our minds an image such as
[Figure: a row of ovals, one for each module, each mapped into the next; in every oval a gray region is squashed to the 0 element of the following oval.]

when we think of a complex. The ovals are the modules $M_i$; the fat black dots are
the 0 elements; the gray ovals, getting squashed to zero at each step, are the kernels;
and we thus visualize the fact that the image of ‘the preceding homomorphism’ falls
inside the kernel of ‘the next homomorphism’.

The picture is inaccurate in that it hints that the ‘difference’ between the image
of $d_{i+1}$ and the kernel of $d_i$ (that is, the areas colored in a lighter shade of gray)
should be the same for all i; this is of course not the case in general. In fact, almost
the whole point about complexes is to ‘measure’ this difference, which is called the
homology of the complex (cf. §7.3). We say that a complex is exact ‘at $M_i$’ if it has
no homology there; that is,

\[ \operatorname{im} d_{i+1} = \ker d_i. \]

Visually,

[Figure: three consecutive ovals; in the middle one, the image of the incoming homomorphism fills the whole kernel of the outgoing one.]

This complex appears to be exact at the oval in the middle.

For example, if $M_i$ is a trivial module (usually denoted simply by 0), then the
complex is necessarily exact at $M_i$, since then $\operatorname{im} d_{i+1} = \ker d_i = 0$.

A complex is exact (and is often called an exact sequence) if it is exact at all its
modules.

Example 7.1. A complex

\[ \cdots \longrightarrow 0 \longrightarrow L \xrightarrow{\ \alpha\ } M \longrightarrow \cdots \]

is exact at L if and only if $\alpha$ is a monomorphism.

Indeed, exactness at L is equivalent to $\ker \alpha$ = image of the trivial
homomorphism $0 \to L$, that is, to

\[ \ker \alpha = 0. \]

This is equivalent to the injectivity of $\alpha$ (Proposition 6.2).

Example 7.2. A complex

\[ \cdots \longrightarrow M \xrightarrow{\ \beta\ } N \longrightarrow 0 \longrightarrow \cdots \]

is exact at N if and only if $\beta$ is an epimorphism.

Indeed, the complex is exact at N if and only if $\operatorname{im} \beta$ = kernel of the trivial
homomorphism $N \to 0$, that is, $\operatorname{im} \beta = N$.

Definition 7.3. A short exact sequence is an exact complex of the form

\[ 0 \longrightarrow L \xrightarrow{\ \alpha\ } M \xrightarrow{\ \beta\ } N \longrightarrow 0. \]

As seen in the previous two examples, exactness at L and N is equivalent to $\alpha$
being injective and $\beta$ being surjective. The extra piece of data carried by a short
exact sequence is the exactness at M, that is,

\[ \operatorname{im} \alpha = \ker \beta; \]

by the first isomorphism theorem (Corollary 5.16), we then have

\[ N \cong \frac{M}{\ker \beta} = \frac{M}{\operatorname{im} \alpha}. \]

All in all, we have good material to work on some more Pavlovian conditioning: at
the sight of a short exact sequence as above, the reader should instinctively identify
L with a submodule of M (via the injective map $\alpha$) and N with the quotient M/L
(via the isomorphism induced by the surjective map $\beta$, under the auspices of the
first isomorphism theorem).
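
A standard instance may help fix the reflex. The sketch below uses a positive integer n and the canonical projection $\pi$; both names are ours, chosen for illustration.

```latex
% Illustration (our choice): n a positive integer, \pi the canonical projection.
\[
0 \longrightarrow \mathbb{Z} \xrightarrow{\ \cdot n\ } \mathbb{Z}
\xrightarrow{\ \pi\ } \mathbb{Z}/n\mathbb{Z} \longrightarrow 0
\]
% Exactness at each of the three nonzero spots:
\[
\ker(\cdot n) = 0, \qquad
\operatorname{im}(\cdot n) = n\mathbb{Z} = \ker \pi, \qquad
\operatorname{im} \pi = \mathbb{Z}/n\mathbb{Z}.
\]
% Here L = \mathbb{Z} is identified with the submodule n\mathbb{Z} of M = \mathbb{Z},
% and N = \mathbb{Z}/n\mathbb{Z} is the quotient M/L.
```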

Short exact sequences abound in nature. For example, a single homomorphism
$\varphi : M \to M'$ gives rise immediately to a short exact sequence

\[ 0 \longrightarrow \ker \varphi \longrightarrow M \longrightarrow \operatorname{im} \varphi \longrightarrow 0. \]

In fact, one important reason to focus on short exact sequences is that this
observation allows us to break up every exact complex into a large number of short exact
sequences: contemplate the impressive diagram

[Diagram: an exact complex drawn horizontally, with diagonal short exact sequences $0 \to \ker d_i \to M_i \to \operatorname{im} d_i \to 0$ meeting at $\operatorname{im} d_{i+1} = \ker d_i$.]

The diagonal sequences are short exact sequences, and they interlock nicely by the
exactness of the horizontal complex.

This observation simplifies many arguments; cf. for example Exercise 7.5.

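7.2. Split exact sequences. A particular case of short exact sequence arises by
considering the second projection from a direct sum: $M_1 \oplus M_2 \to M_2$; there is then
an exact sequence

\[ 0 \longrightarrow M_1 \longrightarrow M_1 \oplus M_2 \longrightarrow M_2 \longrightarrow 0, \]

obtained by identifying $M_1$ with the kernel of the projection. These short exact
sequences are said to ‘split’; more generally, a short exact sequence

\[ 0 \longrightarrow M_1 \longrightarrow N \longrightarrow M_2 \longrightarrow 0 \]

‘splits’ if it is isomorphic to one of these sequences in the sense that there is a
commutative diagram

\[
\begin{array}{ccccccccc}
0 & \to & M_1 & \to & N & \to & M_2 & \to & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \to & M_1' & \to & M_1' \oplus M_2' & \to & M_2' & \to & 0
\end{array}
\]

in which the vertical maps are all isomorphisms${}^{33}$.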

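Example 7.4. The exact sequence of Z-modules

\[ 0 \longrightarrow \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \longrightarrow \mathbb{Z}/2\mathbb{Z} \longrightarrow 0 \]

is not split.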
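
One way to see the failure of splitting, reading the sequence as multiplication by 2 on Z followed by the projection to Z/2Z (as in the discussion at the end of §6.2), is to compare torsion. This is a sketch of the argument rather than the text's own proof.

```latex
% If 0 -> Z --(.2)--> Z --> Z/2Z -> 0 were split, the middle term would satisfy
\[
\mathbb{Z} \cong \mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}.
\]
% But the right-hand side contains a nonzero element killed by 2,
\[
2 \cdot (0, \bar 1) = (0, \bar 2) = (0, \bar 0), \qquad (0, \bar 1) \neq 0,
\]
% while in \mathbb{Z} the equation 2x = 0 forces x = 0. No such isomorphism exists.
```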

Splitting sequences give us the opportunity to go back to a question we left
dangling at the end of §6.2: what should we make of the condition of ‘having a
left- (resp., right-) inverse’ for a homomorphism? We realized that this condition is
stronger than the requirement of being a monomorphism (resp., an epimorphism);
can we give a more explicit description of such morphisms?

Proposition 7.5. Let $\varphi : M \to N$ be an R-module homomorphism. Then

• $\varphi$ has a left-inverse if and only if the sequence

\[ 0 \longrightarrow M \xrightarrow{\ \varphi\ } N \longrightarrow \operatorname{coker} \varphi \longrightarrow 0 \]

splits.

• $\varphi$ has a right-inverse if and only if the sequence

\[ 0 \longrightarrow \ker \varphi \longrightarrow M \xrightarrow{\ \varphi\ } N \longrightarrow 0 \]

splits.

33 In fact, this last requirement is somewhat redundant; cf. Exercise 7.11.
Proof. We will prove the first part and leave the other as an exercise to the reader
(Exercise 7.6).

If the sequence splits, then $\varphi$ may be identified with the embedding of M into
a direct sum $M \oplus M'$, and the projection $M \oplus M' \to M$ gives a left-inverse of $\varphi$.
Conversely, assume that $\varphi$ has a left-inverse $\psi$:

[Diagram: $0 \to M \xrightarrow{\varphi} N$, with $\psi : N \to M$ satisfying $\psi \circ \varphi = \mathrm{id}_M$.]

Then we claim that N is isomorphic to $M \oplus \ker \psi$ and that $\varphi$ corresponds to the
identification of M with the first factor: $M \to M \oplus \ker \psi \cong N$. The isomorphism
$M \oplus \ker \psi \to N$ is given by

\[ (m, k) \mapsto \varphi(m) + k; \]

its inverse $N \to M \oplus \ker \psi$ is

\[ n \mapsto (\psi(n),\ n - \varphi\psi(n)). \]

The element $n - \varphi\psi(n)$ is in $\ker \psi$ as it should be, since

\[ \psi(n - \varphi\psi(n)) = \psi(n) - \psi\varphi\psi(n) = \psi(n) - \psi(n) = 0. \]

All necessary verifications are immediate and are left to the reader. □
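
For readers who want to see one of the omitted verifications, here is a sketch checking that the two maps in the proof are inverse to each other. The names $\Phi$ and $\Psi$ for these maps are ours; the only facts used are $\psi \circ \varphi = \mathrm{id}_M$ and $\psi(k) = 0$ for $k \in \ker \psi$.

```latex
% Hypothetical names: \Phi(m,k) := \varphi(m) + k, \quad \Psi(n) := (\psi(n),\, n - \varphi\psi(n)).
\begin{align*}
\Psi(\Phi(m,k))
  &= \bigl(\psi(\varphi(m) + k),\ \varphi(m) + k - \varphi\psi(\varphi(m) + k)\bigr) \\
  &= \bigl(m + 0,\ \varphi(m) + k - \varphi(m)\bigr) = (m, k), \\
\Phi(\Psi(n))
  &= \varphi(\psi(n)) + \bigl(n - \varphi\psi(n)\bigr) = n .
\end{align*}
```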

Because of Proposition 7.5, R-module homomorphisms with a left-inverse are
called split monomorphisms, and homomorphisms with a right-inverse are called
split epimorphisms.

We will come back to split exact sequences (in the more demanding context
of Grp) in §IV.5.2 and then later again when we return to modules and, more
generally, to abelian categories (Chapters VIII and IX).


7.3. Homology and the snake lemma. 

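Definition 7.6. The $i$-th homology of a complex

$$M_\bullet:\quad \cdots \longrightarrow M_{i+1} \xrightarrow{\ d_{i+1}\ } M_i \xrightarrow{\ d_i\ } M_{i-1} \longrightarrow \cdots$$

of $R$-modules is the $R$-module

$$H_i(M_\bullet) := \frac{\ker d_i}{\operatorname{im} d_{i+1}}.$$
⌟

That is, $H_i(M_\bullet)$ is a module capturing the ‘light gray annulus’ in the heuristic
picture of a complex. Of course

$$H_i(M_\bullet) = 0 \iff \operatorname{im} d_{i+1} = \ker d_i \iff \text{the complex } M_\bullet \text{ is exact at } M_i;$$

that is, the homology modules are a measure of the ‘failure of a complex to be
exact’.

Example 7.7. In fact, homology should be thought of as a (vast) generalization
of the notions of kernel and cokernel. Indeed, consider the (very) particular case in
which $M_\bullet$ is the complex

$$\cdots \longrightarrow 0 \longrightarrow M_1 \xrightarrow{\ \varphi\ } M_0 \longrightarrow 0 \longrightarrow \cdots.$$

Then

$$H_1(M_\bullet) = \ker\varphi, \qquad H_0(M_\bullet) = \operatorname{coker}\varphi.$$
⌟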
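For readers who like to compute, here is a minimal sketch of Definition 7.6 in the simplest possible setting. It assumes (this is our simplification, not the text's) that the ring is the field $\mathbb{Q}$, so that modules are vector spaces and $\dim H_i = \dim\ker d_i - \operatorname{rank} d_{i+1}$. The helper names `rank` and `homology_dim`, and the sample matrix `d1`, are hypothetical.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix with rational entries, by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def homology_dim(d_i, d_next, dim_M_i):
    """dim H_i = (dim M_i - rank d_i) - rank d_{i+1} over a field.

    d_i is the matrix of M_i -> M_{i-1}, or None if M_{i-1} = 0;
    d_next is the matrix of M_{i+1} -> M_i, or None if M_{i+1} = 0.
    """
    rank_d_i = rank(d_i) if d_i is not None else 0
    rank_d_next = rank(d_next) if d_next is not None else 0
    return (dim_M_i - rank_d_i) - rank_d_next

# A two-term complex 0 -> Q^3 --d1--> Q^3 -> 0, as in Example 7.7.
# (This d1 is the boundary map of a hollow triangle: columns are edges.)
d1 = [[-1, 0, -1],
      [1, -1, 0],
      [0, 1, 1]]

h1 = homology_dim(d1, None, 3)   # dimension of ker d1
h0 = homology_dim(None, d1, 3)   # dimension of coker d1
```

In this two-term case the computation is literally that of a kernel and a cokernel, in agreement with Example 7.7. Over a ring that is not a field, ranks alone do not determine the homology, so this sketch does not extend verbatim.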


We will end this very brief excursion into more abstract territories by indicating 
how a commutative diagram involving two short exact sequences generates a ‘long 
exact sequence’ in homology. This is actually a particular case of a more general 
construction — according to which a suitable commutative diagram involving three 
complexes yields a really long ‘long exact homology sequence’. We will come back to 
this general construction when we deal more extensively with homological algebra 
in Chapter IX. The reader is also likely to learn about it in a course on algebraic 
topology, where this fact is put to impressive use in studying invariants of manifolds.


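In the simple form we will analyze, this is affectionately known as the snake
lemma. Consider two short exact sequences linked by homomorphisms, so as to
form a commutative diagram^{34}:

$$\begin{array}{ccccccccc}
0 & \longrightarrow & L_1 & \xrightarrow{\ \alpha_1\ } & M_1 & \xrightarrow{\ \beta_1\ } & N_1 & \longrightarrow & 0 \\
 & & \downarrow{\scriptstyle\lambda} & & \downarrow{\scriptstyle\mu} & & \downarrow{\scriptstyle\nu} & & \\
0 & \longrightarrow & L_0 & \xrightarrow{\ \alpha_0\ } & M_0 & \xrightarrow{\ \beta_0\ } & N_0 & \longrightarrow & 0
\end{array}$$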

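Lemma 7.8 (The snake lemma). With notation as above, there is an exact sequence

$$0 \longrightarrow \ker\lambda \longrightarrow \ker\mu \longrightarrow \ker\nu \xrightarrow{\ \delta\ } \operatorname{coker}\lambda \longrightarrow \operatorname{coker}\mu \longrightarrow \operatorname{coker}\nu \longrightarrow 0.$$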


Remark 7.9. Most of the homomorphisms in this sequence are induced in a
completely straightforward way from the corresponding homomorphisms $\lambda$, $\mu$, $\nu$. The
one ‘surprising’ homomorphism is the one denoted $\delta$; we will discuss its definition
below. ⌟


Remark 7.10. In view of Example 7.7, we could have written the sequence in this
statement as

$$0 \to H_1(L_\bullet) \to H_1(M_\bullet) \to H_1(N_\bullet) \xrightarrow{\ \delta\ } H_0(L_\bullet) \to H_0(M_\bullet) \to H_0(N_\bullet) \to 0$$

where $L_\bullet$ is the complex $0 \to L_1 \xrightarrow{\lambda} L_0 \to 0$, etc. The snake lemma
generalizes to arbitrary complexes $L_\bullet$, $M_\bullet$, $N_\bullet$, producing a ‘long exact homology
sequence’ of which this is just the tail end. As mentioned above, we will discuss
this rather straightforward generalization later (§IX.3.3). ⌟

34 In fact, it is better to view this diagram as three (very short) complexes linked by
$R$-module homomorphisms $\alpha_i$, $\beta_i$ so that ‘the rows are exact’. In fact, one can define a category
of complexes, and this diagram is nothing but a ‘short exact sequence of complexes’; this is the
approach we will take in Chapter IX.

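Remark 7.11. A popular version of the snake lemma does not assume that $\alpha_1$ is
injective and $\beta_0$ is surjective: that is, we could consider a commutative diagram of
exact sequences

$$\begin{array}{ccccccccc}
 & & L_1 & \xrightarrow{\ \alpha_1\ } & M_1 & \xrightarrow{\ \beta_1\ } & N_1 & \longrightarrow & 0 \\
 & & \downarrow{\scriptstyle\lambda} & & \downarrow{\scriptstyle\mu} & & \downarrow{\scriptstyle\nu} & & \\
0 & \longrightarrow & L_0 & \xrightarrow{\ \alpha_0\ } & M_0 & \xrightarrow{\ \beta_0\ } & N_0 & &
\end{array}$$

The lemma will then state that there is ‘only’ an exact sequence

$$\ker\lambda \longrightarrow \ker\mu \longrightarrow \ker\nu \xrightarrow{\ \delta\ } \operatorname{coker}\lambda \longrightarrow \operatorname{coker}\mu \longrightarrow \operatorname{coker}\nu.$$
⌟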


Proving the snake lemma is something that should not be done in public, and 
it is notoriously useless to write down the details of the verification for others to 
read: the details are all essentially obvious, but they lead quickly to a notational 
quagmire. Such proofs are collectively known as the sport of diagram chase, best 
executed by pointing several fingers at different parts of a diagram on a blackboard,
while enunciating the elements one is manipulating and stating their fate^{35}.

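Nevertheless, we should explain where the ‘connecting’ homomorphism $\delta$ comes
from, since this is the heart of the statement of the snake lemma and of its proof.
Here is the whole diagram, including kernels and cokernels; thus, columns are exact
(as well as the two original sequences, placed horizontally):

$$\begin{array}{ccccccccc}
 & & 0 & & 0 & & 0 & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & \ker\lambda & \longrightarrow & \ker\mu & \longrightarrow & \ker\nu & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L_1 & \xrightarrow{\ \alpha_1\ } & M_1 & \xrightarrow{\ \beta_1\ } & N_1 & \longrightarrow & 0 \\
 & & \downarrow{\scriptstyle\lambda} & & \downarrow{\scriptstyle\mu} & & \downarrow{\scriptstyle\nu} & & \\
0 & \longrightarrow & L_0 & \xrightarrow{\ \alpha_0\ } & M_0 & \xrightarrow{\ \beta_0\ } & N_0 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & \operatorname{coker}\lambda & \longrightarrow & \operatorname{coker}\mu & \longrightarrow & \operatorname{coker}\nu & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & 0 & & 0 & & 0 & &
\end{array}$$

[Figure: in the drawn diagram the connecting homomorphism $\delta$ appears as a curved arrow
winding from $\ker\nu$ to $\operatorname{coker}\lambda$.]

By the way, we trust that the reader now sees why this lemma is called the snake
lemma.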


35 Real purists chase diagrams in arbitrary categories, thus without the benefit of talking
about ‘elements’, and we will practice this skill later on (Chapter IX). For example, the snake
lemma can be proven by appealing to universal property after universal property of kernels and
cokernels, without ever choosing elements anywhere. But the performing technique of pointing
fingers at a board while monologuing through the argument remains essentially the same.

Definition of the snaking homomorphism $\delta$. Let $a \in \ker\nu$. We claim that $a$
can be mapped through the diagram all the way to $\operatorname{coker}\lambda$, along the solid arrows
marked here:

[Figure: the full diagram above, with the path of the chase marked: $a \in \ker\nu$,
$b \in N_1$, $c \in M_1$, $d \in M_0$, $e \in L_0$, $f \in \operatorname{coker}\lambda$; the spot $N_0$ is marked $*$.]
Indeed,

• $\ker\nu \subseteq N_1$; so view $a$ as an element $b$ of $N_1$.

• $\beta_1$ is surjective, so $\exists c \in M_1$, mapping to $b$.

• Let $d = \mu(c)$ be the image of $c$ in $M_0$.

• What is the image of $d$ in the spot marked $*$? By the commutativity of the
diagram, it must be the same as $\nu(b)$. However, $b$ was the image in $N_1$ of
$a \in \ker\nu$, so $\nu(b) = 0$. Thus, $d \in \ker\beta_0$. Since rows are exact, $\ker\beta_0 = \operatorname{im}\alpha_0$;
therefore, $\exists e \in L_0$, mapping to $d$.

• Finally, let $f \in \operatorname{coker}\lambda$ be the image of $e$.

We want to set $\delta(a) := f$.

Is this legal? At two steps in the chase we have taken preimages:

• $\exists c \in M_1$ such that $\beta_1(c) = b$,

• $\exists e \in L_0$ such that $\alpha_0(e) = d$.

The second step does not involve a choice: $\alpha_0$ is injective by assumption,
so the element $e$ mapping to $d$ is uniquely determined by $d$. But there was a choice
involved in the first step: in order to verify that $\delta$ is well-defined, we have to show
that choosing some other $c$ would not affect the proposed value $f$ for $\delta(a)$.

This is proved by another chase. Here is the relevant part of the diagram:

[Figure: the portion of the diagram containing $c \mapsto b$ (via $\beta_1$), $c \mapsto d$ (via $\mu$),
$e \mapsto d$ (via $\alpha_0$), and $e \mapsto f$.]

Suppose we choose a different $c'$ mapping to the same $b$:

$$c' \mapsto b.$$

Then $\beta_1(c' - c) = 0$; by exactness, $\exists g \in L_1$ such that $(c' - c) = \alpha_1(g)$:

$$g \overset{\alpha_1}{\longmapsto} (c' - c) \overset{\beta_1}{\longmapsto} 0.$$

Now the point is that, since columns form complexes, $g$ dies in $\operatorname{coker}\lambda$:

$$g \overset{\lambda}{\longmapsto} \lambda(g) \longmapsto 0 \in \operatorname{coker}\lambda,$$

and it follows (by the commutativity of the diagram and the injectivity of $\alpha_0$) that
changing $c$ to $c'$ modifies $e$ to $e + \lambda(g)$ and $f$ to $f + 0 = f$. That is, $f$ is indeed
independent of the choice.

Thus $\delta$ is well-defined!
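The chase defining $\delta$ can be carried out mechanically in a small case. The sketch below rests on an instance we made up for illustration (it does not appear in the text): both rows are $0 \to \mathbb{Z} \xrightarrow{\cdot n} \mathbb{Z} \to \mathbb{Z}/n \to 0$, and $\lambda$, $\mu$, $\nu$ are all multiplication by $k$, with $n = 4$ and $k = 6$. The function names are hypothetical.

```python
# Hypothetical instance (not from the text): both rows are
#   0 -> Z --(*n)--> Z --(mod n)--> Z/n -> 0,
# and lambda, mu, nu are all 'multiplication by k'.
n, k = 4, 6

def beta1(c):
    """beta_1 : M_1 = Z -> N_1 = Z/n."""
    return c % n

def mu(c):
    """mu : M_1 = Z -> M_0 = Z."""
    return k * c

def alpha0_preimage(d):
    """The unique e in L_0 = Z with alpha_0(e) = n * e = d."""
    assert d % n == 0, "d must lie in im(alpha_0) = ker(beta_0)"
    return d // n

def to_coker_lambda(e):
    """Projection L_0 = Z -> coker(lambda) = Z/k."""
    return e % k

def delta(a, lift):
    """Chase a in ker(nu) to coker(lambda), using the chosen lift c of a."""
    assert beta1(lift) == a and (k * a) % n == 0
    d = mu(lift)
    e = alpha0_preimage(d)
    return to_coker_lambda(e)

ker_nu = [a for a in range(n) if (k * a) % n == 0]

# For each a in ker(nu), collect delta(a) over many different lifts c = a + n*g.
values = {a: {delta(a, a + n * g) for g in range(-5, 6)} for a in ker_nu}
```

Each set in `values` has a single element, which is the independence of the choice of $c$ argued above: replacing $c$ by $c + n g$ replaces $e$ by $e + k g = e + \lambda(g)$, which vanishes in $\operatorname{coker}\lambda$.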


This is a tiny part of the proof of the snake lemma, but it probably suf- 
fices to demonstrate why reading a written-out version of a diagram chase may 
be supremely uninformative. 

The rest of the proof (left to the reader(!) but we are not listing this as an
official exercise for fear that someone might actually turn a solution in for grading)
amounts to many, many similar arguments. The definition of the maps induced
on kernels and cokernels is substantially less challenging than the definition of the
connecting morphism $\delta$ described above. Exactness at most spots in the sequence

$$\ker\lambda \longrightarrow \ker\mu \longrightarrow \ker\nu \xrightarrow{\ \delta\ } \operatorname{coker}\lambda \longrightarrow \operatorname{coker}\mu \longrightarrow \operatorname{coker}\nu$$

is also reasonably straightforward; most of the work will go into proving exactness
at $\ker\nu$ and $\operatorname{coker}\lambda$.

Dear reader: don’t shy away from trying this, for it is excellent, indispens- 
able practice. Miss this opportunity and you will forever feel unsure about such 
manipulations. 


The snake lemma streamlines several facts, which would not be hard to prove
individually, but become really straightforward once the lemma is settled. For
example:

Corollary 7.12. In the same situation presented in the snake lemma (notation as
in §7.3), assume that $\mu$ is surjective and $\nu$ is injective. Then $\lambda$ is surjective and $\nu$
is an isomorphism.

Proof. Indeed, $\mu$ surjective $\implies \operatorname{coker}\mu = 0$; $\nu$ injective $\implies \ker\nu = 0$
(Proposition 6.2). Feeding this information into the sequence of the snake lemma gives an
exact sequence

$$0 \to \ker\lambda \to \ker\mu \to 0 \to \operatorname{coker}\lambda \to 0 \to \operatorname{coker}\nu \to 0.$$

Exactness implies $\operatorname{coker}\lambda = \operatorname{coker}\nu = 0$ (Exercise 7.1); hence $\lambda$ and $\nu$ are surjective,
with the stated consequences. □

Several more such statements may be proved just as easily; the reader should 
experiment to his or her heart’s content. 


Exercises 


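7.1. ▷ Assume that the complex

$$\cdots \longrightarrow 0 \longrightarrow M \longrightarrow 0 \longrightarrow \cdots$$

is exact. Prove that $M \cong 0$. [§7.3]

7.2. Assume that the complex

$$\cdots \longrightarrow 0 \longrightarrow M \longrightarrow M' \longrightarrow 0 \longrightarrow \cdots$$

is exact. Prove that $M \cong M'$.

7.3. Assume that the complex

$$\cdots \longrightarrow 0 \longrightarrow L \longrightarrow M \xrightarrow{\ \varphi\ } M' \longrightarrow N \longrightarrow 0 \longrightarrow \cdots$$

is exact. Show that, up to natural identifications, $L = \ker\varphi$ and $N = \operatorname{coker}\varphi$.

7.4. Construct short exact sequences of $\mathbb{Z}$-modules

$$0 \longrightarrow \mathbb{Z}^{\oplus\mathbb{N}} \longrightarrow \mathbb{Z}^{\oplus\mathbb{N}} \longrightarrow \mathbb{Z} \longrightarrow 0$$

and

$$0 \longrightarrow \mathbb{Z}^{\oplus\mathbb{N}} \longrightarrow \mathbb{Z}^{\oplus\mathbb{N}} \longrightarrow \mathbb{Z}^{\oplus\mathbb{N}} \longrightarrow 0.$$

(Hint: David Hilbert’s Grand Hotel.)

7.5. ▷ Assume that the complex

$$\cdots \longrightarrow L \longrightarrow M \longrightarrow N \longrightarrow \cdots$$

is exact and that $L$ and $N$ are Noetherian. Prove that $M$ is Noetherian. [§7.1]

7.6. ▷ Prove the ‘split epimorphism’ part of Proposition 7.5. [§7.2]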

7.7. ▷ Let

$$0 \longrightarrow M \longrightarrow N \longrightarrow P \longrightarrow 0$$

be a short exact sequence of $R$-modules, and let $L$ be an $R$-module.

(i) Prove that there is an exact sequence^{36}

$$0 \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(P, L) \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(N, L) \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(M, L).$$

(ii) Redo Exercise 6.17. (Use the exact sequence $0 \to I \to R \to R/I \to 0$.)

(iii) Construct an example showing that the rightmost homomorphism in (i) need
not be onto.

(iv) Show that if the original sequence splits, then the rightmost homomorphism
in (i) is onto.

[7.9, VIII.3.14, §VIII.5.1]

36 In general, this will be a sequence of abelian groups; if $R$ is commutative, so that each
$\mathrm{Hom}_{R\text{-}\mathsf{Mod}}$ is an $R$-module (§5.2), then it will be an exact sequence of $R$-modules.

7.8. ▷ Prove that every exact sequence

$$0 \longrightarrow M \longrightarrow N \longrightarrow F \longrightarrow 0$$

of $R$-modules, with $F$ free, splits. (Hint: Exercise 6.9.) [§VIII.5.4]

7.9. Let

$$0 \longrightarrow M \longrightarrow N \longrightarrow F \longrightarrow 0$$

be a short exact sequence of $R$-modules, with $F$ free, and let $L$ be an $R$-module.
Prove that there is an exact sequence

$$0 \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(F, L) \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(N, L) \longrightarrow \mathrm{Hom}_{R\text{-}\mathsf{Mod}}(M, L) \longrightarrow 0.$$

(Cf. Exercise 7.7.)

7.10. ▷ In the situation of the snake lemma, assume that $\lambda$ and $\nu$ are isomorphisms.
Use the snake lemma and prove that $\mu$ is an isomorphism. This is called the ‘short
five-lemma,’ as it follows immediately from the five-lemma (cf. Exercise 7.14), as
well as from the snake lemma. [VIII.6.21, IX.2.4]

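7.11. ▷ Let

$$(*)\qquad 0 \longrightarrow M_1 \longrightarrow N \longrightarrow M_2 \longrightarrow 0$$

be an exact sequence of $R$-modules. (This may be called an ‘extension’ of $M_2$
by $M_1$.) Suppose there is any $R$-module homomorphism $N \to M_1 \oplus M_2$ making
the diagram

$$\begin{array}{ccccccccc}
0 & \longrightarrow & M_1 & \longrightarrow & N & \longrightarrow & M_2 & \longrightarrow & 0 \\
 & & \| & & \downarrow & & \| & & \\
0 & \longrightarrow & M_1 & \longrightarrow & M_1 \oplus M_2 & \longrightarrow & M_2 & \longrightarrow & 0
\end{array}$$

commute, where the bottom sequence is the standard sequence of a direct sum.
Prove that (*) splits. [§7.2]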

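7.12. ¬ Practice your diagram chasing skills by proving the ‘four-lemma’: if

$$\begin{array}{ccccccc}
A_1 & \longrightarrow & B_1 & \longrightarrow & C_1 & \longrightarrow & D_1 \\
\downarrow{\scriptstyle\alpha} & & \downarrow{\scriptstyle\beta} & & \downarrow{\scriptstyle\gamma} & & \downarrow{\scriptstyle\delta} \\
A_0 & \longrightarrow & B_0 & \longrightarrow & C_0 & \longrightarrow & D_0
\end{array}$$

is a commutative diagram of $R$-modules with exact rows, $\alpha$ is an epimorphism, and
$\beta$, $\delta$ are monomorphisms, then $\gamma$ is a monomorphism. [7.13, IX.2.3]

7.13. Prove another^{37} version of the ‘four-lemma’ of Exercise 7.12: if

$$\begin{array}{ccccccc}
B_1 & \longrightarrow & C_1 & \longrightarrow & D_1 & \longrightarrow & E_1 \\
\downarrow{\scriptstyle\beta} & & \downarrow{\scriptstyle\gamma} & & \downarrow{\scriptstyle\delta} & & \downarrow{\scriptstyle\varepsilon} \\
B_0 & \longrightarrow & C_0 & \longrightarrow & D_0 & \longrightarrow & E_0
\end{array}$$

is a commutative diagram of $R$-modules with exact rows, $\beta$ and $\delta$ are epimorphisms,
and $\varepsilon$ is a monomorphism, then $\gamma$ is an epimorphism.

7.14. ¬ Prove the ‘five-lemma’: if

$$\begin{array}{ccccccccc}
A_1 & \longrightarrow & B_1 & \longrightarrow & C_1 & \longrightarrow & D_1 & \longrightarrow & E_1 \\
\downarrow{\scriptstyle\alpha} & & \downarrow{\scriptstyle\beta} & & \downarrow{\scriptstyle\gamma} & & \downarrow{\scriptstyle\delta} & & \downarrow{\scriptstyle\varepsilon} \\
A_0 & \longrightarrow & B_0 & \longrightarrow & C_0 & \longrightarrow & D_0 & \longrightarrow & E_0
\end{array}$$

is a commutative diagram of $R$-modules with exact rows, $\beta$ and $\delta$ are isomorphisms,
$\alpha$ is an epimorphism, and $\varepsilon$ is a monomorphism, then $\gamma$ is an isomorphism. (You can
avoid the needed diagram chase by pasting together results from previous exercises.)
[7.10]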

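7.15. ¬ Consider the following commutative diagram of $R$-modules:

$$\begin{array}{ccccccccc}
 & & 0 & & 0 & & 0 & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L_2 & \longrightarrow & M_2 & \longrightarrow & N_2 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow{\scriptstyle\alpha} & & \downarrow & & \\
0 & \longrightarrow & L_1 & \longrightarrow & M_1 & \longrightarrow & N_1 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow{\scriptstyle\beta} & & \downarrow & & \\
0 & \longrightarrow & L_0 & \longrightarrow & M_0 & \longrightarrow & N_0 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & 0 & & 0 & & 0 & &
\end{array}$$

Assume that the three rows are exact and the two rightmost columns are exact.
Prove that the left column is exact. Second version: assume that the three rows
are exact and the two leftmost columns are exact; prove that the right column is
exact. This is the ‘nine-lemma’. (You can avoid a diagram chase by applying the
snake lemma; for this, you will have to turn the diagram by 90°.) [7.16]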


7.16. In the same situation as in Exercise 7.15, assume that the three rows are
exact and that the leftmost and rightmost columns are exact.

• Prove that $\alpha$ is a monomorphism and $\beta$ is an epimorphism.

• Is the central column necessarily exact?
(Hint: No. Place $\mathbb{Z} \oplus \mathbb{Z}$ in the middle, and surround it artfully with six copies of $\mathbb{Z}$
and two 0’s.)

• Assume further that the central column is a complex (that is, $\beta \circ \alpha = 0$); prove
that it is then necessarily exact.

37 It is in fact unnecessary to prove both versions, but to realize this one has to view the
matter from the more general context of abelian categories; cf. Exercise IX.2.3.

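7.17. ¬ Generalize the previous two exercises as follows. Consider a (possibly
infinite) commutative diagram of $R$-modules:

$$\begin{array}{ccccccccc}
 & & \vdots & & \vdots & & \vdots & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L_{i+1} & \longrightarrow & M_{i+1} & \longrightarrow & N_{i+1} & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L_i & \longrightarrow & M_i & \longrightarrow & N_i & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L_{i-1} & \longrightarrow & M_{i-1} & \longrightarrow & N_{i-1} & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & \vdots & & \vdots & & \vdots & &
\end{array}$$

in which the central column is a complex and every row is exact. Prove that the
left and right columns are also complexes. Prove that if any two of the columns
are exact, so is the third. (The first part is straightforward. The second part
will take you a couple of minutes now due to the needed diagram chases, and a
couple of seconds later, once you learn about the long exact (co)homology sequence
in §IX.3.3.) [IX.3.12]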




Chapter IV 


Groups, second encounter 


In this chapter we return to Grp and study several topics of a less ‘general’ nature 
than those considered in Chapter II. Most of what we do here will apply exclusively 
to finite groups; this is an important example in its own right, as it has spectacular 
applications (for example, in Galois theory; cf. §VII.7), and it is a good subject 
from the expository point of view, since it gives us the opportunity to see several 
general concepts at work in a context that is complex enough to carry substance, 
but simple enough (in this tiny selection of elementary topics) to be appreciated 
easily. 


1. The conjugation action

1.1. Actions of groups on sets, reminder. Groups really shine when you let
them act on something. This section will make this point very effectively, since
we will get surprisingly precise results on finite groups by extremely simple-minded
applications of the elementary facts concerning group actions that we established
back in §II.9.

Recall that we proved (Proposition II.9.9) that every transitive (left-) action of
a group $G$ on a set $S$ is, up to a natural notion of isomorphism, ‘left-multiplication
on the set of left-cosets $G/H$’. Here, $H$ may be taken to be the stabilizer $\mathrm{Stab}_G(a)$
of any element $a \in S$, that is (Definition II.9.8) the subgroup of $G$ fixing $a$. This
fact applies to the orbits of every left-action of $G$ on a set; in particular, the number
of elements in a finite orbit $O$ equals the index of the stabilizer of any $a \in O$; hence
(Corollary II.9.10) the number of elements $|O|$ of an orbit must divide
the order $|G|$ of $G$, if $G$ is finite.

These considerations may be packaged into a useful ‘counting’ formula, which
we could call the class formula for that action; this name is usually reserved for the
particular case of the action of $G$ on itself by conjugation, which we will explore
more carefully below.

In order to state the formula, assume $G$ acts on a set $S$; for $a \in S$, let $G_a$
denote the stabilizer $\mathrm{Stab}_G(a)$. Also, let $Z$ be the set of fixed points of the action:

$$Z = \{a \in S \mid (\forall g \in G) : ga = a\}.$$

Note that $a \in Z \iff G_a = G$; we could say that $a \in Z$ if and only if the orbit of
$a$ is ‘trivial’, in the sense that it consists of $a$ alone.

Proposition 1.1. Let $S$ be a finite set, and let $G$ be a group acting on $S$. With
notation as above,

$$|S| = |Z| + \sum_{a \in A} [G : G_a],$$

where $A \subseteq S$ has exactly one element for each nontrivial orbit of the action.

Proof. The orbits form a partition of $S$, and $Z$ collects the trivial orbits; hence

$$|S| = |Z| + \sum_{a \in A} |O_a|,$$

where $O_a$ denotes the orbit of $a$. By Proposition II.9.9, the order $|O_a|$ equals the
index of the stabilizer of $a$, yielding the statement. □
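The formula is easy to watch in action. The sketch below uses an example of our own choosing, not one from the text: the cyclic group $\mathbb{Z}/4$ acting by rotation on the 16 binary strings of length 4. All names in the code are hypothetical.

```python
from itertools import product

def rotate(s, k):
    """Action of k in Z/len(s) on the tuple s by cyclic rotation."""
    k %= len(s)
    return s[k:] + s[:k]

n = 4
G = range(n)                           # the cyclic group Z/4, written additively
S = list(product((0, 1), repeat=n))    # the 16 binary strings of length 4

# Fixed points: strings unchanged by every rotation.
Z = [s for s in S if all(rotate(s, g) == s for g in G)]

# Orbits, and among them the nontrivial ones.
orbits = {frozenset(rotate(s, g) for g in G) for s in S}
nontrivial = [orbit for orbit in orbits if len(orbit) > 1]

def index_of_stabilizer(s):
    """[G : G_s], computed as |G| / |G_s|."""
    stabilizer = [g for g in G if rotate(s, g) == s]
    return n // len(stabilizer)

# One representative a per nontrivial orbit, then |Z| + sum of [G : G_a].
A = [min(orbit) for orbit in nontrivial]
rhs = len(Z) + sum(index_of_stabilizer(a) for a in A)
```

Here the two constant strings are the fixed points, and the formula reads $16 = 2 + 2 + 4 + 4 + 4$.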

The main strength of Proposition 1.1 rests in the fact that, if $G$ is finite, each
summand $[G : G_a]$ divides the order of $G$ (and is $> 1$). This can be a strong
constraint, when some information is known about $|G|$. For example, let’s see what
this says when $G$ is a $p$-group:

Definition 1.2. A $p$-group is a finite group whose order is a power of a prime
integer $p$. ⌟

Corollary 1.3. Let $G$ be a $p$-group acting on a finite set $S$, and let $Z$ be the fixed
point set of the action. Then

$$|Z| \equiv |S| \mod p.$$

Proof. Indeed, each summand $[G : G_a]$ in Proposition 1.1 is a power of $p$ larger
than 1; hence it is 0 mod $p$. □
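As a quick sanity check of this congruence, here is a sketch on a family of examples we picked for convenience (it is not taken from the text): the $p$-group $\mathbb{Z}/p$ acts by cyclic rotation on the set $S$ of $p$-tuples over an alphabet with $m$ letters, so that $|S| = m^p$. The function name is hypothetical.

```python
from itertools import product

def fixed_point_count(p, m):
    """Count p-tuples over an m-letter alphabet fixed by every cyclic rotation."""
    S = product(range(m), repeat=p)
    return sum(1 for s in S if all(s[k:] + s[:k] == s for k in range(p)))

checks = []
for p in (2, 3, 5):
    for m in (1, 2, 3, 4):
        size_S = m ** p
        size_Z = fixed_point_count(p, m)
        # Corollary 1.3: |Z| and |S| agree modulo p.
        checks.append((size_Z - size_S) % p == 0)
```

The fixed points are exactly the constant tuples, so $|Z| = m$, and in this family the congruence says $m \equiv m^p \bmod p$, which is Fermat's little theorem.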

For instance, in certain situations this can be used to establish^{1} that $Z \neq \emptyset$: see
Exercise 1.1. Such immediate consequences of Proposition 1.1 will assist us below,
in the proof of Sylow’s theorems.


1 In this sense, Proposition 1.1 is an instance of a class of results known as ‘fixed point
theorems’. The reader will likely encounter a few such theorems in topology courses, where the
role of the ‘size’ of a set may be played by (for example) the Euler characteristic of a topological
space.

1.2. Center, centralizer, conjugacy classes. Recall (Example II.9.3) that every
group $G$ acts on itself in at least two interesting ways: by (left-) multiplication
and by conjugation. The latter action is defined by the following $\rho : G \times G \to G$:

$$\rho(g, a) = gag^{-1}.$$

As we know (§II.9.2), this datum is equivalent to the datum of a certain group
homomorphism:

$$\sigma : G \to S_G$$

from $G$ to the permutation group on $G$.

This action highlights several interesting objects:

Definition 1.4. The center of $G$, denoted $Z(G)$, is the subgroup $\ker\sigma$ of $G$. ⌟

Concretely, the center^{2} of $G$ is

$$Z(G) = \{g \in G \mid (\forall a \in G) : ga = ag\}.$$

Indeed, $\sigma(g)$ is the identity in $S_G$ if and only if $\sigma(g)$ acts as the identity on $G$; that
is, if and only if $gag^{-1} = a$ for all $a \in G$; that is, if and only if $g$ commutes with all
elements of $G$. In other words, the center is the set of fixed points in $G$ under the
conjugation action.

Note that the center of a group $G$ is automatically normal in $G$: this is nearly
immediate to check ‘by hand’, but there is no need to do so since it is a kernel by
definition and kernels are normal.

A group $G$ is commutative if and only if $Z(G) = G$, that is, if and only if the
conjugation action is trivial on $G$.
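The description of the center as the fixed-point set of the conjugation action translates directly into a brute-force computation. The following sketch assumes a representation we chose ourselves: group elements are permutations stored as tuples, and the helper names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """(p o q)(i) = p[q[i]] for permutations stored as tuples."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def center(group):
    """Fixed points of conjugation: all a with g a g^-1 = a for every g."""
    return {a for a in group
            if all(compose(compose(g, a), inverse(g)) == a for g in group)}

S3 = set(permutations(range(3)))                                # symmetric group
C4 = {tuple((i + s) % 4 for i in range(4)) for s in range(4)}   # rotations, cyclic of order 4
```

With these definitions, `center(S3)` contains only the identity, while `center(C4)` is all of `C4`, as it must be for a commutative group.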

In general, feel happy when you discover that the center of a group is not trivial:
this will often allow you to set up proofs by induction on the number of elements
of the group, by mod-ing out by the center (this is, roughly, how we will prove the
first Sylow theorem). Or note the following useful fact, which comes in handy when
trying to prove that a group is commutative:

Lemma 1.5. Let $G$ be a finite group, and assume $G/Z(G)$ is cyclic. Then $G$ is
commutative (and hence $G/Z(G)$ is in fact trivial).

Proof. (Cf. Exercise 1.5.) As $G/Z(G)$ is cyclic, there exists an element $g \in G$ such
that the class $gZ(G)$ generates $G/Z(G)$. Then $\forall a \in G$

$$aZ(G) = (gZ(G))^r$$

for some $r \in \mathbb{Z}$; that is, there is an element $z \in Z(G)$ of the center such that
$a = g^r z$.

If now $a$, $b$ are in $G$, use this fact to write

$$a = g^r z, \qquad b = g^s w$$

for some $r, s \in \mathbb{Z}$ and $z, w \in Z(G)$; but then

$$ab = (g^r z)(g^s w) = g^{r+s} zw = (g^s w)(g^r z) = ba,$$

where we have used the fact that $z$ and $w$ commute with every element of $G$. As $a$
and $b$ were arbitrary, this proves that $G$ is commutative. □

2 Why ‘Z’? ‘Center’ is Zentrum auf Deutsch.

Next, the stabilizer of $a \in G$ under conjugation has a special name:

Definition 1.6. The centralizer (or normalizer) $Z_G(a)$ of $a \in G$ is its stabilizer
under conjugation. ⌟

Thus,

$$Z_G(a) = \{g \in G \mid gag^{-1} = a\} = \{g \in G \mid ga = ag\}$$

consists of those elements in $G$ which commute with $a$. In particular, $Z(G) \subseteq Z_G(a)$
for all $a \in G$; in fact, $Z(G) = \bigcap_{a \in G} Z_G(a)$. Clearly $a \in Z(G) \iff Z_G(a) = G$.

If there is no ambiguity concerning the group $G$ containing $a$, the index $G$ may
be dropped.

Definition 1.7. The conjugacy class of $a \in G$ is the orbit $[a]$ of $a$ under the
conjugation action. Two elements $a$, $b$ of $G$ are conjugate if they belong to the
same conjugacy class. ⌟

The notation $[a]$ is not standard; $C(a)$ is used more frequently, but we are not
fond of it. Using $[a]$ reminds us that these are nothing but the equivalence classes
of elements of $G$ under a certain interesting equivalence relation.

Note that $[a] = \{a\}$ if and only if $gag^{-1} = a$ for all $g \in G$; that is, if and only
if $ga = ag$ for all $g \in G$; that is, if and only if $a \in Z(G)$.

1.3. The Class Formula. The ‘official’ Class Formula for a finite group $G$ is the
particular case of Proposition 1.1 for the conjugation action.

Proposition 1.8 (Class formula). Let $G$ be a finite group. Then

$$|G| = |Z(G)| + \sum_{a \in A} [G : Z(a)],$$

where $A \subseteq G$ is a set containing one representative for each nontrivial conjugacy
class in $G$.

Proof. The set of fixed points is $Z(G)$, and the stabilizer of $a$ is the centralizer $Z(a)$;
apply Proposition 1.1. □

The class formula is surprisingly useful. In applying it, keep in mind that every
summand on the right (that is, both $|Z(G)|$ and each $[G : Z(a)]$) is a divisor of $|G|$;
this fact alone often suffices to draw striking conclusions about $G$.

Possibly the most famous such application is to p-groups, via Corollary 1.3: 

Corollary 1.9. Let G be a nontrivial p-group. Then G has a nontrivial center. 

Proof. Since $|Z(G)| \equiv |G| \mod p$ and $|G| > 1$ is a power of $p$, necessarily $|Z(G)|$ is
a multiple of $p$. As $Z(G) \neq \emptyset$ (since $e_G \in Z(G)$), this implies $|Z(G)| \geq p$. □

For example, it follows immediately (from Corollary 1.9 and Lemma 1.5; cf.
Exercise 1.6) that if $p$ is prime, then every group of order $p^2$ is commutative.

In general, the class formula poses a strong constraint on what can go on in a 
group. 

Example 1.10. Consider a group $G$ of order 6; what are the possibilities for its
class formula?

If $G$ is commutative, then the class formula will tell us very little:

$$6 = 6.$$

If $G$ is not commutative, then its center must be trivial (as a consequence of
Lagrange’s theorem and Lemma 1.5); so the class formula is $6 = 1 + \cdots$, where $\cdots$
collects the sizes of the nontrivial conjugacy classes. But each of these summands
must be larger than 1, smaller than 6, and must divide 6; that is, there are no
choices:

$$6 = 1 + 2 + 3$$

is the only possibility. The reader should check that this is indeed the class formula
for $S_3$; in fact, $S_3$ is the only noncommutative group of order 6 up to isomorphism
(Exercise 1.13). ⌟
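The check suggested in this example can be done by brute force. The sketch below assumes our own encoding of permutations as tuples, with hypothetical helper names; it computes the size of the center and the sizes of the nontrivial conjugacy classes, that is, the summands of the class formula.

```python
from itertools import permutations

def compose(p, q):
    """(p o q)(i) = p[q[i]] for permutations stored as tuples."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def class_formula(group):
    """Return (|Z(G)|, sorted sizes of the nontrivial conjugacy classes)."""
    classes = {frozenset(compose(compose(g, a), inverse(g)) for g in group)
               for a in group}
    center_size = sum(1 for c in classes if len(c) == 1)
    return center_size, sorted(len(c) for c in classes if len(c) > 1)

S3 = list(permutations(range(3)))
S4 = list(permutations(range(4)))
```

For `S3` this returns a center of size 1 and class sizes 2 and 3, matching $6 = 1 + 2 + 3$; for `S4` one finds $24 = 1 + 3 + 6 + 6 + 8$, with every summand a divisor of 24 as Proposition 1.8 requires.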

Another useful observation is that normal subgroups must be unions of
conjugacy classes: because if $H$ is a normal subgroup, $a \in H$, and $b = gag^{-1}$ is conjugate
to $a$, then

$$b \in gHg^{-1} = H.$$

To stick with the $|G| = 6$ example, note that every subgroup of a group must
contain the identity and its size must divide the order of the group; it follows that
a normal subgroup of a noncommutative group of order 6 cannot have order 2, since
2 cannot be written as a sum of orders of conjugacy classes (including the class of
the identity).

1.4. Conjugation of subsets and subgroups. We may also act by conjugation
on the power set of $G$: if $A \subseteq G$ is a subset and $g \in G$, the conjugate of $A$ is
the subset $gAg^{-1}$. By cancellation, the conjugation map $a \mapsto gag^{-1}$ is a bijection
between $A$ and $gAg^{-1}$.

This leads to terminology analogous to the one introduced in §1.2.

Definition 1.11. The normalizer $N_G(A)$ of $A$ is its stabilizer under conjugation.
The centralizer of $A$ is the subgroup $Z_G(A) \subseteq N_G(A)$ fixing each element of $A$. ⌟

Thus, $g \in N_G(A)$ if and only if^{3} $gAg^{-1} = A$, and $g \in Z_G(A)$ if and only if
$\forall a \in A$, $gag^{-1} = a$.

For $A = \{a\}$ a singleton, we have $N_G(\{a\}) = Z_G(\{a\}) = Z_G(a)$. In general,
$Z_G(A) \subsetneq N_G(A)$.

If $H$ is a subgroup of $G$, every conjugate $gHg^{-1}$ of $H$ is also a subgroup of $G$;
conjugate subgroups have the same order.

3 If $A$ is finite (but not in general), this condition is equivalent to $gAg^{-1} \subseteq A$.

Remark 1.12. The definition implies immediately that $H \subseteq N_G(H)$ and that $H$
is normal in $G$ if and only if $N_G(H) = G$. More generally, the normalizer $N_G(H)$
of $H$ in $G$ is (clearly) the largest subgroup of $G$ in which $H$ is normal. ⌟

One could apply Proposition 1.1 to the conjugation action on subsets or
subgroups; however, there are too many subsets, and one has little control over the
number of subgroups. Other numerical considerations involving the number of
conjugates of a given subset or subgroup may be very useful.

Lemma 1.13. Let $H \subseteq G$ be a subgroup. Then (if finite) the number of subgroups
conjugate to $H$ equals the index $[G : N_G(H)]$ of the normalizer of $H$ in $G$.

Proof. This is again an immediate consequence of Proposition II.9.9. □

Corollary 1.14. If $[G : H]$ is finite, then the number of subgroups conjugate to $H$
is finite and divides $[G : H]$.

Proof.

$$[G : H] = [G : N_G(H)] \cdot [N_G(H) : H]$$

(cf. §II.8.5). □
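Lemma 1.13 and Corollary 1.14 can be checked by hand in $S_3$, or by the following brute-force sketch. As before, the encoding of permutations as tuples, the helper names, and the choice of the two subgroups are ours, for illustration only.

```python
from itertools import permutations

def compose(p, q):
    """(p o q)(i) = p[q[i]] for permutations stored as tuples."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugate(g, H):
    """The subset g H g^-1."""
    return frozenset(compose(compose(g, h), inverse(g)) for h in H)

def conjugates_and_normalizer_index(G, H):
    """Return (number of conjugates of H, index [G : N_G(H)])."""
    H = frozenset(H)
    conjugates = {conjugate(g, H) for g in G}
    normalizer = [g for g in G if conjugate(g, H) == H]
    return len(conjugates), len(G) // len(normalizer)

S3 = list(permutations(range(3)))
H = [(0, 1, 2), (1, 0, 2)]               # generated by a transposition
A3 = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]   # the rotations, a normal subgroup
```

For `H` both numbers equal 3, which indeed divides $[S_3 : H] = 3$; for the normal subgroup `A3` both equal 1, in agreement with Remark 1.12.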


One of the celebrated Sylow theorems will strengthen this statement
substantially in the case in which $H$ is a maximal $p$-group contained in a finite group $G$.
For a statement concerning the size of the normalizer of an arbitrary $p$-subgroup of
a group, see Lemma 2.9.

Another useful numerical tool is the observation that if $H$ and $K$ are subgroups
of a group $G$ and $H \subseteq N_G(K)$ (so that $gKg^{-1} = K$ for all $g \in H$), then
conjugation by $g \in H$ gives an automorphism of $K$. Indeed, we have already observed that
conjugation is a bijection, and it is immediate to see that it is a homomorphism:
$\forall k_1, k_2 \in K$,

$$(gk_1g^{-1})(gk_2g^{-1}) = gk_1(g^{-1}g)k_2g^{-1} = g(k_1k_2)g^{-1}.$$

Thus, conjugation gives a set-function

$$\gamma : H \to \mathrm{Aut}_{\mathsf{Grp}}(K).$$

The reader will check that this is a group homomorphism and will determine $\ker\gamma$
(Exercise 1.21).

This is especially useful if $H$ is finite and some information is available
concerning $\mathrm{Aut}_{\mathsf{Grp}}(K)$ (for an example, see Exercise 4.14). A classic application is
presented in Exercise 1.22.


Exercises


1.1. ▷ Let $p$ be a prime integer, let $G$ be a $p$-group, and let $S$ be a set such that
$|S| \not\equiv 0 \mod p$. If $G$ acts on $S$, prove that the action must have fixed points. [§1.1,
§2.3]

1.2. Find the center of $D_{2n}$. (The answer depends on the parity of $n$. Use a
presentation.)

1.3. Prove that the center of $S_n$ is trivial for $n \geq 3$. (Suppose that $\sigma \in S_n$ sends
$a$ to $b \neq a$, and let $c \neq a, b$. Let $\tau$ be the permutation that acts solely by swapping
$b$ and $c$. Then compare the action of $\sigma\tau$ and $\tau\sigma$ on $a$.)

1.4. ▷ Let $G$ be a group, and let $N$ be a subgroup of $Z(G)$. Prove that $N$ is normal
in $G$. [§2.2]

1.5. ▷ Let $G$ be a group. Prove that $G/Z(G)$ is isomorphic to the group $\mathrm{Inn}(G)$ of
inner automorphisms of $G$. (Cf. Exercise II.4.8.) Then prove Lemma 1.5 again by
using the result of Exercise II.6.7. [§1.2]

1.6. ▷ Let $p$, $q$ be prime integers, and let $G$ be a group of order $pq$. Prove that
either $G$ is commutative or the center of $G$ is trivial. Conclude (using Corollary 1.9)
that every group of order $p^2$, for a prime $p$, is commutative. [§1.3]

1.7. Prove or disprove that if $p$ is prime, then every group of order $p^3$ is
commutative.

1.8. ▷ Let $p$ be a prime number, and let $G$ be a $p$-group: $|G| = p^r$. Prove that $G$
contains a normal subgroup of order $p^k$ for every nonnegative $k \leq r$. [§2.2]

1.9. ¬ Let $p$ be a prime number, $G$ a $p$-group, and $H$ a nontrivial normal subgroup
of $G$. Prove that $H \cap Z(G) \neq \{e\}$. (Hint: Use the class formula.) [3.11]

1.10. Prove that if $G$ is a group of odd order and $g \in G$ is conjugate to $g^{-1}$, then
$g = e_G$.

1.11. Let $G$ be a finite group, and suppose there exist representatives $g_1, \ldots, g_r$ of
the $r$ distinct conjugacy classes in $G$, such that $\forall i, j$, $g_i g_j = g_j g_i$. Prove that $G$ is
commutative. (Hint: What can you say about the sizes of the conjugacy classes?)

1.12. Verify that the class formula for both $D_8$ and $Q_8$ (cf. Exercise III.1.12) is
$8 = 2 + 2 + 2 + 2$. (Also note that $D_8 \not\cong Q_8$.)

1.13. ▷ Let $G$ be a noncommutative group of order 6. As observed in Example 1.10,
$G$ must have trivial center and exactly two nontrivial conjugacy classes, of order 2 and 3.

• Prove that if every element of a group has order $\leq 2$, then the group is
commutative. Conclude that $G$ has an element $y$ of order 3.

• Prove that $\langle y \rangle$ is normal in $G$.

• Prove that $[y]$ is the conjugacy class of order 2 and $[y] = \{y, y^2\}$.

• Prove that there is an $x \in G$ such that $yx = xy^2$.

• Prove that $x$ has order 2.

• Prove that $x$ and $y$ generate $G$.

• Prove that $G \cong S_3$.

[§1.3, §2.5]

1.14. Let $G$ be a group, and assume $[G : Z(G)] = n$ is finite. Let $A \subseteq G$ be any
subset. Prove that the number of conjugates of $A$ is at most $n$.

1.15. Suppose that the class formula for a group $G$ is $60 = 1 + 15 + 20 + 12 + 12$.
Prove that the only normal subgroups of $G$ are $\{e\}$ and $G$.

1.16. ▷ Let $G$ be a finite group, and let $H \subseteq G$ be a subgroup of index 2. For
$a \in H$, denote by $[a]_H$, resp., $[a]_G$, the conjugacy class of $a$ in $H$, resp., $G$. Prove
that either $[a]_H = [a]_G$ or $[a]_H$ is half the size of $[a]_G$, according to whether the
centralizer $Z_G(a)$ is not or is contained in $H$. (Hint: Note that $H$ is normal in $G$,
by Exercise II.8.2; apply Proposition II.8.11.) [§4.4]

1.17. ¬ Let $H$ be a proper subgroup of a finite group $G$. Prove that $G$ is not the
union of the conjugates of $H$. (Hint: You know the number of conjugates of $H$;
keep in mind that any two subgroups overlap, at least at the identity.) [1.18, 1.20]

1.18. Let $S$ be a set endowed with a transitive action of a finite group $G$, and
assume $|S| \geq 2$. Prove that there exists a $g \in G$ without fixed points in $S$, that
is, such that $gs \neq s$ for all $s \in S$. (Hint: By Proposition II.9.9, you may assume
$S = G/H$, with $H$ proper in $G$. Use Exercise 1.17.)

1.19. Let $H$ be a proper subgroup of a finite group $G$. Prove that there exists a
$g \in G$ whose conjugacy class is disjoint from $H$.

1.20. Let $G = \mathrm{GL}_2(\mathbb{C})$, and let $H$ be the subgroup consisting of upper triangular
matrices (Exercise II.6.2). Prove that $G$ is the union of the conjugates of $H$. Thus,
the finiteness hypothesis in Exercise 1.17 is necessary. (Hint: Equivalently, prove
that every $2 \times 2$ matrix is conjugate to a matrix in $H$. You will use the fact that
$\mathbb{C}$ is algebraically closed: see Example III.4.14.)

1.21. ▷ Let $H$, $K$ be subgroups of a group $G$, with $H \subseteq N_G(K)$. Verify that the
function $\gamma : H \to \mathrm{Aut}_{\mathsf{Grp}}(K)$ defined by conjugation is a homomorphism of groups
and that $\ker\gamma = H \cap Z_G(K)$, where $Z_G(K)$ is the centralizer of $K$. [§1.4, 1.22]

1.22. ▷ Let $G$ be a finite group, and let $H$ be a cyclic subgroup of $G$ of order $p$.
Assume that $p$ is the smallest prime dividing the order of $G$ and that $H$ is normal
in $G$. Prove that $H$ is contained in the center of $G$.

(Hint: By Exercise 1.21 there is a homomorphism $\gamma : G \to \mathrm{Aut}_{\mathsf{Grp}}(H)$; by
Exercise II.4.14, $\mathrm{Aut}_{\mathsf{Grp}}(H)$ has order $p - 1$. What can you say about $\gamma$?) [§1.4]


2. The Sylow theorems 

2.1. Cauchy’s theorem. The ‘Sylow theorems’ consist of three statements con- 
cerning p-subgroups (cf. Definition 1.2) of a given finite group G. The form we will 
give for the first of these statements will tell us that G contains p-groups of all sizes
allowed by Lagrange’s theorem: if p is a prime and $p^k$ divides |G|, then G contains
a subgroup of order $p^k$. The proof of this statement is an easy induction, provided
the statement for k = 1 is known: that is, provided that one has established 

Theorem 2.1 (Cauchy’s theorem). Let G be a finite group, and let p be a prime 
divisor of |G|. Then G contains an element of order p. 

As it happens, only the abelian version of this statement is needed for the proof 
of the first Sylow theorem; then the full statement of Cauchy’s theorem follows 
from the first Sylow theorem itself. Since the (diligent) reader has already proved 
Cauchy’s theorem for abelian groups (in Exercise II.8.17), we could directly move 
on to Sylow theorems. 

However, there is a quick proof⁴ of the full statement of Cauchy’s theorem
which does not rely on Sylow and is a good illustration of the power of the general 
‘class formula for arbitrary actions’ (Proposition 1.1). We will present this proof, 
while also encouraging the reader to go back and (re)do Exercise II.8.17 now. 

Proof of Theorem 2.1. Consider the set S of p-tuples of elements of G:

$\{a_1, \dots, a_p\}$

such that $a_1 \cdots a_p = e$. We claim that $|S| = |G|^{p-1}$: indeed, once $a_1, \dots, a_{p-1}$ are
chosen (arbitrarily), then $a_p$ is determined as it is the inverse of $a_1 \cdots a_{p-1}$.
Therefore, p divides the order of S as it divides the order of G.

Also note that if $a_1 \cdots a_p = e$, then

$a_2 \cdots a_p a_1 = e$

(even if G is not commutative): because if $a_1$ is a left-inverse to $a_2 \cdots a_p$, then it is
also a right-inverse to it.

Therefore, we may act with the group Z/pZ on S: given [m] in Z/pZ, with
$0 \le m < p$, act by [m] on

$\{a_1, \dots, a_p\}$

by sending it to

$\{a_{m+1}, \dots, a_p, a_1, \dots, a_m\}$:

as we just observed, this is still an element of S.

Now Corollary 1.3 implies

$|Z| \equiv |S| \equiv 0 \bmod p$,

where Z is the set of fixed points of this action. Fixed points are p-tuples of the
form

(*) $\{a, \dots, a\}$;

and note that $Z \ne \emptyset$, since $\{e, \dots, e\} \in Z$. Since $p \ge 2$ and p divides |Z|, we
conclude that |Z| > 1; therefore there exists some element in Z of the form (*),
with $a \ne e$.

This says that there exists an $a \in G$, $a \ne e$, such that $a^p = e$, proving the
statement. □

⁴ This argument is apparently due to James McKay.
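The counting in this proof can be checked by brute force on a small group. The following Python sketch is only an illustration, not part of the argument: it takes G to be the symmetric group on n letters (an arbitrary choice made here), and the helper names `compose` and `mckay_count` are hypothetical. It lists the p-tuples with product e, confirms that there are $|G|^{p-1}$ of them, and counts the tuples fixed by cyclic rotation, which are the constant ones.

```python
from itertools import permutations, product

def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def mckay_count(n, p):
    """Return (|G|, |S|, |Z|) for G = S_n and a prime p, by exhaustive search."""
    G = list(permutations(range(n)))
    e = tuple(range(n))
    S = []
    for t in product(G, repeat=p):
        prod_ = e
        for a in t:
            prod_ = compose(prod_, a)
        if prod_ == e:
            S.append(t)
    # A tuple is fixed by every cyclic rotation exactly when all entries agree.
    Z = [t for t in S if len(set(t)) == 1]
    return len(G), len(S), len(Z)

if __name__ == "__main__":
    for n, p in [(3, 2), (3, 3)]:
        order, size_S, size_Z = mckay_count(n, p)
        print(n, p, order, size_S, size_Z, size_Z % p)
```

In each case the number of fixed points is a positive multiple of p, so some constant tuple other than the one built from e must occur, which is the element of order p the theorem asks for.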

We should remark that the proof given here proves a more precise result than 
the raw statement of Theorem 2.1: every element of order p in G generates a cyclic 
subgroup of G of order p, and we are able to say something about the number of 
such subgroups. 

Claim 2.2. Let G be a finite group, let p be a prime divisor of |G|, and let N be
the number of cyclic subgroups of G of order p. Then $N \equiv 1 \bmod p$.

The proof of this fact is left to the reader (as an incentive to really understand 
the proof of Theorem 2.1). 
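Without giving the proof away, the claim can at least be tested numerically. The sketch below is an illustration under stated assumptions: it works in symmetric groups (chosen here only for convenience), the function names are hypothetical, and it uses the standard fact that every subgroup of prime order p contains exactly p − 1 elements of order p, with distinct such subgroups sharing only the identity.

```python
from itertools import permutations

def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def order(p):
    """Order of the permutation p."""
    e = tuple(range(len(p)))
    k, x = 1, p
    while x != e:
        x = compose(x, p)
        k += 1
    return k

def cyclic_subgroups_of_order(n, p):
    """Number of subgroups of prime order p in the symmetric group S_n."""
    elements = sum(1 for g in permutations(range(n)) if order(g) == p)
    # Each subgroup of order p accounts for p - 1 elements of order p.
    return elements // (p - 1)

if __name__ == "__main__":
    for n, p in [(4, 2), (4, 3), (5, 5)]:
        N = cyclic_subgroups_of_order(n, p)
        print(n, p, N, N % p)
```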

Claim 2.2, coupled with the simple observation that if there is only 1 cyclic 
subgroup H of order p, then that subgroup must be normal (Exercise 2.2), suffices 
for interesting applications. 

Definition 2.3. A group G is simple if its only normal subgroups are {e} and G
itself. ⌟

Simple groups occupy a special place in the theory of groups: one can ‘break
up’ any finite group into basic constituents which are simple groups; we will see
how this is done in §3.1. Thus, it is important to be able to tell whether a group is
simple or not⁵.

Example 2.4. Let p be a positive prime integer. If |G| = mp, with 1 < m < p, 
then G is not simple. 

Indeed, consider the subgroups of G with p elements. By Claim 2.2, the number
of such subgroups is $\equiv 1 \bmod p$. Thus, if there is more than one such subgroup,
then there must be at least p + 1. Any two distinct subgroups of prime order can
only meet at the identity (why?); therefore this would account for at least

$1 + (p+1)(p-1) = p^2$

elements in G. Since $|G| = mp < p^2$, this is impossible. Therefore there is only
one cyclic subgroup of order p in G, which must be normal as mentioned above,
proving that G is not simple. ⌟
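To see how many small orders this example already rules out, one can list the integers of the form mp with p prime and 1 < m < p. The short Python sketch below does this; the function names and the bound 60 are choices made here for illustration.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    return n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))

def orders_covered(bound):
    """Orders n < bound of the form n = m*p with p prime and 1 < m < p."""
    found = set()
    for p in range(2, bound):
        if is_prime(p):
            for m in range(2, p):
                if m * p < bound:
                    found.add(m * p)
    return sorted(found)

if __name__ == "__main__":
    print(orders_covered(60))
```

Below 60 this produces exactly the orders listed in Exercise 2.7, whose hint points back to this example.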

2.2. Sylow I. Let p be a prime integer. A p-Sylow subgroup of a finite group G
is a subgroup of order $p^r$, where $|G| = p^r m$ and (p, m) = 1. That is, $P \subseteq G$ is a
p-Sylow subgroup if it is a p-group and p does not divide [G : P].

If p does not divide the order of G, then G contains a p-Sylow subgroup:
namely, {e}. This is not very interesting; what is interesting is that G contains a 
p-Sylow subgroup even when p does divide the order of G: 

Theorem 2.5 (First Sylow theorem). Every finite group contains a p-Sylow sub- 
group, for all primes p. 

⁵ In fact, a complete list of all finite simple groups is known: this is the classification result
mentioned at the end of §II.6.3, arguably one of the deepest and hardest results in mathematics.
The first Sylow theorem follows from the seemingly stronger statement:

Proposition 2.6. If $p^k$ divides the order of G, then G has a subgroup of order $p^k$.

The statements are actually easily seen to be equivalent, by Exercise 1.8; in 
any case, the standard argument proving Theorem 2.5 proves Proposition 2.6, and 
we see no reason to hide this fact. Here is the argument: 

Proof of Proposition 2.6. If k = 0, there is nothing to prove, so we may assume
$k \ge 1$ and in particular that |G| is a multiple of p.

Argue by induction on |G|: if |G| = p, again there is nothing to prove; if |G| > p
and G contains a proper subgroup H such that [G : H] is relatively prime to p,
then $p^k$ divides the order of H, and hence H contains a subgroup of order $p^k$ by
the induction hypothesis, and thus so does G.

Therefore, we may assume that all proper subgroups of G have index divisible 
by p. By the class formula (Proposition 1.8), p divides the order of the center Z(G). 
By Cauchy’s theorem⁶, $\exists a \in Z(G)$ such that a has order p. The cyclic subgroup
$N = \langle a \rangle$ is contained in Z(G), and hence it is normal in G (Exercise 1.4). Therefore
we can consider the quotient G/N. 

Since |G/N| = |G|/p and $p^k$ divides |G| by hypothesis, we have that $p^{k-1}$
divides the order of G/N. By the induction hypothesis, we may conclude that
G/N contains a subgroup of order $p^{k-1}$. By the structure of the subgroups of
a quotient (§II.8.3, especially Proposition II.8.9), this subgroup must be of the
form P/N, for P a subgroup of G.

But then $|P| = |P/N| \cdot |N| = p^{k-1} \cdot p = p^k$, as needed. □

There are slicker ways to prove Theorem 2.5. We will see a pretty (and in- 
sightful) alternative in §2.3; but the proof given above is easy to remember and is 
a good template for similar arguments. 

Remark 2.7. The diligent reader worked out in Exercise II.8.20 a stronger state- 
ment than Proposition 2.6, for abelian groups. The arguments are similar; the 
advantage in the abelian case is that any cyclic subgroup produced by Cauchy’s 
theorem is automatically normal, while ensuring normality requires a few twists 
and turns in the general case (and, as a result, yields a weaker statement). ⌟

2.3. Sylow II. Theorem 2.5 tells us that some maximal p-group in G attains 
the largest size allowed by Lagrange’s theorem, that is, the maximal power of the 
prime p dividing |G|. 

One can be more precise: the second Sylow theorem tells us that every maximal
p-group in G is in fact a p-Sylow subgroup. It is as large as is allowed by Lagrange’s
theorem.

The situation is in fact even better: all p-Sylow subgroups are conjugates of 
each other⁷. Moreover, even better than this, every p-group inside G must be
contained in a conjugate of any fixed p-Sylow subgroup. 

The proof of this very precise result is very easy!

⁶ Note that, as mentioned in §2.1, we only need the abelian case of this theorem.

⁷ Of course if P is a p-Sylow subgroup of G, then so are all conjugates $gPg^{-1}$ of P.

Theorem 2.8 (Second Sylow theorem). Let G be a finite group, let P be a p-Sylow
subgroup, and let $H \subseteq G$ be a p-group. Then H is contained in a conjugate of P:
there exists $g \in G$ such that $H \subseteq gPg^{-1}$.

Proof. Act with H on the set of left-cosets of P, by left-multiplication. Since there
are [G : P] cosets and p does not divide [G : P], we know this action must have
fixed points (Exercise 1.1): let gP be one of them. This means that $\forall h \in H$:

$hgP = gP$;

that is, $g^{-1}hgP = P$ for all h in H; that is, $g^{-1}Hg \subseteq P$; that is, $H \subseteq gPg^{-1}$, as
needed. □
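The fixed-coset argument can be traced on a concrete case. In the Python sketch below, G is the symmetric group on four letters, P is a 2-Sylow subgroup of order 8, and H is a subgroup of order 2 not contained in P; these choices, and all helper names, are assumptions made here for illustration. The code searches for a coset gP fixed by H and then checks that H lies in $gPg^{-1}$.

```python
from itertools import permutations

def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def generate(gens):
    """Subgroup generated by gens (products of generators suffice in a finite group)."""
    e = tuple(range(len(gens[0])))
    group, frontier = {e}, [e]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

G = list(permutations(range(4)))            # S_4, of order 24 = 2^3 * 3
P = generate([(1, 2, 3, 0), (2, 1, 0, 3)])  # a 2-Sylow subgroup, of order 8
H = generate([(1, 0, 2, 3)])                # a 2-group of order 2, not inside P

def fixed_coset_representative(G, P, H):
    """Return g such that the coset gP is fixed by left-multiplication by H."""
    for g in G:
        g_inv = inverse(g)
        # hgP = gP for all h in H  <=>  g^{-1} h g lies in P for all h in H
        if all(compose(g_inv, compose(h, g)) in P for h in H):
            return g
    return None

g = fixed_coset_representative(G, P, H)
conjugate = {compose(g, compose(x, inverse(g))) for x in P}   # gPg^{-1}
print(len(P), H <= P, H <= conjugate)
```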

We can obtain an even more complete picture of the situation. Suppose we 
have constructed a chain 

$H_0 = \{e\} \subseteq H_1 \subseteq \cdots \subseteq H_k$

of p-subgroups of a group G, where $|H_i| = p^i$. By Theorem 2.8 we know that $H_k$
is contained in some p-Sylow subgroup, of order $p^r$, the maximum power of p
dividing the order of G. But we claim that the chain can in fact be continued one
step at a time all the way up to the Sylow subgroup:

$H_0 = \{e\} \subseteq H_1 \subseteq \cdots \subseteq H_k \subseteq H_{k+1} \subseteq \cdots \subseteq H_r$;

and, further, $H_k$ may be assumed to be normal in $H_{k+1}$. The following lemma will
simplify the proof of this fact considerably and will also help us prove the third 
Sylow theorem. 

Lemma 2.9. Let H be a p-group contained in a finite group G. Then
$[N_G(H) : H] \equiv [G : H] \bmod p$.

Proof. If H is trivial, then $N_G(H) = G$ and the two numbers are equal.

Assume then that H is nontrivial, and act with H on the set of left-cosets of
H in G, by left-multiplication. The fixed points of this action are the cosets gH
such that $\forall h \in H$

$hgH = gH$,

that is, such that $g^{-1}hg \in H$ for all $h \in H$; in other words, $H \subseteq gHg^{-1}$, and hence
(by order considerations) $gHg^{-1} = H$. This means precisely that $g \in N_G(H)$.
Therefore, the set of fixed points of the action consists of the set of cosets of H in
$N_G(H)$.

The statement then follows immediately from Corollary 1.3. □ 
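The congruence of Lemma 2.9 is easy to test on small examples. The Python sketch below is an illustration only: it takes G to be the symmetric group on four letters and three sample p-subgroups chosen here, computes normalizers by brute force, and returns the pair of indices $[N_G(H) : H]$ and $[G : H]$. All function names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def generate(gens):
    """Subgroup generated by gens (products of generators suffice in a finite group)."""
    e = tuple(range(len(gens[0])))
    group, frontier = {e}, [e]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

G = list(permutations(range(4)))   # S_4

def normalizer(G, H):
    """All g in G with g H g^{-1} = H."""
    return {g for g in G
            if {compose(g, compose(h, inverse(g))) for h in H} == H}

def indices(G, H):
    """Return ([N_G(H) : H], [G : H])."""
    return len(normalizer(G, H)) // len(H), len(G) // len(H)

H2 = generate([(1, 0, 2, 3)])                 # order 2, p = 2
H3 = generate([(1, 2, 0, 3)])                 # order 3, p = 3
P2 = generate([(1, 2, 3, 0), (2, 1, 0, 3)])   # order 8, p = 2

for H, p in [(H2, 2), (H3, 3), (P2, 2)]:
    a, b = indices(G, H)
    print(len(H), p, a, b, (a - b) % p)
```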

As a consequence, if $H_k$ is not a p-Sylow subgroup ‘already’, in the sense that p
‘still’ divides $[G : H_k]$, then p must also divide $[N_G(H_k) : H_k]$. Another application
of Cauchy’s theorem tells us how to obtain the next subgroup $H_{k+1}$ in the chain.
More precisely, we have the following result.
Proposition 2.10. Let H be a p-subgroup of a finite group G, and assume that H
is not a p-Sylow subgroup. Then there exists a p-subgroup H' of G containing H,
such that [H' : H] = p and H is normal in H'.

Proof. Since H is not a p-Sylow subgroup of G, p divides $[N_G(H) : H]$, by
Lemma 2.9. Since H is normal in $N_G(H)$, we may consider the quotient group
$N_G(H)/H$, and p divides the order of this group. By Theorem 2.1, $N_G(H)/H$ has
an element of order p; this generates a subgroup of order p of $N_G(H)/H$, which
must be (cf. §II.8.3) in the form H'/H for a subgroup H' of $N_G(H)$.

It is straightforward to verify that H' satisfies the stated requirements. □ 

The statement about ‘chains of p-subgroups’ follows immediately from this 
result. 

Note that Cauchy’s theorem and Proposition 2.10 provide a new proof of Propo- 
sition 2.6 and hence of the first Sylow theorem. 

2.4. Sylow III. The third (and last) Sylow theorem gives a good handle on the 
number of p-Sylow subgroups of a given finite group G. This is especially useful in 
establishing the existence of normal subgroups of G: since all p-Sylow subgroups 
of a group are conjugates of each other (by the second Sylow theorem), if there is 
only one p-Sylow subgroup, then that subgroup must be normal⁸.

Theorem 2.11. Let p be a prime integer, and let G be a finite group of order |G| =
$p^r m$. Assume that p does not divide m. Then the number of p-Sylow subgroups of
G divides m and is congruent to 1 modulo p.

Proof. Let $N_p$ denote the number of p-Sylow subgroups of G.

By Theorem 2.8, the p-Sylow subgroups of G are the conjugates of any given
p-Sylow subgroup P. By Lemma 1.13, $N_p$ is the index of the normalizer $N_G(P)$
of P; thus (Corollary 1.14) it divides the index m of P. In fact,

$m = [G : P] = [G : N_G(P)] \cdot [N_G(P) : P] = N_p \cdot [N_G(P) : P]$.

Now, by Lemma 2.9 we have

$m = [G : P] \equiv [N_G(P) : P] \bmod p$;

multiplying by $N_p$, we get

$m N_p \equiv m \bmod p$.

Since $m \not\equiv 0 \bmod p$ and p is prime, this implies

$N_p \equiv 1 \bmod p$,

as needed. □
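As a concrete check of the statement, one can count Sylow subgroups in a small group by listing the conjugates of one of them (which is all of them, by Theorem 2.8). The Python sketch below does this for the symmetric group on four letters, of order $24 = 2^3 \cdot 3$; the choice of group and the helper names are assumptions made here for illustration.

```python
from itertools import permutations

def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def generate(gens):
    """Subgroup generated by gens (products of generators suffice in a finite group)."""
    e = tuple(range(len(gens[0])))
    group, frontier = {e}, [e]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

def count_conjugates(G, P):
    """Number of distinct subgroups g P g^{-1}, g in G."""
    return len({frozenset(compose(g, compose(x, inverse(g))) for x in P)
                for g in G})

G = list(permutations(range(4)))               # S_4
P2 = generate([(1, 2, 3, 0), (2, 1, 0, 3)])    # a 2-Sylow subgroup (order 8), m = 3
P3 = generate([(1, 2, 0, 3)])                  # a 3-Sylow subgroup (order 3), m = 8

n2, n3 = count_conjugates(G, P2), count_conjugates(G, P3)
print(n2, n3)
```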

Of course there are other ways to prove Theorem 2.11: see for example Exer- 
cise 2.11. 


⁸ For an alternative viewpoint, see Exercise 2.2.




IV. Groups, second encounter 


2.5. Applications. Consequences stemming from the group actions we have en- 
countered, and especially the Sylow theorems, may be applied to establish exquisitely 
precise facts about individual groups as well as whole classes of groups; this is often 
based on some simple but clever numerology. 

The following examples are exceedingly simple-minded but will hopefully con- 
vey the flavor of what can be done with the tools we have built in the previous 
two sections. More examples may be found among the exercises at the end of this 
section. 

2.5.1. More nonsimple groups. 

Claim 2.12. Let G be a group of order $mp^r$, where p is a prime integer and
1 < m < p. Then G is not simple.

(Cf. Example 2.4.) 

Proof. By the third Sylow theorem, the number $N_p$ of p-Sylow subgroups divides
m and is in the form 1 + kp. Since m < p, this forces k = 0, $N_p = 1$. Therefore G
has a normal subgroup of order $p^r$; hence it is not simple. □

Of course the same argument gives the same conclusion for every group of order
$mp^r$, where (m, p) = 1 and the only divisor d of m such that $d \equiv 1 \bmod p$ is d = 1.

Example 2.13. There are no simple groups of order 2002. 

Indeed⁹,

$2002 = 2 \cdot 7 \cdot 11 \cdot 13$;

the divisors of $2 \cdot 7 \cdot 13$ are

1, 2, 7, 13, 14, 26, 91, 182:

of these, only 1 is congruent to 1 mod 11. Thus there is a normal subgroup of
order 11 in every group of order 2002. ⌟
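This kind of numerology is easy to mechanize. The Python sketch below lists, for a given order n and prime p, the values that the third Sylow theorem allows for the number of p-Sylow subgroups: the divisors d of m (where $n = p^r m$ with p not dividing m) with $d \equiv 1 \bmod p$. The function name `sylow_candidates` is a hypothetical one chosen here.

```python
def sylow_candidates(n, p):
    """Divisors d of m with d = 1 mod p, where n = p^r * m and p does not divide m."""
    m = n
    while m % p == 0:
        m //= p
    return [d for d in range(1, m + 1) if m % d == 0 and d % p == 1]

if __name__ == "__main__":
    for n, p in [(2002, 11), (12, 2), (12, 3), (24, 2)]:
        print(n, p, sylow_candidates(n, p))
```

A list equal to [1] forces a normal p-Sylow subgroup, as for n = 2002 and p = 11. When the list has more than one entry (as for n = 12 or n = 24), the theorem alone does not decide the matter and a finer argument is needed.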

The reader should not expect the third Sylow theorem to always yield its fruits 
so readily, however. 

Example 2.14. There are no simple groups of order 12. 

Note that $3 \equiv 1 \bmod 2$ and $4 \equiv 1 \bmod 3$: thus the argument used above does not
guarantee the existence of either a normal 2-Sylow subgroup or a normal 3-Sylow 
subgroup. 

However, suppose that there is more than one 3-Sylow subgroup. Then there
must be 4, by the third Sylow theorem. Since any two such subgroups must intersect
in the identity, this accounts for exactly 8 elements of order 3. Excluding these
leaves us with the identity and 3 elements of order 2 or 4; that is just enough room
to fit one 2-Sylow subgroup. This subgroup will then have to be normal.

Thus, either there is a 3-Sylow normal subgroup or there is a 2-Sylow normal
subgroup; either way, the group is not simple. ⌟

⁹ It is safe to guess that this statement has been assigned on hundreds of algebra tests across
the world in the year 2002.

Even this more refined counting will often fail, and one has to dig deeper.

Example 2.15. There are no simple groups of order 24.

Indeed, let G be a group of order 24, and consider its 2-Sylow subgroups; by 
the third Sylow theorem, there are either 1 or 3 such subgroups. If there is 1, the 
2-Sylow subgroup is normal and G is not simple. Otherwise, G acts (nontrivially) 
by conjugation on this set of three 2-Sylow subgroups; this action gives a nontrivial 
homomorphism $G \to S_3$, whose kernel is a proper, nontrivial normal subgroup of
G; thus again G is not simple. ⌟

The reader should practice by selecting a random number n and trying to say 
as much as he/she can, in general, about groups of order n. Beware: such problems
are a common feature of qualifying exams. 

2.5.2. Groups of order pq, p < q prime.

Claim 2.16. Assume p < q are prime integers and $q \not\equiv 1 \bmod p$. Let G be a group
of order pq. Then G is cyclic.

Proof. By the third Sylow theorem, G has a unique (hence normal) subgroup H
of order p. Indeed, the number $N_p$ of p-Sylow subgroups must divide q, and q is
prime, so $N_p = 1$ or q. Necessarily $N_p \equiv 1 \bmod p$, and $q \not\equiv 1 \bmod p$ by hypothesis;
therefore $N_p = 1$.

Since H is normal, conjugation gives an action of G on H, hence (by Exercise 1.21)
a homomorphism $\gamma : G \to \mathrm{Aut}(H)$. Now H is cyclic of order p, so
|Aut(H)| = p − 1 (Exercise II.4.14); the order of $\gamma(G)$ must divide both pq and p − 1,
and it follows that $\gamma$ is the trivial map.

Therefore, conjugation is trivial on H: that is, $H \subseteq Z(G)$. Lemma 1.5 implies
that G is abelian. 

Finally, an abelian group of order pq, with p < q primes, is necessarily cyclic: 
indeed it must contain elements g, h of order p, q, respectively (for example by 
Cauchy’s theorem), and then |gh| = pq by Exercise II.1.14. □

For example, this statement ‘classifies’ all groups of order 15, 33, 35, 51, . . . : 
such groups are necessarily cyclic. 
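The orders to which Claim 2.16 applies can be listed mechanically: products pq of primes p < q with $q \not\equiv 1 \bmod p$. The Python sketch below does so up to a bound; the function names and the bounds are choices made here for illustration.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    return n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))

def cyclic_pq_orders(bound):
    """Orders pq < bound with p < q prime and q not congruent to 1 mod p."""
    primes = [n for n in range(2, bound) if is_prime(n)]
    return sorted(p * q for p in primes for q in primes
                  if p < q and q % p != 1 and p * q < bound)

if __name__ == "__main__":
    print(cyclic_pq_orders(80))
```

Note that p = 2 never contributes, since every odd prime q satisfies $q \equiv 1 \bmod 2$; this is consistent with the existence of the noncommutative groups of order 2q.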

The argument given in the proof is rather ‘high-brow’, as it involves the auto- 
morphism group of H; that is precisely why we gave it. For low-brow alternatives, 
see Exercise 2.18 or Remark 5.4. 

The condition $q \not\equiv 1 \bmod p$ in Claim 2.16 is clearly necessary: indeed, $|S_3| = 2 \cdot 3$
is the product of two distinct primes, and yet $S_3$ is not cyclic. The argument given
in the proof shows that if |G| = pq, with p < q prime, and G has a normal subgroup
of order p, then G is cyclic. If $q \equiv 1 \bmod p$, it can be shown that there is in fact
a unique noncommutative group of order pq up to isomorphism: the reader will
work this out after learning about semidirect products (Exercise 5.12). But we are
in fact already in the position of obtaining rather sophisticated information about
this group, even without knowing its construction in general (Exercise 2.19).

For fun, let’s tackle the case in which p = 2. 
Claim 2.17. Let q be an odd prime, and let G be a noncommutative group of
order 2q. Then $G \cong D_{2q}$, the dihedral group.

Proof. By Cauchy’s theorem, $\exists y \in G$ such that y has order q. By the third Sylow
theorem, $\langle y \rangle$ is the unique subgroup of order q in G (and is therefore normal).
Since G is not commutative and in particular it is not cyclic, it has no elements of
order 2q; therefore, every element in the complement of $\langle y \rangle$ has order 2; let x be
any such element.

The conjugate $xyx^{-1}$ of y by x is an element of order q, so $xyx^{-1} \in \langle y \rangle$. Thus,
$xyx^{-1} = y^r$ for some r between 0 and q − 1.

Now observe that 

$(y^r)^r = (xyx^{-1})^r = xy^rx^{-1} = x^2y(x^{-1})^2 = y$

since |x| = 2. Therefore, $y^{r^2-1} = e$, which implies

$q \mid (r^2 - 1) = (r-1)(r+1)$

by Corollary II.1.11. Since q is prime, this says that q | (r − 1) or q | (r + 1); since
$0 \le r \le q - 1$, it follows that r = 1 or r = q − 1.

If r = 1, then $xyx^{-1} = y$; that is, xy = yx. But then the order of xy is 2q (by
Exercise II.1.14), and G is cyclic, a contradiction.

Therefore r = q — 1, and we have established the relations 

$x^2 = e$,

$y^q = e$,

$yx = xy^{q-1}$.

These are the relations satisfied by generators x, y of $D_{2q}$, as the reader hopefully
verified in Exercise II.2.5; the statement follows. □ 
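Two steps of this proof lend themselves to a quick numerical check. The Python sketch below is an illustration with hypothetical helper names: it first lists the solutions of $r^2 \equiv 1 \bmod q$ with $0 \le r \le q - 1$, and then models the dihedral group on the vertices 0, ..., q − 1 of a regular q-gon (y is the rotation i ↦ i + 1, x the reflection i ↦ −i, a realization assumed here) to confirm the three relations and that the group generated has order 2q.

```python
def compose(p, q):
    """Permutation 'p after q'; a permutation is stored as a tuple of images."""
    return tuple(p[i] for i in q)

def power(p, k):
    result = tuple(range(len(p)))
    for _ in range(k):
        result = compose(result, p)
    return result

def generate(gens):
    """Subgroup generated by gens (products of generators suffice in a finite group)."""
    e = tuple(range(len(gens[0])))
    group, frontier = {e}, [e]
    while frontier:
        z = frontier.pop()
        for g in gens:
            w = compose(z, g)
            if w not in group:
                group.add(w)
                frontier.append(w)
    return group

def square_roots_of_one(q):
    """All r in 0..q-1 with r^2 = 1 mod q."""
    return [r for r in range(q) if (r * r - 1) % q == 0]

def dihedral_generators(q):
    """Rotation y and reflection x of a regular q-gon, as permutations of its vertices."""
    y = tuple((i + 1) % q for i in range(q))
    x = tuple((-i) % q for i in range(q))
    return x, y

q = 5
x, y = dihedral_generators(q)
e = tuple(range(q))
print(square_roots_of_one(q))
print(power(x, 2) == e, power(y, q) == e,
      compose(y, x) == compose(x, power(y, q - 1)))
print(len(generate([x, y])))
```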

Claim 2.17 yields a classification of groups of order 2q, for q an odd prime: such 
a group must be either abelian (and hence cyclic, by the usual considerations) or 
isomorphic to a dihedral group. For q = 3, we recover the result of Exercise 1.13: 
every noncommutative group of order 6 is isomorphic to $D_6 \cong S_3$.


Exercises 


2.1. ▷ Prove Claim 2.2. [§2.1]


2.2. ▷ Let G be a group. A subgroup H of G is characteristic if $\varphi(H) \subseteq H$ for
every automorphism $\varphi$ of G.


• Prove that characteristic subgroups are normal. 

• Let $H \subseteq K \subseteq G$, with H characteristic in K and K normal in G. Prove that H
is normal in G.

• Let G, K be groups, and assume that G contains a single subgroup H isomorphic 
to K. Prove that H is normal in G. 
• Let K be a normal subgroup of a finite group G, and assume that |K| and |G/K|
are relatively prime. Prove that K is characteristic in G.

[§2.1, §2.4, 2.13, §3.3]

2.3. Prove that a nonzero abelian group G is simple if and only if G = Z/pZ for 
some positive prime integer p. 

2.4. ▷ Prove that a group G is simple if and only if its only homomorphic images
(i.e., groups G' such that there is an onto homomorphism $G \to G'$) are the trivial
group and G itself (up to isomorphism). [§3.2]

2.5. Let G be a simple group, and assume $\varphi : G \to G'$ is a nontrivial group
homomorphism. Prove that $\varphi$ is injective.

2.6. Prove that there are no simple groups of order 4, 8, 9, 16, 25, 27, 32, or 49. 
In fact, prove that no p-group of order $\ge p^2$ is simple.

2.7. Prove that there are no simple groups of order 6, 10, 14, 15, 20, 21, 22, 26, 
28, 33, 34, 35, 38, 39, 42, 44, 46, 51, 52, 55, 57, or 58. (Hint: Example 2.4.) 

2.8. Let G be a finite group, p a prime integer, and let N be the intersection of
the p-Sylow subgroups of G. Prove that N is a normal p-subgroup of G and that
every normal p-subgroup of G is contained in N. (In other words, G/N is final
with respect to the property of being a homomorphic image of G of order $|G|/p^a$
for some a.)

2.9. ¬ Let P be a p-Sylow subgroup of a finite group G, and let $H \subseteq G$ be a
p-subgroup. Assume $H \subseteq N_G(P)$. Prove that $H \subseteq P$. (Hint: P is normal in $N_G(P)$,
so PH is a subgroup of $N_G(P)$ by Proposition II.8.11, and $|PH/P| = |H/(P \cap H)|$.
Show that this implies that PH is a p-group, and hence PH = P since P is a
maximal p-subgroup of G. Deduce that $H \subseteq P$.) [2.10]

2.10. ¬ Let P be a p-Sylow subgroup of a finite group G, and act with P by
conjugation on the set of p-Sylow subgroups of G. Show that P is the unique fixed
point of this action. (Hint: Use Exercise 2.9.) [2.11]

2.11. ▷ Use the second Sylow theorem, Corollary 1.14, and Exercise 2.10 to paste
together an alternative proof of the third Sylow theorem. [§2.4] 

2.12. Let P be a p-Sylow subgroup of a finite group G, and let $H \subseteq G$ be a
subgroup containing the normalizer $N_G(P)$. Prove that $[G : H] \equiv 1 \bmod p$.

2.13. ¬ Let P be a p-Sylow subgroup of a finite group G.

• Prove that if P is normal in G, then it is in fact characteristic in G (cf. Exer- 
cise 2.2). 

• Let $H \subseteq G$ be a subgroup containing the Sylow subgroup P. Assume P is
normal in H and H is normal in G. Prove that P is normal in G.

• Prove that $N_G(N_G(P)) = N_G(P)$.

[3.12] 

2.14. Prove that there are no simple groups of order 18, 40, 45, 50, or 54.

2.15. Classify all groups of order $n \le 15$, $n \ne 8, 12$: that is, produce a list of
nonisomorphic groups such that every group of order $n \ne 8, 12$, $n \le 15$ is isomorphic
to one group in the list.

2.16. ¬ Let G be a noncommutative group of order 8.

• Prove that G contains elements of order 4 and no elements of order 8.

• Let y be an element of order 4. Prove that G is generated by y and by an element
$x \notin \langle y \rangle$, such that $x^2 = e$ or $x^2 = y^2$.

• In either case, $G = \{e, y, y^2, y^3, x, yx, y^2x, y^3x\}$. Prove that the multiplication
table of G is determined by whether $x^2 = e$ or $x^2 = y^2$, and by the value of xy.

• Prove that necessarily $xy = y^3x$. (Hint: To eliminate $xy = y^2x$, multiply on the
right by y.)

• Prove that $G \cong D_8$ or $G \cong Q_8$.

[6.2, VII.6.6] 

2.17. ¬ Let R be a division ring (Definition III.1.13), and assume |R| = 64. Prove
that R is necessarily commutative (hence, a field), as follows:

• The group of units of R has order 63. Prove it has a commutative subgroup G
of order 9. (Sylow.)

• Prove that R is the only sub-division ring of R containing G.

• Prove that the set of elements of R commuting with every element of G is a
sub-division ring of R containing G. (Cf. Exercise III.2.10.)

• Conclude that G is contained in the center of R. Recall that the center of R is a
sub-division ring of R (cf. Exercise III.2.9), and conclude that R is commutative.

Like Exercise III.2.11, this is a particular case of a theorem of Wedderburn, according
to which every finite division ring is a field. [VII.5.16]

2.18. ▷ Give an alternative proof of Claim 2.16 as follows: use the third Sylow
theorem to count the number of elements of order p and q in G; use this to show
that there are elements in G of order neither 1 nor p nor q; deduce that G is cyclic. 
[§2.5] 

2.19. ▷ Let G be a noncommutative group of order pq, where p < q are primes.

• Show that $q \equiv 1 \bmod p$.

• Show that the center of G is trivial. 

• Draw the lattice of subgroups of G. 

• Find the number of elements of each possible order in G. 

• Find the number and size of the conjugacy classes in G. 

[§2.5]

2.20. How many elements of order 7 are there in a simple group of order 168? 

2.21. Let p < q < r be prime integers, and let G be a group of order pqr. Prove 
that G is not simple. 
2.22. Let G be a finite group, n = |G|, and p be a prime divisor of n. Assume that
the only divisor of n that is congruent to 1 modulo p is 1. Prove that G is not simple.

2.23. ¬ Let $N_p$ denote the number of p-Sylow subgroups of a group G. Prove that
if G is simple, then |G| divides $N_p!$ for all primes p in the factorization of |G|. More
generally, prove that if G is simple and H is a subgroup of G of index N, then |G|
divides N!. (Hint: Exercise II.9.12.) This problem capitalizes on the idea behind
Example 2.15. [2.25]

2.24. ▷ Prove that there are no noncommutative simple groups of order less than 60.
If you have sufficient stamina, prove that the next possible order for a noncommutative
simple group is 168. (Don’t feel too bad if you have to cheat and look up a
few particularly troublesome orders > 60.) [§4.4]

2.25. ¬ Assume that G is a simple group of order 60.

• Use Sylow’s theorems and simple numerology to prove that G has either five 
or fifteen 2-Sylow subgroups, accounting for fifteen elements of order 2 or 4. 
(Exercise 2.23 will likely be helpful.) 

• If there are fifteen 2-Sylow subgroups, prove that there exists an element $g \in G$
of order 2 contained in at least two of them. Prove that the centralizer of g has 
index 5. 

Conclude that every simple group¹⁰ of order 60 contains a subgroup of index 5.
[4.22] 


3. Composition series and solvability 

We have claimed that simple groups (in the sense of Definition 2.3) are the ‘basic 
constituents’ of all finite groups. Among other things, the material in this section 
will (partially) justify this claim. 

3.1. The Jordan-Hölder theorem. A series of subgroups $G_i$ of a group G is a
decreasing sequence of subgroups starting from G:

$G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq \cdots$.

The length of a series is the number of strict inclusions.

A series is normal if $G_{i+1}$ is normal in $G_i$ for all i. We will be interested in the
maximal length of a normal series in G; if finite, we will denote this number¹¹ by
$\ell(G)$. The number $\ell(G)$ is a measure of how far G is from being simple. Indeed,
$\ell(G) = 0$ if and only if G is trivial, and $\ell(G) = 1$ if and only if G is nontrivial and
simple: for a simple nontrivial group, the only maximal normal series is

$G \supsetneq \{e\}$.

¹⁰ The reader will prove later (Exercise 4.22) that there is in fact only one simple group of
order 60 up to isomorphism and that this group contains exactly five 2-Sylow subgroups. The
result obtained here will be needed to establish this fact.

¹¹ There does not appear to be a standard notation for this concept.
Definition 3.1. A composition series for G is a normal series

$G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq \cdots \supsetneq G_n = \{e\}$

such that the successive quotients $G_i/G_{i+1}$ are simple. ⌟

It is clear (by induction on the order) that finite groups have composition series, 
while infinite groups do not necessarily have one (Exercise 3.3). It is also clear that 
if a normal series has maximal length $\ell(G)$, then it is a composition series. What is
not clear is that the converse holds: conceivably, there could exist maximal normal
series of different lengths (the longest ones having length $\ell(G)$). For example, why
can’t there be a finite group G with $\ell(G) = 3$ and two different composition series

$G \supsetneq G_1 \supsetneq G_2 \supsetneq \{e\}$

and

$G \supsetneq G'_1 \supsetneq \{e\}$

(that is: a finite group G with $\ell(G) = 3$ and a simple normal subgroup $G'_1$ such
that $G/G'_1$ is simple)?

Part of the content of the Jordan-Hölder theorem is that (luckily) this cannot
happen. In fact, the theorem is much more precise: not only do all composition 
series have the same length, but they also have the same quotients (appearing, 
however, in possibly different orders). 

Theorem 3.2 (Jordan-Hölder). Let G be a group, and let

$G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq \cdots \supsetneq G_n = \{e\}$,

$G = G'_0 \supsetneq G'_1 \supsetneq G'_2 \supsetneq \cdots \supsetneq G'_m = \{e\}$

be two composition series for G. Then m = n, and the lists of quotient groups
$H_i = G_i/G_{i+1}$, $H'_i = G'_i/G'_{i+1}$ agree (up to isomorphism) after a permutation of
the indices.

Proof. Let

(*) $G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq \cdots \supsetneq G_n = \{e\}$

be a composition series. Argue by induction on n: if n = 0, then G is trivial, and
there is nothing to prove. Assume n > 0, and let

(**) $G = G'_0 \supsetneq G'_1 \supsetneq G'_2 \supsetneq \cdots \supsetneq G'_m = \{e\}$

be another composition series for G. If $G_1 = G'_1$, then the result follows from the
induction hypothesis, since $G_1$ has a composition series of length n − 1 < n.

We may then assume $G_1 \ne G'_1$. Note that $G_1G'_1 = G$: indeed, $G_1G'_1$ is normal
in G (Exercise 3.5), and $G_1 \subsetneq G_1G'_1$; but there are no proper normal subgroups
between $G_1$ and G since $G/G_1$ is simple.

Let $K = G_1 \cap G'_1$, and let

$K \supsetneq K_1 \supsetneq K_2 \supsetneq \cdots \supsetneq K_r = \{e\}$
be a composition series for K. By Proposition II.8.11 (the “second isomorphism
theorem”),

$\frac{G_1}{K} = \frac{G_1}{G_1 \cap G'_1} \cong \frac{G_1G'_1}{G'_1} = \frac{G}{G'_1}$ and $\frac{G'_1}{K} \cong \frac{G}{G_1}$

are simple. Therefore, we have two new composition series for G:

$G \supsetneq G_1 \supsetneq K \supsetneq K_1 \supsetneq \cdots \supsetneq K_r = \{e\}$,

$G \supsetneq G'_1 \supsetneq K \supsetneq K_1 \supsetneq \cdots \supsetneq K_r = \{e\}$,

which only differ at the first step. These two series trivially have the same length
and the same quotients (the first two quotients get switched from one series to the
other).

Now we claim that the first of these two series has the same length and quotients
as the series (*). Indeed,

$G_1 \supsetneq K \supsetneq K_1 \supsetneq \cdots \supsetneq K_r = \{e\}$

is a composition series for $G_1$: by the induction hypothesis, it must have the same
length and quotients as the composition series

$G_1 \supsetneq G_2 \supsetneq \cdots \supsetneq G_n = \{e\}$,

verifying our claim (and note that, in particular, r = n − 2).

By the same token, applying the induction hypothesis to the series of length
n − 1, i.e.,

$G'_1 \supsetneq K \supsetneq K_1 \supsetneq \cdots \supsetneq K_{n-2} = \{e\}$,

shows that the second series has the same length and quotients as (**), and the
statement follows. □

3.2. Composition factors; Schreier’s theorem. Two normal series are equivalent
if they have the same length and the same quotients (up to order). The
Jordan-Hölder theorem shows that any two maximal finite series of a group are
equivalent. That is, the (isomorphism classes of the) quotients of a composition
series depend only on the group, not on the chosen series. These are the composition
factors of the group. They form a multiset¹² of simple groups: the ‘basic
constituents’ of our loose comment back in §2.

It is clear that two isomorphic groups must have the same composition factors. 
Unfortunately, it is not possible to reconstruct a group from its composition factors 
alone (Exercise 3.4). One has to take into account the way the simple groups are 
‘glued’ together; we will come back to this point in §5.2. 

The intuition that the composition factors of a group are its basic constituents 
is reinforced by the following fact: if G is a group with a composition series, then 
the composition factors of every normal subgroup N of G are composition factors 
of G and the remaining ones are the composition factors of the quotient G/N. 

12 See §1.2.2 for a reminder on multisets: they are sets of elements counted with multiplicity.
For example, the composition factors of $\mathbb{Z}/4\mathbb{Z}$ form the multiset consisting of two copies of $\mathbb{Z}/2\mathbb{Z}$.
Example 3.3. Let $G = \mathbb{Z}/6\mathbb{Z} = \{[0], [1], [2], [3], [4], [5]\}$. Then

\[\{[0], [1], [2], [3], [4], [5]\} \supsetneq \{[0], [3]\} \supsetneq \{[0]\}\]

is a composition series for $G$; the quotients are $\mathbb{Z}/3\mathbb{Z}$, $\mathbb{Z}/2\mathbb{Z}$, respectively. The (normal)
subgroup $N = \{[0], [2], [4]\}$ ‘turns off’ the second factor: indeed, intersecting
the series with $N$ gives

\[\{[0], [2], [4]\} \supsetneq \{[0]\} = \{[0]\},\]

a series with composition factor $\mathbb{Z}/3\mathbb{Z}$. On the other hand, ‘mod-ing out by $N$’
turns off the first factor: keeping in mind $[3] + N = [1] + N$, etc., we find

\[\{[0] + N, [1] + N\} = \{[0] + N, [1] + N\} \supsetneq \{[0] + N\},\]

a series with lone composition factor $\mathbb{Z}/2\mathbb{Z}$. ⌟
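The bookkeeping in this example can be replayed mechanically. The following is only a minimal sketch: it assumes that the classes [0], ..., [5] are represented by the integers 0 to 5 under addition mod 6, and the names `series`, `inter`, `images`, and `coset` are ours, chosen for illustration.

```python
# Z/6Z is modeled by the integers 0..5 with addition mod 6 (an assumption of this sketch).
n = 6
G = set(range(n))
series = [G, {0, 3}, {0}]   # the composition series of the example
N = {0, 2, 4}               # the normal subgroup N

# Intersect each term of the series with N, dropping repetitions.
inter = []
for H in series:
    K = H & N
    if not inter or inter[-1] != K:
        inter.append(K)

def coset(x):
    """The coset x + N, as a frozenset of representatives."""
    return frozenset((x + y) % n for y in N)

# Push each term of the series down to G/N, dropping repetitions.
images = []
for H in series:
    K = {coset(x) for x in H}
    if not images or images[-1] != K:
        images.append(K)

print([sorted(K) for K in inter])   # [[0, 2, 4], [0]]
print([len(K) for K in images])     # [2, 1]
```

The first list is a series for N with a single quotient of order 3; the second records the orders of the terms of the induced series for G/N, with a single quotient of order 2.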

This phenomenon holds in complete generality: 

Proposition 3.4. Let G be a group, and let N be a normal subgroup of G. Then 
G has a composition series if and only if both N and G/N have composition series. 
Further, if this is the case, then 

\[\ell(G) = \ell(N) + \ell(G/N),\]

and the composition factors of G consist of the collection of composition factors of 
N and of G/N. 

Proof. If $G/N$ has a composition series, the subgroups appearing in it correspond
to subgroups of $G$ containing $N$, with isomorphic quotients, by Proposition II.8.10
(the “third isomorphism theorem”). Thus, if both $G/N$ and $N$ have composition
series, juxtaposing them produces a composition series for $G$, with the stated
consequence on composition factors.

The converse is a little trickier. Assume that G has a composition series 
\[G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq \cdots \supsetneq G_n = \{e\}\]

and that N is a normal subgroup of G. Intersecting the series with N gives a 
sequence of subgroups of the latter: 

\[N = G \cap N \supseteq G_1 \cap N \supseteq \cdots \supseteq \{e\} \cap N = \{e\}\]

such that $G_{i+1} \cap N$ is normal in $G_i \cap N$, for all $i$. We claim that this becomes a
composition series for $N$ once repetitions are eliminated. Indeed, this follows once
we establish that

\[\frac{G_i \cap N}{G_{i+1} \cap N}\]

is either trivial (so that $G_{i+1} \cap N = G_i \cap N$, and the corresponding inclusion may
be omitted) or isomorphic to $G_i/G_{i+1}$ (hence simple, and one of the composition
factors of $G$). To see this, consider the homomorphism

\[G_i \cap N \hookrightarrow G_i \twoheadrightarrow \frac{G_i}{G_{i+1}}:\]

the kernel is clearly $G_{i+1} \cap N$; therefore (by the first isomorphism theorem) we have
an injective homomorphism

\[\frac{G_i \cap N}{G_{i+1} \cap N} \hookrightarrow \frac{G_i}{G_{i+1}}\]

identifying $(G_i \cap N)/(G_{i+1} \cap N)$ with a subgroup of $G_i/G_{i+1}$. Now, this subgroup
is normal (because $N$ is normal in $G$) and $G_i/G_{i+1}$ is simple; our claim follows.
As for $G/N$, obtain a sequence of subgroups from a composition series for $G$:

\[\frac{G}{N} \supseteq \frac{G_1N}{N} \supseteq \frac{G_2N}{N} \supseteq \cdots \supseteq \frac{\{e_G\}N}{N}\]

such that $(G_{i+1}N)/N$ is normal in $(G_iN)/N$. As above, we have to check that

\[\frac{(G_iN)/N}{(G_{i+1}N)/N}\]

is either trivial or isomorphic to $G_i/G_{i+1}$. By the third isomorphism theorem, this
quotient is isomorphic to $(G_iN)/(G_{i+1}N)$. This time, consider the homomorphism

\[G_i \hookrightarrow G_iN \twoheadrightarrow \frac{G_iN}{G_{i+1}N}:\]

this is surjective (check!), and the subgroup $G_{i+1}$ of the source is sent to the identity
element in the target; hence (by Theorem II.7.12) there is an onto homomorphism

\[\frac{G_i}{G_{i+1}} \twoheadrightarrow \frac{G_iN}{G_{i+1}N}.\]

Since $G_i/G_{i+1}$ is simple, it follows that $(G_iN)/(G_{i+1}N)$ is either trivial or
isomorphic to it (Exercise 2.4), as needed.

Summarizing, we have shown that if G has a composition series and N is normal 
in G, then both N and G/N have composition series. The first part of the argument 
yields the statement on lengths and composition factors, concluding the proof. □ 


One nice consequence of the Jordan-Hölder theorem is the following observation.
A series is a refinement of another series if all terms of the second appear in the
first.

Proposition 3.5. Any two normal series of a finite group ending with {e} admit
equivalent refinements.


Proof. Refine the series to a composition series; then apply the Jordan-Hölder
theorem. □


In fact, Schreier’s theorem asserts that this holds for all groups (while the 
argument given here only works for groups admitting a composition series, e.g., 
finite groups). Proving this in general is reasonably straightforward, from judicious 
applications of the second isomorphism theorem (cf. Exercise 3.7). 
3.3. The commutator subgroup, derived series, and solvability. It has
been a while since we have encountered a universal object; here is one. For any
group $G$, consider the category whose objects are group homomorphisms $\alpha : G \to A$
from $G$ to a commutative group and whose morphisms $\alpha \to \beta$ are (as the reader
should expect) commutative diagrams

[Commutative diagram: $\alpha : G \to A$, $\beta : G \to B$, and $\varphi : A \to B$, with $\varphi \circ \alpha = \beta$]

where $\varphi$ is a homomorphism.

Does this category have an initial object? That is, given a group G, does there 
exist a commutative group which is universal with respect to the property of being 
a homomorphic image of $G$?
Yes. 

Such a group may well be thought of as the closest ‘commutative approximation’ 
of the given group G. To verify that this universal object exists, we introduce the 
following important notion. (The diligent reader has begun exploring this territory 
already, in Exercise II.7.11.) 

Definition 3.6. Let $G$ be a group. The commutator subgroup of $G$ is the subgroup
generated by all elements

\[ghg^{-1}h^{-1}\]

with $g, h \in G$. ⌟

The element $ghg^{-1}h^{-1}$ is often denoted $[g, h]$ and is called the commutator of
$g$ and $h$. Thus, $g$, $h$ commute with each other if and only if $[g, h] = e$.

In the same notational style, the commutator subgroup of $G$ should be denoted
$[G, G]$; this is a bit heavy, and the common shorthand for it is $G'$, which offers
the possibility of iterating the notation. Thus, $G''$ may be used to denote the
commutator subgroup of the commutator subgroup of $G$, and $G^{(i)}$ denotes the $i$-th
iterate. We will adopt this notation in this subsection for convenience, but not
elsewhere in this book (as we want to be able to ‘prime’ any letter we wish, for any
reason).

First we record the following trivial, but useful, remark: 

Lemma 3.7. Let $\varphi : G_1 \to G_2$ be a group homomorphism. Then $\forall g, h \in G_1$ we
have

\[\varphi([g, h]) = [\varphi(g), \varphi(h)]\]

and $\varphi(G_1') \subseteq G_2'$.

This simple observation makes the key properties of the commutator subgroup 
essentially immediate (cf. Exercise II.7.11): 

Proposition 3.8. Let G' be the commutator subgroup of G. Then 

• G' is normal in G; 

• the quotient G/G' is commutative; 
• if $\alpha : G \to A$ is a homomorphism of $G$ to a commutative group, then $G' \subseteq \ker \alpha$;

• the natural projection $G \to G/G'$ is universal in the sense explained above.

Proof. These are all easy consequences of Lemma 3.7:

• By Lemma 3.7, the commutator subgroup is characteristic, hence normal
(cf. Exercise 2.2).

• By Lemma 3.7, the commutator of any two cosets $gG'$, $hG'$ is the coset of the
commutator $[g, h]$; hence it is the identity in $G/G'$. As noted above, this implies
that $G/G'$ is commutative.

• Let $\alpha : G \to A$ be a homomorphism to a commutative group. By Lemma 3.7,
$\alpha(G') \subseteq A' = \{e\}$: that is, $G' \subseteq \ker \alpha$.

• The universality follows from the previous point and from the universal
property of quotients (Theorem II.7.12). □

Taking successive commutators of a group produces a descending sequence of 
subgroups, 

\[G \supseteq G' \supseteq G'' \supseteq G''' \supseteq \cdots,\]

which is ‘normal’ in the sense indicated in §3.1. 

Definition 3.9. Let G be a group. The derived series of G is the sequence of 
subgroups 

\[G \supseteq G' \supseteq G'' \supseteq G''' \supseteq \cdots.\] ⌟

The derived series may or may not end with the identity of G. For example, 
if G is commutative, then the sequence gets there right away: 

\[G \supseteq G' = \{e\};\]

however, if G is simple and noncommutative, then it gets stuck at the first step: 

G = G' = G" = • • • 

(indeed, $G'$ is normal and $\ne \{e\}$ as $G$ is noncommutative; but then $G' = G$ since
$G$ is simple).

Definition 3.10. A group is solvable if its derived series terminates with the
identity. ⌟


For example, abelian groups are solvable. 
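For small groups the derived series can be computed by brute force. The sketch below is an illustration under our own assumptions, not a general-purpose implementation: permutations of {0, ..., n-1} are stored as tuples, products are taken in right-action order (apply the left factor first), and the helper names `mul`, `inv`, `generated`, `derived_subgroup`, and `derived_series_orders` are hypothetical. It records the orders of the successive commutator subgroups of a symmetric group, stopping when the series reaches the identity or gets stuck.

```python
from itertools import permutations

def mul(s, t):
    """Product st with permutations acting on the right: apply s first, then t."""
    return tuple(t[i] for i in s)

def inv(s):
    """Inverse permutation."""
    r = [0] * len(s)
    for i, j in enumerate(s):
        r[j] = i
    return tuple(r)

def generated(gens, n):
    """Subgroup generated by gens; in a finite group, closing under products suffices."""
    e = tuple(range(n))
    group, frontier = {e}, [e]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

def derived_subgroup(G, n):
    """Subgroup generated by all commutators g h g^-1 h^-1 with g, h in G."""
    comms = {mul(mul(g, h), mul(inv(g), inv(h))) for g in G for h in G}
    return generated(comms, n)

def derived_series_orders(n):
    """Orders of the terms of the derived series of the symmetric group on n letters."""
    G = set(permutations(range(n)))
    orders = [len(G)]
    while len(G) > 1:
        D = derived_subgroup(G, n)
        orders.append(len(D))
        if D == G:      # the series is stuck and never reaches the identity
            break
        G = D
    return orders

print(derived_series_orders(4))   # [24, 12, 4, 1]
print(derived_series_orders(5))   # [120, 60, 60]
```

In this computation the series of the symmetric group on 4 letters reaches the identity, while that of the symmetric group on 5 letters stalls at a subgroup of order 60.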

The importance of this notion will be most apparent in the relatively distant
future because of a brilliant application of Galois theory (§VII.7.4). But we can
already appreciate it in the way it relates to the material we just covered. A normal
series is abelian, resp., cyclic, if all quotients are abelian, resp., cyclic¹³.

Proposition 3.11. For a finite group G, the following are equivalent: 

(i) All composition factors of G are cyclic. 

(ii) G admits a cyclic series ending in {e}. 

(iii) G admits an abelian series ending in {e}.

(iv) G is solvable.

13 Thus, a composition series should be called ‘simple’; to our knowledge, it is not.


Proof. (i) $\Rightarrow$ (ii) $\Rightarrow$ (iii) are trivial. (iii) $\Rightarrow$ (i) is obtained by refining an
abelian series to a composition series (keeping in mind that the simple abelian
groups are cyclic p-groups).

(iv) $\Rightarrow$ (iii) is also trivial, since the derived series is abelian (by the second
point in Proposition 3.8).

Thus, we only have to prove (iii) $\Rightarrow$ (iv). For this, let

\[G = G_0 \supseteq G_1 \supseteq G_2 \supseteq \cdots \supseteq G_n = \{e\}\]

be an abelian series. Then we claim that $G^{(i)} \subseteq G_i$ for all $i$, where $G^{(i)}$ denotes
the $i$-th ‘iterated’ commutator subgroup.

This can be verified by induction. For $i = 1$, $G/G_1$ is commutative; thus
$G' \subseteq G_1$, by the third point in Proposition 3.8. Assuming we know $G^{(i)} \subseteq G_i$, the
fact that $G_i/G_{i+1}$ is abelian implies $G_i' \subseteq G_{i+1}$, and hence

\[G^{(i+1)} = (G^{(i)})' \subseteq G_i' \subseteq G_{i+1},\]

as claimed.

In particular we obtain that $G^{(n)} \subseteq G_n = \{e\}$: that is, the derived series
terminates at {e}, as needed. □

Example 3.12. All p-groups are solvable. Indeed, the composition factors of a
p-group are simple p-groups (what else could they be?), hence cyclic. ⌟

Corollary 3.13. Let N be a normal subgroup of a group G. Then G is solvable if 
and only if both N and G/N are solvable. 


Proof. This follows immediately from Proposition 3.4 and the formulation of solv- 
ability in terms of composition factors given in Proposition 3.11. □ 


The Feit-Thompson theorem asserts that every finite group of odd order is 
solvable. This result is many orders of magnitude beyond the scope of this book: 
the original 1963 proof runs about 250 pages.
Exercises


3.1. Prove that $\mathbb{Z}$ has normal series of arbitrary lengths. (Thus, $\ell(\mathbb{Z})$ is not finite.)

3.2. Let $G$ be a finite cyclic group. Compute $\ell(G)$ in terms of $|G|$. Generalize to
finite solvable groups.

3.3. ▷ Prove that every finite group has a composition series. Prove that $\mathbb{Z}$ does
not have a composition series. [§3.1]

3.4. ▷ Find an example of two nonisomorphic groups with the same composition
factors. [§3.2]

3.5. ▷ Show that if $H$, $K$ are normal subgroups of a group $G$, then $HK$ is a normal
subgroup of $G$. [§3.1]

3.6. Prove that $G_1 \times G_2$ has a composition series if and only if both $G_1$ and $G_2$
do, and explain how the corresponding composition factors are related.

3.7. ▷ Locate and understand a proof of (the general form of) Schreier’s theorem
that does not use the Jordan-Hölder theorem. Then obtain an alternative proof of
the Jordan-Hölder theorem, using Schreier’s. [§3.2]

3.8. ▷ Prove Lemma 3.7. [§3.3]

3.9. Let $G$ be a nontrivial p-group. Construct explicitly an abelian series for $G$,
using the fact that the center of a nontrivial p-group is nontrivial (Corollary 1.9).
This gives an alternative proof of the fact that p-groups are solvable (Example 3.12).

3.10. ¬ Let $G$ be a group. Define inductively an increasing sequence $Z_0 = \{e\} \subseteq
Z_1 \subseteq Z_2 \subseteq \cdots$ of subgroups of $G$ as follows: for $i \ge 1$, $Z_i$ is the subgroup of $G$
corresponding (as in Proposition II.8.9) to the center of $G/Z_{i-1}$.

• Prove that each $Z_i$ is normal in $G$, so that this definition makes sense.

A group is¹⁴ nilpotent if $Z_m = G$ for some $m$.

• Prove that $G$ is nilpotent if and only if $G/Z(G)$ is nilpotent.

• Prove that p-groups are nilpotent.

• Prove that nilpotent groups are solvable.

• Find a solvable group that is not nilpotent.

[3.11, 3.12, 5.1]

3.11. ¬ Let $H$ be a nontrivial normal subgroup of a nilpotent group $G$ (cf. Exercise 3.10).
Prove that $H$ intersects $Z(G)$ nontrivially. (Hint: Let $r \ge 1$ be the
smallest index such that $\exists h \ne e$, $h \in H \cap Z_r$. Contemplate a well-chosen
commutator $[g, h]$.) Since p-groups are nilpotent, this strengthens the result of Exercise 1.9.
[3.14]

14 There are many alternative characterizations for this notion that are equivalent to the one
given here but not too trivially so.
3.12. Let $H$ be a proper subgroup of a finite nilpotent group $G$ (cf. Exercise 3.10).
Prove that $H \subsetneq N_G(H)$. (Hint: $Z(G)$ is nontrivial. First dispose of the case in
which $H$ does not contain $Z(G)$, and then use induction to deal with the case
in which $H$ does contain $Z(G)$.) Deduce that every Sylow subgroup of a finite
nilpotent group is normal¹⁵. (Use Exercise 2.13.)

3.13. ¬ For a group $G$, let $G^{(i)}$ denote the iterated commutator, as in §3.3. Prove
that each $G^{(i)}$ is characteristic (hence normal) in $G$. [3.14]

3.14. Let $H$ be a nontrivial normal subgroup of a solvable group $G$.

• Prove that $H$ contains a nontrivial commutative subgroup that is normal in $G$.
(Hint: Let $r$ be the largest index such that $K = H \cap G^{(r)}$ is nontrivial. Prove
that $K$ is commutative, and use Exercise 3.13 to show it is normal in $G$.)

• Find an example showing that $H$ need not intersect the center of $G$ nontrivially
(cf. Exercise 3.11).

3.15. Let $p$, $q$ be prime integers, and let $G$ be a group of order $p^2q$. Prove that $G$
is solvable. (This is a particular case of Burnside’s theorem: for $p$, $q$ primes, every
group of order $p^aq^b$ is solvable.)

3.16. ▷ Prove that every group of order $< 120$ and $\ne 60$ is solvable. [§4.4, §VII.7.4]

3.17. Prove that the Feit-Thompson theorem is equivalent to the assertion that 
every noncommutative finite simple group has even order. 


4. The symmetric group 


4.1. Cycle notation. It is time to give a second look at symmetric groups. Recall
that $S_n$ denotes the group of permutations (i.e., automorphisms in Set) of the set
$\{1, \ldots, n\}$. In §II.2 we denoted elements of $S_n$ in a straightforward but inconvenient
way:

\[\sigma = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ 8 & 1 & 2 & 7 & 5 & 3 & 4 & 6 \end{pmatrix}\]

would stand for the element in $S_8$ sending 1 to 8, 2 to 1, etc.

There is clearly “too much” information here (the first row should be implicit),
and at the same time it seems hard to find out anything interesting about a
permutation from this notation. For example, can the reader say anything about the
conjugates of $\sigma$ in $S_8$? For maximal enlightenment, try to do Exercise 4.1 now, and
then try again after absorbing the material in §4.2. In short, we should be able to
do better.


As often is the case, thinking in terms of actions helps. By its very definition,
the group $S_n$ acts on the set $\{1, \ldots, n\}$; so does every subgroup of $S_n$. Given a
permutation $\sigma \in S_n$, consider the cyclic group $\langle\sigma\rangle$ generated by $\sigma$ and its action on
$\{1, \ldots, n\}$. The orbits of this action form a partition of $\{1, \ldots, n\}$; therefore, every
$\sigma \in S_n$ determines a partition of $\{1, \ldots, n\}$. For example, the element $\sigma \in S_8$ given
above splits $\{1, \ldots, 8\}$ into three orbits:

\[\{1, 2, 3, 6, 8\}, \quad \{4, 7\}, \quad \{5\}.\]

The action of $\langle\sigma\rangle$ is transitive on each orbit. This means that one can get from
any element of the orbit to any other element and then back to the original one by
applying $\sigma$ enough times. In the example,

\[1 \mapsto 8 \mapsto 6 \mapsto 3 \mapsto 2 \mapsto 1, \quad 4 \mapsto 7 \mapsto 4, \quad 5 \mapsto 5.\]

15 This property characterizes finite nilpotent groups; cf. Exercise 5.1.

Definition 4.1. A (nontrivial) cycle is an element of $S_n$ with exactly one nontrivial
orbit. For distinct $a_1, \ldots, a_r$ in $\{1, \ldots, n\}$, the notation

\[(a_1 a_2 \ldots a_r)\]

denotes the cycle in $S_n$ with nontrivial orbit $\{a_1, \ldots, a_r\}$, acting as

\[a_1 \mapsto a_2 \mapsto \cdots \mapsto a_r \mapsto a_1.\]

In this case, $r$ is the length of the cycle. A cycle of length $r$ is called an $r$-cycle. ⌟

The identity is considered a cycle of length 1 in a trivial way and is denoted by
(1) (and could just as well be denoted by $(i)$ for any $i$).

Note that $(a_1 a_2 \ldots a_r) = (a_2 \ldots a_r a_1)$ according to the notation introduced
in Definition 4.1: the notation determines the cycle, but a nontrivial cycle only
determines the notation ‘up to a cyclic permutation’.

Two cycles are disjoint if their nontrivial orbits are. The following observation
deserves to be highlighted, but it does not seem to deserve a proof:

Lemma 4.2. Disjoint cycles commute. 

The next one gives us the alternative notation we were looking for. 

Lemma 4.3. Every $\sigma \in S_n$, $\sigma \ne e$, can be written as a product of disjoint nontrivial
cycles, in a unique way up to permutations of the factors.

Proof. As we have seen, every $\sigma \in S_n$ determines a partition of $\{1, \ldots, n\}$ into
orbits under the action of $\langle\sigma\rangle$. If $\sigma \ne e$, then $\langle\sigma\rangle$ has nontrivial orbits. As $\sigma$ acts
as a cycle on each orbit, it follows that $\sigma$ may be written as a product of cycles.
The proof of the uniqueness is left to the reader (Exercise 4.2). □

The cycle notation for $\sigma \in S_n$ is the (essentially) unique expression of $\sigma$ as a
product of disjoint cycles found in Lemma 4.3 (or (1) for $\sigma = e$). In our running
example,

\[\sigma = (18632)(47),\]

and keep in mind that this expression is unique cum grano salis: we could write

\[\sigma = (63218)(47) = (74)(21863) = (32186)(74) = \cdots,\]

and all of these are ‘the same’ cycle notation for $\sigma$.
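The passage from the two-row notation to the cycle notation amounts to following orbits, and this is easily mechanized. What follows is a minimal sketch under our own conventions: a permutation is stored as a dictionary sending each element to its image, the function name `cycles` is hypothetical, and each cycle is written starting from its smallest element (one of the many equivalent ways to write it).

```python
def cycles(perm):
    """Nontrivial cycles of perm (a dict i -> image of i), each starting at its smallest element."""
    seen, result = set(), []
    for start in sorted(perm):
        if start in seen:
            continue
        cyc, x = [], start
        while x not in seen:        # follow the orbit of start under perm
            seen.add(x)
            cyc.append(x)
            x = perm[x]
        if len(cyc) > 1:            # fixed points are omitted from the notation
            result.append(tuple(cyc))
    return result

# The running example: second row 8 1 2 7 5 3 4 6.
sigma = dict(zip(range(1, 9), [8, 1, 2, 7, 5, 3, 4, 6]))
print(cycles(sigma))   # [(1, 8, 6, 3, 2), (4, 7)]
```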
4.2. Type and conjugacy classes in $S_n$. The cycle notation has obviously
annoying features, such as the not-too-unique uniqueness pointed out a moment ago,
or the fact that

\[(123)\]

can be an element of $S_3$ just as well as of $S_{100000}$, and only the context can tell.
However, it is invaluable as it gives easy access to quite a bit of important
information on a given permutation. In fact, much of this information is carried already
by something even simpler than the cycle decomposition.

A partition of an integer $n > 0$ is a nonincreasing¹⁶ sequence of positive integers
whose sum is $n$. It is easy to enumerate partitions for small values of $n$. For example,
5 has 7 distinct partitions:

\[\begin{aligned}
5 &= 1 + 1 + 1 + 1 + 1 \\
  &= 2 + 1 + 1 + 1 \\
  &= 2 + 2 + 1 \\
  &= 3 + 1 + 1 \\
  &= 3 + 2 \\
  &= 4 + 1 \\
  &= 5.
\end{aligned}\]
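For larger values of n the enumeration is best left to a machine. Here is a short recursive sketch; the generator name `partitions` and its optional argument `largest` are our own choices. It produces the partitions as nonincreasing lists, choosing the first part and then partitioning what remains into parts no larger than it.

```python
def partitions(n, largest=None):
    """Yield the partitions of n as nonincreasing lists of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield []
        return
    for first in range(min(n, largest), 0, -1):
        # every remaining part must be at most `first`
        for rest in partitions(n - first, first):
            yield [first] + rest

for lam in partitions(5):
    print(lam)
print(len(list(partitions(5))))   # 7
```

The loop prints the seven partitions of 5 listed above, in the opposite order (from [5] down to [1, 1, 1, 1, 1]).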

The partition $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_r$ may be denoted

\[[\lambda_1, \ldots, \lambda_r];\]

for example, the fourth partition listed above would be denoted [3, 1, 1]. A nicer
‘visual’ representation is by means of the corresponding Young (or Ferrers) diagram,
obtained by stacking $\lambda_1$ boxes on top of $\lambda_2$ boxes on top of $\lambda_3$ boxes on top of....
For example, the diagrams corresponding to the seven partitions listed above are

[Young diagrams for the partitions [1,1,1,1,1], [2,1,1,1], [2,2,1], [3,1,1], [3,2], [4,1], [5]]



Definition 4.4. The type of $\sigma \in S_n$ is the partition of $n$ given by the sizes of the
orbits of the action of $\langle\sigma\rangle$ on $\{1, \ldots, n\}$. ⌟

It is hopefully clear (from the argument proving Lemma 4.3) that the type of
$\sigma \in S_n$ is simply given by the lengths of the cycles in the decomposition of $\sigma$ as
the product of disjoint cycles, together with as many 1’s as needed. In our running
example,

\[\sigma = (18632)(47) \in S_8\]

has type [5, 2, 1]:

[Young diagram of the partition [5, 2, 1]]

16 Of course this choice is arbitrary, and nondecreasing sequences would do just as well.
The main reason why ‘types’ are introduced is a consequence of the following
simple observation.

Lemma 4.5. Let $\tau \in S_n$, and let $(a_1 \ldots a_r)$ be a cycle. Then

\[\tau(a_1 \ldots a_r)\tau^{-1} = (a_1\tau^{-1} \ldots a_r\tau^{-1}).\]

The funny notation $a_1\tau^{-1}$ stands for the action of the permutation $\tau^{-1}$ on
$a_1 \in \{1, \ldots, n\}$; recall that we agreed in §II.2.1 that we would let our permutations
act on the right, for consistency with the usual notation for products in groups.

Proof. This is verified by checking that both sides act in the same way on $\{1, \ldots, n\}$.
For example, for $1 \le i < r$

\[(a_i\tau^{-1})(\tau(a_1 \ldots a_r)\tau^{-1}) = a_i(a_1 \ldots a_r)\tau^{-1} = a_{i+1}\tau^{-1}\]

as it should; the other cases are left to the reader. □

By the usual trick of judiciously inserting identity factors $\tau^{-1}\tau$, this formula
for computing conjugates extends immediately to any product of cycles:

\[\tau(a_1 \ldots a_r) \cdots (b_1 \ldots b_s)\tau^{-1} = (a_1\tau^{-1} \ldots a_r\tau^{-1}) \cdots (b_1\tau^{-1} \ldots b_s\tau^{-1}).\]

This holds whether the cycles are disjoint or not. However, since $\tau$ is a bijection,
disjoint cycles remain disjoint after conjugation. This is essentially all there is to
the following important observation:

Proposition 4.6. Two elements of $S_n$ are conjugate in $S_n$ if and only if they have
the same type.


Proof. The ‘only if’ part of this statement follows immediately from the preceding
considerations: conjugating a permutation yields a permutation of the same type.
As for the ‘if’ part, suppose

\[\sigma_1 = (a_1 \ldots a_r)(b_1 \ldots b_s) \cdots (c_1 \ldots c_t)\]

and

\[\sigma_2 = (a_1' \ldots a_r')(b_1' \ldots b_s') \cdots (c_1' \ldots c_t')\]

are two permutations with the same type, written in cycle notation, with $r \ge s \ge
\cdots \ge t$ (so the type is $[r, s, \ldots, t]$). Let $\tau$ be any permutation such that $a_i = a_i'\tau$,
$b_j = b_j'\tau$, ..., $c_k = c_k'\tau$ for all $i, j, \ldots, k$. Then Lemma 4.5 implies $\sigma_2 = \tau\sigma_1\tau^{-1}$,
so $\sigma_1$ and $\sigma_2$ are conjugate, as needed. □


Example 4.7. In $S_8$,

\[(18632)(47) \quad \text{and} \quad (12345)(67)\]

must be conjugate, since they have the same type. The proof of Proposition 4.6
tells us that

\[\tau(18632)(47)\tau^{-1} = (12345)(67)\]

for

\[\tau = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ 1 & 8 & 6 & 3 & 2 & 4 & 7 & 5 \end{pmatrix},\]

and of course this may be checked by hand in a second. Running this check, and
especially staring at the second row in $\tau$ vis-à-vis the cycle notation of the first
permutation, should clarify everything. ⌟
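The check can also be delegated to a few lines of code. The sketch below makes its conventions explicit, since they matter here: permutations are dictionaries on {1, ..., 8}, and the product `mul(s, t)` follows the right-action convention of the text (apply s first, then t). The helper names `from_cycles`, `mul`, and `inv` are ours.

```python
def from_cycles(cycs, n):
    """Permutation of {1..n} (as a dict) given by a product of disjoint cycles."""
    perm = {i: i for i in range(1, n + 1)}
    for cyc in cycs:
        for k, a in enumerate(cyc):
            perm[a] = cyc[(k + 1) % len(cyc)]
    return perm

def mul(s, t):
    """Product st under the right-action convention: i(st) = (i s) t."""
    return {i: t[s[i]] for i in s}

def inv(s):
    """Inverse permutation."""
    return {j: i for i, j in s.items()}

sigma1 = from_cycles([(1, 8, 6, 3, 2), (4, 7)], 8)
sigma2 = from_cycles([(1, 2, 3, 4, 5), (6, 7)], 8)

# Second row of tau: the entries of (18632)(47)(5), read off in order.
tau = dict(zip(range(1, 9), [1, 8, 6, 3, 2, 4, 7, 5]))

print(mul(mul(tau, sigma1), inv(tau)) == sigma2)   # True
```

Under the opposite (left-action) convention for products, the same τ would in general conjugate in the other direction, so the order of composition in `mul` is the one thing to get right.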

Summarizing, the type (or the corresponding Young diagram) tells us everything
about conjugation in $S_n$.

Corollary 4.8. The number of conjugacy classes in $S_n$ equals the number of
partitions of $n$.

For example, there are 7 conjugacy classes in $S_5$, indexed by the Tetris
look-alikes drawn above.

It is also reasonably straightforward to compute the number of elements in each
conjugacy class, in terms of the type. For example, in order to count the number
of permutations of type [2, 2, 1] in $S_5$, note that there are 5! = 120 ways to fill the
corresponding Young diagram with the numbers 1, ..., 5:

[Young diagram of type [2, 2, 1], with rows $(a_1, a_2)$, $(a_3, a_4)$, $(a_5)$]

that is, 120 ways to write a permutation as a product of two 2-cycles:

\[(a_1 a_2)(a_3 a_4);\]

but switching $a_1 \leftrightarrow a_2$ and $a_3 \leftrightarrow a_4$, as well as switching the two cycles, gives the
same permutation. Therefore there are

\[\frac{120}{2 \cdot 2 \cdot 2} = 15\]

permutations of type [2, 2, 1]. Performing this computation for all Young diagrams
for $S_5$ gives us the size of each conjugacy class, that is, the class formula (cf. §1.2)
for $S_5$:

\[120 = 1 + 10 + 15 + 20 + 20 + 30 + 24.\]
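The same counting argument works for every type: each cycle of length k can be written in k ways, and m cycles of the same length can be listed in m! orders. The following sketch turns this into a formula and applies it to all partitions of 5. The function names `partitions` and `class_size` are ours, and the general formula is our own packaging of the argument above rather than something stated in the text.

```python
from collections import Counter
from math import factorial

def partitions(n, largest=None):
    """Yield the partitions of n as nonincreasing lists of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield []
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield [first] + rest

def class_size(lam):
    """Number of permutations of type lam: n! / prod over k of (k^m * m!),
    where m is the number of parts of lam equal to k."""
    denom = 1
    for k, m in Counter(lam).items():
        denom *= k**m * factorial(m)
    return factorial(sum(lam)) // denom

sizes = {tuple(lam): class_size(lam) for lam in partitions(5)}
print(sizes[(2, 2, 1)])       # 15
print(sum(sizes.values()))    # 120
```

For type [2, 2, 1] the denominator is 2² · 2! · 1 = 8, matching the count 120/(2 · 2 · 2) obtained by hand.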


Example 4.9. There are no normal subgroups of size 30 in $S_5$.

Indeed, normal subgroups are unions of conjugacy classes (§1.3); since the
identity is in every subgroup and 30 − 1 = 29 cannot be written as a sum of the
numbers appearing in the class formula for $S_5$, there is no such subgroup. ⌟


This observation will be dwarfed by much stronger results that we will prove 
soon (such as Theorem 4.20, Corollary 4.21); but it is remarkable that such precise 
statements can be established with so little work. 
4.3. Transpositions, parity, and the alternating group. For $n \ge 1$, consider
the polynomials

\[\Delta_n = \prod_{1 \le i < j \le n} (x_i - x_j) \in \mathbb{Z}[x_1, \ldots, x_n],\]

that is,

\[\begin{aligned}
\Delta_1 &= 1, \\
\Delta_2 &= x_1 - x_2, \\
\Delta_3 &= (x_1 - x_2)(x_1 - x_3)(x_2 - x_3), \\
\Delta_4 &= (x_1 - x_2)(x_1 - x_3)(x_1 - x_4)(x_2 - x_3)(x_2 - x_4)(x_3 - x_4),
\end{aligned}\]

We can act with any $\sigma \in S_n$ on $\Delta_n$, by permuting the indices according to $\sigma$:

\[\Delta_n\sigma := \prod_{1 \le i < j \le n} (x_{i\sigma} - x_{j\sigma}).\]

For example,

\[\Delta_4(1234) = (x_2 - x_3)(x_2 - x_4)(x_2 - x_1)(x_3 - x_4)(x_3 - x_1)(x_4 - x_1) = -\Delta_4.\]

In general, it is clear that $\Delta_n\sigma$ is still the product of all binomials $(x_i - x_j)$, where
the factors are permuted and some factors may change sign in the process. Hence,
$\Delta_n\sigma = \pm\Delta_n$, where the factor $\pm 1$ depends on $\sigma$.

Definition 4.10. The sign of a permutation $\sigma \in S_n$, denoted $(-1)^\sigma$, is determined
by the action of $\sigma$ on $\Delta_n$:

\[\Delta_n\sigma = (-1)^\sigma \Delta_n.\]

We say that a permutation is even if its sign is +1 and odd if its sign is −1. ⌟

Note that $\forall \sigma, \tau \in S_n$ we have

\[\Delta_n(\sigma\tau) = (\Delta_n\sigma)\tau:\]

it follows that $(-1)^{\sigma\tau} = (-1)^\sigma (-1)^\tau$. Viewing $\{-1, +1\}$ as a group under
multiplication¹⁷, we see that the ‘sign’ function

\[\epsilon : S_n \to \{-1, +1\}, \quad \epsilon(\sigma) := (-1)^\sigma\]

is a homomorphism.
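A factor of the product defining the action changes sign exactly when the two indices end up in the wrong order, so the sign may be computed by counting such pairs. The sketch below does this and checks multiplicativity on a small symmetric group. It assumes permutations of {0, ..., n-1} stored as tuples and multiplied in right-action order; the names `sign` and `mul` are hypothetical.

```python
from itertools import combinations, permutations

def sign(s):
    """Sign of s: -1 raised to the number of pairs i < j whose images are in the wrong order.
    Each such pair is a factor of the defining product that changes sign under s."""
    flipped = sum(1 for i, j in combinations(range(len(s)), 2) if s[i] > s[j])
    return -1 if flipped % 2 else 1

def mul(s, t):
    """Product st, acting on the right: apply s first, then t."""
    return tuple(t[i] for i in s)

S4 = list(permutations(range(4)))

# The 4-cycle (1234) of the text, shifted to 0-indexing: 0 -> 1 -> 2 -> 3 -> 0.
print(sign((1, 2, 3, 0)))   # -1

# The sign function is multiplicative on all of S_4.
print(all(sign(mul(s, t)) == sign(s) * sign(t) for s in S4 for t in S4))   # True
```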

Here is a different viewpoint on this sign function. A transposition is a cycle of 
length 2. Every permutation is a product of transpositions: 

Lemma 4.11. Transpositions generate $S_n$.

Proof. Indeed, by Lemma 4.3 it suffices to show that every cycle is a product of
transpositions, and indeed

\[(a_1 \ldots a_r) = (a_1 a_2)(a_1 a_3) \cdots (a_1 a_r),\]

as may be checked by applying¹⁸ both sides to every element of $\{1, \ldots, n\}$. □


17 This is the group of units in $\mathbb{Z}$; of course it is isomorphic to $C_2$.
18 Don’t forget that permutations act on the right.
Of course a given permutation may be written as a product of transpositions
in many different ways. However, whether an even number of transpositions or an
odd number is needed only depends on $\sigma$. Indeed,

Lemma 4.12. Let $\sigma = \tau_1 \cdots \tau_r$ be a product of transpositions. Then $\sigma$ is even,
resp., odd, according to whether $r$ is even, resp., odd.

Proof. This follows immediately from the facts that $\epsilon$ is a homomorphism and the
sign of a transposition is −1: indeed, $(ij)$ acts on $\Delta_n$ by permuting its factors and
changing the sign of exactly one factor (namely, the factor $\pm(x_i - x_j)$). □

Definition 4.13. The alternating group on $\{1, \ldots, n\}$, denoted $A_n$, consists of all
even permutations $\sigma \in S_n$. ⌟

The alternating group is a normal subgroup of $S_n$, and

\[[S_n : A_n] = 2\]

for $n \ge 2$: indeed, $A_n = \ker \epsilon$, and $\epsilon$ is surjective for $n \ge 2$.

It is very easy to tell whether a permutation $\sigma$ belongs to $A_n$ in terms of the
type of $\sigma$. Indeed (by the argument proving Lemma 4.11) a cycle is even, resp., odd,
if it has odd, resp., even, length¹⁹. Pictorially, a permutation $\sigma \in S_n$ is even if and
only if $n$ and the number of rows in the Young diagram of $\sigma$ have the same parity.
Take $n = 5$ for example:

[Young diagrams for the seven partitions of 5, with [1,1,1,1,1], [2,2,1], [3,1,1], and [5] starred]

The starred types correspond to even permutations. Adding up the sizes of the
corresponding conjugacy classes,

\[1 + 15 + 20 + 24 = 60,\]

confirms that $A_5$ has index 2 in $S_5$.

However, do not read too much into this computation: we are not claiming
that these are the sizes of the conjugacy classes in $A_5$, only that these are the
conjugacy classes in $S_5$ making up the normal subgroup $A_5$. Conjugacy in $A_n$ is
rather interesting and ties in nicely with the issue of the simplicity of $A_n$.

4.4. Conjugacy in $A_n$; simplicity of $A_n$ and solvability of $S_n$. Denote by
$[\sigma]_{S_n}$, resp., $[\sigma]_{A_n}$, the conjugacy class of an even permutation $\sigma$ in $S_n$, resp., $A_n$.
Clearly $[\sigma]_{A_n} \subseteq [\sigma]_{S_n}$; we proceed to compare these two sets.

Lemma 4.14. Let $n \ge 2$, and let $\sigma \in A_n$. Then $[\sigma]_{A_n} = [\sigma]_{S_n}$ or the size of $[\sigma]_{A_n}$
is half the size of $[\sigma]_{S_n}$, according to whether the centralizer $Z_{S_n}(\sigma)$ is not or is
contained in $A_n$.

19 Note the unfortunate terminology clash. For example, 3-cycles are even permutations.
Proof. (Cf. Exercise 1.16.) Note that

\[Z_{A_n}(\sigma) = A_n \cap Z_{S_n}(\sigma):\]

this follows immediately from the definition of centralizer (Definition 1.6). Now
recall that the centralizer of $\sigma$ is its stabilizer under conjugation, and therefore the
size of the conjugacy class of $\sigma$ equals the index of its centralizer.

If $Z_{S_n}(\sigma) \subseteq A_n$, then $Z_{A_n}(\sigma) = Z_{S_n}(\sigma)$, so that

\[[S_n : Z_{S_n}(\sigma)] = [S_n : Z_{A_n}(\sigma)] = [S_n : A_n] \cdot [A_n : Z_{A_n}(\sigma)] = 2 \cdot [A_n : Z_{A_n}(\sigma)];\]

therefore, $[\sigma]_{A_n}$ is half the size of $[\sigma]_{S_n}$ in this case.

If $Z_{S_n}(\sigma) \not\subseteq A_n$, then note that $A_n Z_{S_n}(\sigma) = S_n$: indeed, $A_n Z_{S_n}(\sigma)$ is a
subgroup of $S_n$ (because $A_n$ is normal; cf. Proposition II.8.11), and it properly contains
$A_n$, so it must equal $S_n$ as $A_n$ has index 2 in $S_n$. Applying the second isomorphism
theorem (Proposition II.8.11) gives

\[[A_n : Z_{A_n}(\sigma)] = [A_n : A_n \cap Z_{S_n}(\sigma)] = [A_n Z_{S_n}(\sigma) : Z_{S_n}(\sigma)] = [S_n : Z_{S_n}(\sigma)],\]

so the classes have the same size. Since $[\sigma]_{A_n} \subseteq [\sigma]_{S_n}$ in any case, it follows that
$[\sigma]_{A_n} = [\sigma]_{S_n}$, completing the proof. □

Therefore, conjugacy classes of even permutations either are preserved from $S_n$
to $A_n$ or they split into two distinct, equal-sized classes. We are now in a position
to give precise conditions determining which happens when.

Proposition 4.15. Let $\sigma \in A_n$, $n \ge 2$. Then the conjugacy class of $\sigma$ in $S_n$ splits
into two conjugacy classes in $A_n$ precisely if the type of $\sigma$ consists of distinct odd
numbers.


Proof. By Lemma 4.14, we have to verify that $Z_{S_n}(\sigma)$ is contained in $A_n$ precisely
when the stated condition is satisfied; that is, we have to show that

\[\sigma = \tau\sigma\tau^{-1} \implies \tau \text{ is even}\]

precisely when the type of $\sigma$ consists of distinct odd numbers.

Write $\sigma$ in cycle notation (including cycles of length 1):

\[\sigma = (a_1 \ldots a_\lambda)(b_1 \ldots b_\mu) \cdots (c_1 \ldots c_\nu),\]

and recall (Lemma 4.5) that

\[\tau\sigma\tau^{-1} = (a_1\tau^{-1} \ldots a_\lambda\tau^{-1})(b_1\tau^{-1} \ldots b_\mu\tau^{-1}) \cdots (c_1\tau^{-1} \ldots c_\nu\tau^{-1}).\]

Assume that $\lambda, \mu, \ldots, \nu$ are odd and distinct. If $\tau\sigma\tau^{-1} = \sigma$, then conjugation
by $\tau$ must preserve each cycle in $\sigma$, as all cycle lengths are distinct:

\[\tau(a_1 \ldots a_\lambda)\tau^{-1} = (a_1 \ldots a_\lambda), \text{ etc.,}\]

that is,

\[(a_1\tau^{-1} \ldots a_\lambda\tau^{-1}) = (a_1 \ldots a_\lambda), \text{ etc.}\]

This means that $\tau$ acts as a cyclic permutation on (e.g.) $a_1, \ldots, a_\lambda$ and therefore
in the same way as a power of $(a_1 \ldots a_\lambda)$. It follows that

\[\tau = (a_1 \ldots a_\lambda)^r (b_1 \ldots b_\mu)^s \cdots (c_1 \ldots c_\nu)^t\]
for suitable $r, s, \ldots, t$. Since all cycles have odd lengths, each cycle is an even
permutation; and $\tau$ must then be even as it is a product of even permutations.
This proves that $Z_{S_n}(\sigma) \subseteq A_n$ if the stated condition holds.

Conversely, assume that the stated condition does not hold: that is, either some
of the cycles in the cycle decomposition have even length or all have odd length but
two of the cycles have the same length.

In the first case, let $\tau$ be an even-length cycle in the cycle decomposition of $\sigma$.
Note that $\tau\sigma\tau^{-1} = \sigma$: indeed, $\tau$ commutes with itself and with all cycles in $\sigma$ other
than $\tau$. Since $\tau$ has even length, then it is odd as a permutation: this shows that
$Z_{S_n}(\sigma) \not\subseteq A_n$, as needed.

In the second case, without loss of generality assume $\lambda = \mu$, and consider the
odd permutation

\[\tau = (a_1 b_1)(a_2 b_2) \cdots (a_\lambda b_\lambda):\]

conjugating by $\tau$ simply interchanges the first two cycles in $\sigma$; hence $\tau\sigma\tau^{-1} = \sigma$.
As $\tau$ is odd, this again shows that $Z_{S_n}(\sigma) \not\subseteq A_n$, and we are done. □

Example 4.16. Looking again at $A_5$, we have noted in §4.3 that the types of
the even permutations in $S_5$ are [1, 1, 1, 1, 1], [2, 2, 1], [3, 1, 1], and [5]. By
Proposition 4.15 the conjugacy classes corresponding to the first three types are preserved
in $A_5$, while the last one splits.

Therefore there are exactly 5 conjugacy classes in $A_5$, and the class formula for
$A_5$ is

\[60 = 1 + 15 + 20 + 12 + 12.\] ⌟
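The class formula can be confirmed by brute force, conjugating every element by every other. This is a sketch under our own conventions (permutations of {0, ..., 4} as tuples, products in right-action order, parity computed by counting out-of-order pairs); the helper names are hypothetical.

```python
from itertools import combinations, permutations

def mul(s, t):
    """Product st, acting on the right: apply s first, then t."""
    return tuple(t[i] for i in s)

def inv(s):
    """Inverse permutation."""
    r = [0] * len(s)
    for i, j in enumerate(s):
        r[j] = i
    return tuple(r)

def is_even(s):
    """True if s has an even number of pairs i < j with s[i] > s[j]."""
    flipped = sum(1 for i, j in combinations(range(len(s)), 2) if s[i] > s[j])
    return flipped % 2 == 0

def class_sizes(G):
    """Sorted sizes of the conjugacy classes of the group G (a list of permutations)."""
    remaining, sizes = set(G), []
    while remaining:
        x = next(iter(remaining))
        cls = {mul(mul(g, x), inv(g)) for g in G}   # all conjugates of x by elements of G
        sizes.append(len(cls))
        remaining -= cls
    return sorted(sizes)

S5 = list(permutations(range(5)))
A5 = [s for s in S5 if is_even(s)]

print(class_sizes(S5))   # [1, 10, 15, 20, 20, 24, 30]
print(class_sizes(A5))   # [1, 12, 12, 15, 20]
```

The class of size 24 in the full symmetric group is the only one that splits (into two classes of size 12) when conjugation is restricted to even permutations.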

Finally! We can now complete a circle of thought begun in the first section of
this chapter. Along the way, the reader has hopefully checked that every simple
group of order < 60 is commutative (Exercise 2.24); and we now see why 60 is
special:

Corollary 4.17. The alternating group $A_5$ is a simple noncommutative group of
order 60.
Proof. A normal subgroup of $A_5$ is necessarily the union of conjugacy classes,
contains the identity, and has order equal to a divisor of 60 (by Lagrange’s theorem).
The divisors of 60 other than 1 and 60 are

2, 3, 4, 5, 6, 10, 12, 15, 20, 30;

counting the elements other than the identity would give one of

1, 2, 3, 4, 5, 9, 11, 14, 19, 29

as a sum of numbers $\neq 1$ from the class formula for $A_5$. But this simply does not
happen. □
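The claim that this "simply does not happen" is a finite check. A short sketch, assuming only the class sizes 15, 20, 12, 12 quoted in the class formula (the variable names are illustrative):

```python
from itertools import combinations

nonidentity_class_sizes = [15, 20, 12, 12]

# Orders a proper nontrivial normal subgroup could have, by Lagrange's theorem.
divisors = [d for d in range(2, 60) if 60 % d == 0]
# Number of non-identity elements such a subgroup would contain.
targets = {d - 1 for d in divisors}

# Every sum obtainable from a nonempty selection of non-identity classes.
achievable = {
    sum(subset)
    for r in range(1, len(nonidentity_class_sizes) + 1)
    for subset in combinations(nonidentity_class_sizes, r)
}

bad = sorted(achievable & targets)
print(bad)  # [] : no union of classes has an admissible size
```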

The reader will check that $A_6$ is simple, by the same method (Exercise 4.21).
It is in fact the case that all groups $A_n$, $n \ge 5$, are simple (and noncommutative),
implying that $S_n$ is not solvable for $n \ge 5$, and these facts are rather important in
view of applications to Galois theory (cf., e.g., Corollary VII.7.16). Note that $A_2$
is trivial, $A_3 \cong \mathbb{Z}/3\mathbb{Z}$ is simple and abelian, and $A_4$ is not simple (Exercise 2.24).

The alternating group $A_5$ is also called the icosahedral (rotation) group: indeed,
it is the group of symmetries of an icosahedron obtained through rigid motions.
(Can the reader verify$^{20}$ this fact?)

The simplicity of $A_n$ for $n \ge 5$ may be established by studying 3-cycles. First
of all, it is natural to wonder whether every even permutation may be written as a
product of 3-cycles, and this is indeed so:

Lemma 4.18. The alternating group $A_n$ is generated by 3-cycles.

Proof. Since every even permutation is a product of an even number of 2-cycles,
it suffices to show that every product of two 2-cycles may be written as a product of
3-cycles. Therefore, consider a product

$$(ab)(cd)$$

with $a \neq b$, $c \neq d$. If $(ab) = (cd)$, then this product is the identity, and there is
nothing to prove. If $\{a, b\}$, $\{c, d\}$ have exactly one element in common, then we
may assume $c = a$ and observe

$$(ab)(ad) = (abd).$$

If $\{a, b\}$, $\{c, d\}$ are disjoint, then

$$(ab)(cd) = (abc)(adc),$$

and we are done. □
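Both identities depend on the convention for multiplying permutations; as written they hold when products are applied left to right. A small sketch checking them under that assumption (the `cycle` and `compose` helpers are illustrative, and the points a, b, c, d are taken to be 0, 1, 2, 3):

```python
def cycle(n, *points):
    """The cycle (points[0] points[1] ...) as a tuple of images on {0, ..., n-1}."""
    images = list(range(n))
    for i, a in enumerate(points):
        images[a] = points[(i + 1) % len(points)]
    return tuple(images)

def compose(p, q):
    """Apply p first, then q (left-to-right composition)."""
    return tuple(q[p[i]] for i in range(len(p)))

n = 4
a, b, c, d = 0, 1, 2, 3

# One element in common: (ab)(ad) = (abd).
one_common = compose(cycle(n, a, b), cycle(n, a, d)) == cycle(n, a, b, d)

# Disjoint transpositions: (ab)(cd) = (abc)(adc).
disjoint = (compose(cycle(n, a, b), cycle(n, c, d))
            == compose(cycle(n, a, b, c), cycle(n, a, d, c)))

print(one_common, disjoint)  # True True
```

With right-to-left composition the first product would instead equal $(adb)$, so the convention matters when reproducing these computations by hand.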

Now we can capitalize on our study of conjugacy in $A_n$:

Claim 4.19. Let $n \ge 5$. If a normal subgroup of $A_n$ contains a 3-cycle, then it
contains all 3-cycles.

Proof. Normal subgroups are unions of conjugacy classes, so we just need to verify
that 3-cycles form a conjugacy class in $A_n$, for $n \ge 5$. But they do in $S_n$, and the
type of a 3-cycle is $[3, 1, 1, \dots]$ for $n \ge 5$; hence the conjugacy class does not split
in $A_n$, by Proposition 4.15. □
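The hypothesis $n \ge 5$ is genuinely needed: for $n = 3, 4$ the type of a 3-cycle consists of distinct odd numbers, so the class does split. The sketch below counts, by brute force, the $A_n$-conjugacy classes of 3-cycles for small $n$. It assumes permutations are tuples of images composed left to right; all function names are illustrative.

```python
from itertools import permutations

def compose(p, q):
    """Apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def is_even(p):
    n = len(p)
    return sum(p[i] > p[j] for i in range(n) for j in range(i + 1, n)) % 2 == 0

def is_three_cycle(p):
    """A permutation moving exactly three points is necessarily a 3-cycle."""
    return sum(1 for i, image in enumerate(p) if i != image) == 3

def classes_of_three_cycles(n):
    """Number of A_n-conjugacy classes into which the 3-cycles fall."""
    A_n = [p for p in permutations(range(n)) if is_even(p)]
    remaining = {p for p in A_n if is_three_cycle(p)}
    count = 0
    while remaining:
        x = next(iter(remaining))
        remaining -= {compose(compose(g, x), inverse(g)) for g in A_n}
        count += 1
    return count

split_counts = {n: classes_of_three_cycles(n) for n in (3, 4, 5, 6)}
print(split_counts)  # {3: 2, 4: 2, 5: 1, 6: 1}
```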

The general statement now follows easily by tying up loose ends:

Theorem 4.20. The alternating group $A_n$ is simple for $n \ge 5$.

Proof. We have already checked this for $n = 5$, and the reader has checked it for
$n = 6$. For $n > 6$, let $N$ be a nontrivial normal subgroup of $A_n$; we will show that
necessarily $N = A_n$, by proving that $N$ contains 3-cycles.

Let $\tau \in N$, $\tau \neq (1)$, and let $\sigma \in A_n$ be a 3-cycle. Since the center of $A_n$ is
trivial (Exercise 4.14) and 3-cycles generate $A_n$, we may assume that $\tau$ and $\sigma$ do
not commute, that is, the commutator

$$[\tau, \sigma] = \tau(\sigma\tau^{-1}\sigma^{-1}) = (\tau\sigma\tau^{-1})\sigma^{-1}$$


$^{20}$ It is good practice to start with smaller examples: for instance, the tetrahedral rotation
group is isomorphic to $A_4$.

is not the identity. This element is in $N$ (as is evident from the first expression,
as $N$ is normal) and is a product of two 3-cycles (as is evident from the second
expression, since the conjugate of a 3-cycle is a 3-cycle).

Therefore, replacing $\tau$ by $[\tau, \sigma]$ if necessary, we may assume that $\tau \in N$ is
a nonidentity permutation acting on $\le 6$ elements: that is, on a subset of a set
$T \subseteq \{1, \dots, n\}$ with $|T| = 6$. Now we may view $A_6$ as a subgroup of $A_n$, by letting
it act on $T$. The subgroup $N \cap A_6$ of $A_6$ is then normal (because $N$ is normal) and
nontrivial (because $\tau \in N \cap A_6$ and $\tau \neq (1)$). Since $A_6$ is simple (Exercise 4.21),
this implies $N \cap A_6 = A_6$. In particular, $N$ contains 3-cycles.

By Claim 4.19, this implies that $N$ contains all 3-cycles. By Lemma 4.18, it
follows that $N = A_n$, as needed. □

Corollary 4.21. For $n \ge 5$, the group $S_n$ is not solvable.

Proof. Since $A_n$ is simple, the sequence

$$S_n \supsetneq A_n \supsetneq \{(1)\}$$

is a composition series for $S_n$. It follows that the composition factors of $S_n$ are
$\mathbb{Z}/2\mathbb{Z}$ and $A_n$. By Proposition 3.11, $S_n$ is not solvable. □

In particular, $S_5$ is a nonsolvable group of order 120. Apart from 60, the order of the
nonsolvable group $A_5$, this is in fact the smallest order of a nonsolvable group; cf. Exercise 3.16.


Exercises 


4.1. ▷ Compute the number of elements in the conjugacy class of

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ 8 & 1 & 2 & 7 & 5 & 3 & 4 & 6 \end{pmatrix}$$

in $S_8$. [§4.1]

4.2. ▷ Suppose

$$(a_1 \dots a_r)(b_1 \dots b_s) \cdots (c_1 \dots c_t) = (d_1 \dots d_u)(e_1 \dots e_v) \cdots (f_1 \dots f_w)$$

are two products of disjoint cycles. Prove that the factors agree up to order.

(Hint: The two corresponding partitions of $\{1, \dots, n\}$ must agree.) [§4.1]

4.3. Assume $\sigma$ has type $[\lambda_1, \dots, \lambda_r]$ and that the $\lambda_i$’s are pairwise relatively prime.
What is $|\sigma|$? What can you say about $|\sigma|$, without the additional hypothesis on
the numbers $\lambda_i$?


4.4. Make sense of the ‘Taylor series’ of the infinite product

$$\frac{1}{(1 - x)} \cdot \frac{1}{(1 - x^2)} \cdot \frac{1}{(1 - x^3)} \cdot \frac{1}{(1 - x^4)} \cdot \frac{1}{(1 - x^5)} \cdots$$

Prove that the coefficient of $x^n$ in this series is the number of partitions of $n$.

4.5. Find the class formula for $S_n$, $n \le 6$.

4.6. Let $N$ be a normal subgroup of $S_4$. Prove that $|N| = 1$, 4, 12, or 24.

4.7. ▷ Prove that $S_n$ is generated by $(12)$ and $(12 \dots n)$.

(Hint: It is enough to get all transpositions. What is the conjugate of $(12)$ by
$(12 \dots n)$?) [4.9, §VII.7.5]

4.8. ¬ For $n > 1$, prove that the subgroup $H$ of $S_n$ consisting of permutations
fixing 1 is isomorphic to $S_{n-1}$. Prove that there are no proper subgroups of $S_n$
properly containing $H$. [VII.7.17]

4.9. By Exercise 4.7, $S_4$ is generated by $(12)$ and $(1234)$. Prove that $(13)$ and
$(1234)$ generate a copy of $D_8$ in $S_4$. Prove that every subgroup of $S_4$ of order 8 is
conjugate to $\langle (13), (1234) \rangle$. Prove there are exactly 3 such subgroups. For all $n \ge 3$
prove that $S_n$ contains a copy of the dihedral group $D_{2n}$, and find generators for it.

4.10. ¬ • Prove that there are exactly $(n - 1)!$ $n$-cycles in $S_n$.

• More generally, find a formula for the size of the conjugacy class of a
permutation of given type in $S_n$. [4.11]

4.11. Let $p$ be a prime integer. Compute the number of $p$-Sylow subgroups of $S_p$.
(Use Exercise 4.10.) Use this result and Sylow’s third theorem to prove again the
‘only if’ implication in Wilson’s theorem (cf. Exercise II.4.16).

4.12. ▷ A subgroup $G$ of $S_n$ is transitive if the induced action of $G$ on $\{1, \dots, n\}$
is transitive.

• Prove that if $G \subseteq S_n$ is transitive, then $|G|$ is a multiple of $n$.

• List the transitive subgroups of $S_3$.

• Prove that the following subgroups of $S_4$ are all transitive:

- $\langle (1234) \rangle \cong C_4$ and its conjugates,

- $\langle (12)(34), (13)(24) \rangle \cong C_2 \times C_2$,

- $\langle (12)(34), (1234) \rangle \cong D_8$ and its conjugates,

- $A_4$, and $S_4$.

With a bit of stamina, you can prove that these are the only transitive subgroups
of $S_4$.

[§VII.7.5]

4.13. (If you know about determinants.) Prove that the sign of a permutation $\sigma$,
as defined in Definition 4.10, equals the determinant of the matrix $M_\sigma$ defined in
Exercise II.2.1.

4.14. ▷ Prove that the center of $A_n$ is trivial for $n \ge 4$. [§4.4]

4.15. Justify the ‘pictorial’ recipe given in §4.3 to decide whether a permutation
is even.

4.16. The number of conjugacy classes in $A_n$, $n \ge 2$, is (allegedly)

1, 3, 4, 5, 7, 9, 14, 18, 24, 31, 43, ...

Check the first several numbers in this list by finding the class formulas for the
corresponding alternating groups.

4.17. ▷ • Find the class formula for $A_4$.

• Use it to prove that $A_4$ has no subgroup of order 6. [§II.8.5]

4.18. For $n \ge 5$, let $H$ be a proper subgroup of $A_n$. Prove that $[A_n : H] \ge n$.
Prove that $A_n$ does have a subgroup of index $n$ for all $n \ge 3$.

4.19. Prove that for $n \ge 5$ there are no nontrivial actions of $A_n$ on any set $S$ with $|S| < n$.
Construct$^{21}$ a nontrivial action of $A_4$ on a set $S$, $|S| = 3$. Is there a nontrivial
action of $A_4$ on a set $S$ with $|S| = 2$?

4.20. ¬ Find all fifteen elements of order 2 in $A_5$, and prove that $A_5$ has exactly
five 2-Sylow subgroups. [4.22]

4.21. ▷ Prove that $A_6$ is simple, by using its class formula (as is done for $A_5$ in the
proof of Corollary 4.17). [§4.4]

4.22. ¬ Verify that $A_5$ is the only simple group of order 60, up to isomorphism.
(Hint: By Exercise 2.25, a simple group $G$ of order 60 contains a subgroup of index 5.
Use this fact to construct a homomorphism $G \to S_5$, and prove that the image of
this homomorphism must be $A_5$.) Note that $A_5$ has exactly five 2-Sylow subgroups;
cf. Exercise 4.20. Thus, the other possibility contemplated in Exercise 2.25 does
not occur. [2.25]


5. Products of groups 

We already know that products exist in Grp (see §II.3.4); here we analyze this notion
further and explore variations on the same theme, with an eye towards the question
of determining the information needed to reconstruct a group from its composition
factors.

5.1. The direct product. Recall from §II.3.4 that the (direct) product of two
groups $H$, $K$ is the group supported on the set $H \times K$, with operation defined
componentwise. We have checked (Proposition II.3.4) that the direct product satisfies
the universal property defining products in the category Grp.

There are situations in which the direct product of two subgroups N, H of a 
group G may be realized as a subgroup of G. Recall (Proposition II.8.11) that if 
one of the subgroups is normal, then the subset NH of G is in fact a subgroup of 
G. The relation between NH and N x H depends on how N and H intersect in 
G, so we take a look at this intersection. 

The ‘commutator’ $[A, B]$ of two subsets $A$, $B$ of $G$ (see §3.3) is the subgroup
generated by all commutators $[a, b]$ with $a \in A$, $b \in B$.

Lemma 5.1. Let $N$, $H$ be normal subgroups of a group $G$. Then

$$[N, H] \subseteq N \cap H.$$


$^{21}$ You can think algebraically if you want; if you prefer geometry, visualize pairs of opposite
sides on a tetrahedron.

Proof. It suffices to verify this on generators; that is, it suffices to check that

$$[n, h] = n(hn^{-1}h^{-1}) = (nhn^{-1})h^{-1} \in N \cap H$$

for all $n \in N$, $h \in H$. But the first expression and the normality of $N$ show that
$[n, h] \in N$; the second expression and the normality of $H$ show that $[n, h] \in H$. □

Corollary 5.2. Let $N$, $H$ be normal subgroups of a group $G$. Assume $N \cap H = \{e\}$.
Then $N$, $H$ commute with each other:

$$(\forall n \in N)\,(\forall h \in H) \quad nh = hn.$$

Proof. By Lemma 5.1, $[N, H] = \{e\}$ if $N \cap H = \{e\}$; the result follows immediately.

□


In fact, under the same hypothesis more is true:

Proposition 5.3. Let $N$, $H$ be normal subgroups of a group $G$, such that
$N \cap H = \{e\}$. Then $NH \cong N \times H$.

Proof. Consider the function

$$\varphi : N \times H \to NH$$

defined by $\varphi(n, h) = nh$. Under the stated hypothesis, $\varphi$ is a group homomorphism:
indeed

$$\begin{aligned}
\varphi((n_1, h_1) \cdot (n_2, h_2)) &= \varphi((n_1 n_2, h_1 h_2)) \\
&= n_1 n_2 h_1 h_2 \\
&= n_1 h_1 n_2 h_2 \\
&= \varphi((n_1, h_1)) \cdot \varphi((n_2, h_2))
\end{aligned}$$

since $N$, $H$ commute by Corollary 5.2.

The homomorphism $\varphi$ is surjective by definition of $NH$. To verify it is injective,
consider its kernel:

$$\ker \varphi = \{(n, h) \in N \times H \mid nh = e\}.$$

If $nh = e$, then $n \in N$ and $n = h^{-1} \in H$; thus $n = e$ since $N \cap H = \{e\}$. By the
same token, we conclude $h = e$; hence $(n, h)$ is the identity in $N \times H$, proving that
$\varphi$ is injective.

Thus $\varphi$ is an isomorphism, as needed. □

Remark 5.4. This result gives an alternative argument for the proof of Claim 2.16:
if $|G| = pq$, with $p < q$ prime integers, and $G$ contains normal subgroups $H$, $K$ of
order $p$, $q$, respectively (as is the case if $q \not\equiv 1 \bmod p$, by Sylow), then $H \cap K = \{e\}$
necessarily, and then Proposition 5.3 shows $HK \cong H \times K$. As $|HK| = |G| = pq$,
this proves $G \cong H \times K \cong \mathbb{Z}/p\mathbb{Z} \times \mathbb{Z}/q\mathbb{Z}$. Finally, $(1, 1)$ has order $pq$ in this group,
so $G$ is cyclic, with the same conclusion we obtained in Claim 2.16. ⌟
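A concrete instance may help fix ideas. The sketch below takes $p = 3$, $q = 5$ (so $q \not\equiv 1 \bmod p$) and works inside $\mathbb{Z}/15\mathbb{Z}$, written additively, so that the map $(n, h) \mapsto nh$ of Proposition 5.3 becomes $(n, h) \mapsto n + h$. The choice of group and the names `N`, `H`, `phi` are assumptions made for illustration only.

```python
m = 15
N = [x for x in range(m) if x % 5 == 0]   # subgroup of order 3: {0, 5, 10}
H = [x for x in range(m) if x % 3 == 0]   # subgroup of order 5: {0, 3, 6, 9, 12}

def phi(n, h):
    """The map N x H -> NH of Proposition 5.3, in additive notation."""
    return (n + h) % m

intersection = set(N) & set(H)
image = {phi(n, h) for n in N for h in H}
is_bijection = len(image) == len(N) * len(H) == m
is_homomorphism = all(
    phi((n1 + n2) % m, (h1 + h2) % m) == (phi(n1, h1) + phi(n2, h2)) % m
    for n1 in N for n2 in N for h1 in H for h2 in H
)

# The image of a pair of generators has order pq, so the group is cyclic.
generator = phi(N[1], H[1])
order = next(k for k in range(1, m + 1) if (k * generator) % m == 0)

print(intersection, is_bijection, is_homomorphism, generator, order)
```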
5.2. Exact sequences of groups; extension problem. Of course, the hypothesis
that both subgroups $N$, $H$ are normal is necessary for the result of
Proposition 5.3: for example, the permutations $(123)$ and $(12)$ generate subgroups $N$, $H$ of
$S_3$ meeting only at $\{e\}$, and $N$ is normal in $S_3$, but $S_3 = NH$ is not isomorphic to
the direct product of $N$ and $H$. It is natural to examine this more general situation.

Let $N$, $H$ be subgroups of a group $G$, with $N$ normal (but with no a priori
assumptions on $H$) and such that $N \cap H = \{e\}$; assume $G = NH$. We are after a
description of the structure of $G$ in terms of the structure of $N$ and $H$.

It is notationally convenient to use the language of exact sequences, introduced
for modules in §III.7.1. A (short) exact sequence of groups is a sequence of groups
and group homomorphisms

$$1 \longrightarrow N \xrightarrow{\ \varphi\ } G \xrightarrow{\ \psi\ } H \longrightarrow 1$$

where $\psi$ is surjective and $\varphi$ identifies $N$ with $\ker \psi$. In other words (by the first
isomorphism theorem), use $\varphi$ to identify $N$ with a subgroup of $G$; then the sequence
is exact if $N$ is normal in $G$ and $\psi$ induces an isomorphism $G/N \to H$.

The reader should pause a moment and check that if $G$, $N$, $H$ are abelian, then
this notion matches precisely the notion of short exact sequence of abelian groups
(i.e., $\mathbb{Z}$-modules) from §III.7.1; a notational difference is that here the trivial group
is denoted$^{22}$ ‘1’ rather than ‘0’.

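Of course there always is an exact sequence

$$1 \longrightarrow N \longrightarrow N \times H \longrightarrow H \longrightarrow 1:$$

map $n \in N$ to $(n, e_H)$ and $(n, h) \in N \times H$ to $h$. However, keep in mind that this
is a very special case: to reiterate the example mentioned above, there also is an
exact sequence

$$1 \longrightarrow C_3 \longrightarrow S_3 \longrightarrow C_2 \longrightarrow 1,$$

yet $S_3 \not\cong C_3 \times C_2$.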

Definition 5.5. Let $N$, $H$ be groups. A group $G$ is an extension of $H$ by $N$ if
there is an exact sequence of groups

$$1 \longrightarrow N \longrightarrow G \longrightarrow H \longrightarrow 1.$$ ⌟

The extension problem aims to describe all extensions of two given groups,
up to isomorphism. For example, there are two extensions of $C_2$ by $C_3$: namely
$C_6 \cong C_3 \times C_2$ and $S_3$; we will soon be able to verify that, up to isomorphism, there
are no other extensions.

The extension problem is the ‘second half’ of the classification problem: the
first half consists of determining all simple groups, and the second half consists of
figuring out how these can be put together to construct any group$^{23}$. For example,
if

$$G = G_0 \supsetneq G_1 \supsetneq G_2 \supsetneq G_3 \supsetneq G_4 = \{e\}$$

is a composition series, with (simple) quotients $H_i = G_i/G_{i+1}$, then $G$ is an
extension of $H_0$ by an extension of $H_1$ by an extension of $H_2$ by $H_3$: knowing the
composition factors of $G$ and the extension process, it should in principle be possible
to reconstruct $G$.

$^{22}$ This is not unreasonable, since groups are more often written ‘multiplicatively’ rather than
additively, so the identity element is more likely to be denoted 1 rather than 0.

$^{23}$ As mentioned earlier, the first half has been settled, although the complexity of the work
leading to its solution justifies some skepticism concerning the absolute correctness of the proof.
The status of the second half is, as far as we know, (even) murkier.

We are going to ‘solve’ the extension problem in the particular case in which
$H$ is also a subgroup of $G$, intersecting $N$ at $\{e\}$.

Definition 5.6. An exact sequence of groups

$$1 \longrightarrow N \longrightarrow G \longrightarrow H \longrightarrow 1$$

(or the corresponding extension) is said to split if $H$ may be identified with a
subgroup of $G$, so that $N \cap H = \{e\}$.

We encountered this terminology in §III.7.2 for modules, thus for abelian groups.
Note that the notion examined there appears to be more restrictive than
Definition 5.6, since it requires $G$ to be isomorphic to a direct product $N \times H$. This
apparent mismatch evaporates because of Proposition 5.3: in the abelian case,
every split extension (according to Definition 5.6) is in fact a direct product.

Of course split extensions are anyway very special, since quotients of a group G 
are usually not isomorphic to subgroups of G, even in the abelian case (cf. Exer- 
cise 5.4). 

Lemma 5.7. Let $N$ be a normal subgroup of a group $G$, and let $H$ be a subgroup
of $G$ such that $G = NH$ and $N \cap H = \{e\}$. Then $G$ is a split extension of $H$ by $N$.

Proof. We have to construct an exact sequence

$$1 \longrightarrow N \longrightarrow G \longrightarrow H \longrightarrow 1;$$

we let $N \to G$ be the inclusion map, and we prove that $G/N \cong H$. For this,
consider the composition

$$\alpha : H \hookrightarrow G \twoheadrightarrow G/N.$$

Then $\alpha$ is surjective: indeed, since $G = NH$, for every $g \in G$ we have $g = nh$ for some
$n \in N$ and $h \in H$, and then

$$gN = nhN = h(h^{-1}nh)N = hN = \alpha(h).$$

Further, $\ker \alpha = \{h \in H \mid hN = N\} = N \cap H = \{e\}$; therefore $\alpha$ is also injective,
as needed. □

To recap, if in the situation of Lemma 5.7 we also require that $H$ be normal in
$G$, then $G$ is necessarily isomorphic to the ‘trivial’ extension $N \times H$: this is what
we have proved in Proposition 5.3. We are seeking to describe the extension ‘even
if’ $H$ is not normal in $G$.

5.3. Internal/semidirect products. The attentive reader should have noticed
that the key to Proposition 5.3 is really Corollary 5.2: if both $N$ and $H$ are normal
and $N \cap H = \{e\}$, then $N$ and $H$ commute with each other. This is what ultimately
causes the extension $NH$ to be trivial. Now, recall that as soon as $N$ is normal,
then every subgroup $H$ of $G$ acts on $N$ by conjugation: in fact (cf. Exercise 1.21)
conjugation determines a homomorphism

$$\gamma : H \to \mathrm{Aut}_{\mathsf{Grp}}(N), \quad h \mapsto \gamma_h.$$

(Explicitly, for $h \in H$ the automorphism $\gamma_h : N \to N$ acts by $\gamma_h(n) := hnh^{-1}$.)
The subgroups $H$ and $N$ commute precisely when $\gamma$ is trivial. Corollary 5.2 shows
that if $N$ and $H$ are both normal and $N \cap H = \{e\}$, then $\gamma$ is indeed trivial.

This is the crucial remark. The next several considerations may be summarized
as follows: if $N$ is normal in $G$, $H$ is a subgroup of $G$, $N \cap H = \{e\}$ and $G = NH$,
then the extension $G$ of $H$ by $N$ may be reconstructed from the conjugation action
$\gamma : H \to \mathrm{Aut}_{\mathsf{Grp}}(N)$. The reader is advised to stare at the following triviality, which
is the motivating observation for the general discussion:

$$(*) \qquad (\forall n_1, n_2 \in N),\ (\forall h_1, h_2 \in H) \quad n_1 h_1 n_2 h_2 = (n_1(h_1 n_2 h_1^{-1}))(h_1 h_2).$$

This says that if we know the conjugation action of $H$ on $N$, then we can recover
the operation in $G$ from this information and from the operations in $N$ and $H$.

Here is the general discussion. It is natural to abstract the situation and begin
with any two groups $N$, $H$ and an arbitrary homomorphism$^{24}$

$$\theta : H \to \mathrm{Aut}_{\mathsf{Grp}}(N), \quad h \mapsto \theta_h.$$

Define an operation $\bullet_\theta$ on the set $N \times H$ as follows: for $n_1, n_2 \in N$ and $h_1, h_2 \in H$,
let

$$(n_1, h_1) \bullet_\theta (n_2, h_2) := (n_1 \theta_{h_1}(n_2),\ h_1 h_2).$$

This will look more reasonable once it is compared with (*)!

Lemma 5.8. The resulting structure $(N \times H, \bullet_\theta)$ is a group, with identity element
$(e_N, e_H)$.

Proof. The reader should carefully verify this. For example, inverses exist because

$$(n_1, h_1) \bullet_\theta (\theta_{h_1^{-1}}(n_1^{-1}),\ h_1^{-1}) = (n_1 \theta_{h_1}(\theta_{h_1^{-1}}(n_1^{-1})),\ h_1 h_1^{-1}) = (n_1 n_1^{-1},\ e_H) = (e_N, e_H)$$

and similarly in the reverse order. □
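For a small example the verification requested in the proof can be delegated to a machine. The sketch below is an illustration under stated assumptions, not the text's construction: it takes $N = \mathbb{Z}/7\mathbb{Z}$ and $H = \mathbb{Z}/3\mathbb{Z}$ (both written additively) and lets $h$ act on $N$ as multiplication by $2^h$, which defines a homomorphism $H \to \mathrm{Aut}(N)$ because $2^3 \equiv 1 \bmod 7$. The names `theta`, `mul`, `inv` are hypothetical.

```python
from itertools import product

Q, P = 7, 3   # N = Z/7Z and H = Z/3Z, written additively

def theta(h, n):
    """theta_h(n) = 2^h * n mod 7."""
    return (pow(2, h, Q) * n) % Q

def mul(x, y):
    """(n1, h1) . (n2, h2) := (n1 + theta_{h1}(n2), h1 + h2)."""
    (n1, h1), (n2, h2) = x, y
    return ((n1 + theta(h1, n2)) % Q, (h1 + h2) % P)

def inv(x):
    """The inverse given in the proof: (theta_{h^-1}(n^-1), h^-1)."""
    n, h = x
    h_inv = (-h) % P
    return (theta(h_inv, (-n) % Q), h_inv)

G = list(product(range(Q), range(P)))
e = (0, 0)

associative = all(mul(mul(x, y), z) == mul(x, mul(y, z)) for x in G for y in G for z in G)
identity_ok = all(mul(e, x) == x == mul(x, e) for x in G)
inverses_ok = all(mul(x, inv(x)) == e == mul(inv(x), x) for x in G)
commutative = all(mul(x, y) == mul(y, x) for x in G for y in G)

print(len(G), associative, identity_ok, inverses_ok, commutative)
# 21 True True True False
```

The result is a noncommutative group of order 21; replacing `theta` by the trivial action would give the direct product instead.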

Definition 5.9. The group $(N \times H, \bullet_\theta)$ is a semidirect product of $N$ and $H$ and is
denoted by $N \rtimes_\theta H$. ⌟

For example, the ordinary direct product is a semidirect product and
corresponds to $\theta$ = the trivial map. If the reader feels a little uneasy about giving
one name (semidirect product) for a whole host of different groups supported on
the Cartesian product, welcome to the club. In fact, it gets worse still: it is not
uncommon to omit ‘$\theta$’ from the notation and simply write $N \rtimes H$ for a semidirect
product$^{25}$.

$^{24}$ The reason why we are not denoting the image of $h$ by $\theta$ as $\theta(h)$ is that this is an
automorphism of $N$ and we dislike the notation $\theta(h)(n)$ for the image of $n \in N$ obtained by applying
the automorphism corresponding to $h$. The alternative $\theta_h(n)$ looks a little easier to parse.

In any case, the notation $\bullet_\theta$ is too heavy to carry around, so we generally revert
back to the usual simple juxtaposition of elements in order to denote multiplication
in $N \rtimes_\theta H$. The following proposition checks that semidirect products are split
extensions:

Proposition 5.10. Let $N$, $H$ be groups, and let $\theta : H \to \mathrm{Aut}_{\mathsf{Grp}}(N)$ be a
homomorphism; let $G = N \rtimes_\theta H$ be the corresponding semidirect product. Then

• $G$ contains isomorphic copies of $N$ and $H$;

• the natural projection $G \to H$ is a surjective homomorphism, with kernel $N$;
thus $N$ is normal in $G$, and the sequence

$$1 \longrightarrow N \longrightarrow N \rtimes_\theta H \longrightarrow H \longrightarrow 1$$

is (split) exact;

• $N \cap H = \{e_G\}$;

• $G = NH$;

• the homomorphism $\theta$ is realized by conjugation in $G$: that is, for $h \in H$ and
$n \in N$ we have

$$\theta_h(n) = hnh^{-1}$$

in $G$.

Proof. The functions $N \to G$, $H \to G$ defined for $n \in N$, $h \in H$ by

$$n \mapsto (n, e_H), \quad h \mapsto (e_N, h)$$

are manifestly injective homomorphisms, allowing us to identify $N$, $H$ with the
corresponding subgroups of $G$. It is clear that $N \cap H = \{(e_N, e_H)\} = \{e_G\}$, and

$$(n, e_H) \bullet_\theta (e_N, h) = (n, h)$$

shows that $G = NH$.

The projection $G \to H$ defined by

$$(n, h) \mapsto h$$

is a surjective homomorphism, with kernel $N$; therefore $N$ is normal in $G$. Finally,

$$(e_N, h) \bullet_\theta (n, e_H) \bullet_\theta (e_N, h)^{-1} = (\theta_h(n), h) \bullet_\theta (e_N, h^{-1}) = (\theta_h(n), e_H),$$

as claimed in the last point. □

$^{25}$ This is actually OK, if $N$ and $H$ are given as subgroups of a common group $G$, in which
case the implicit action is just conjugation.

Our original goal of ‘reconstructing’ a given split extension of a group $H$ by a
group $N$ is a sort of converse to this proposition. More precisely,

Proposition 5.11. Let $N$, $H$ be subgroups of a group $G$, with $N$ normal in $G$.
Assume that $N \cap H = \{e\}$, and $G = NH$. Let $\gamma : H \to \mathrm{Aut}_{\mathsf{Grp}}(N)$ be defined by
conjugation: for $h \in H$, $n \in N$,

$$\gamma_h(n) = hnh^{-1}.$$

Then $G \cong N \rtimes_\gamma H$.

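Proof. Define a function

$$\varphi : N \rtimes_\gamma H \to G$$

by $\varphi(n, h) = nh$; this is clearly a bijection. We need to verify that $\varphi$ is a
homomorphism, and indeed $(\forall n_1, n_2 \in N)$, $(\forall h_1, h_2 \in H)$:

$$\begin{aligned}
\varphi((n_1, h_1) \bullet_\gamma (n_2, h_2)) &= \varphi((n_1 \gamma_{h_1}(n_2),\ h_1 h_2)) \\
&= \varphi((n_1(h_1 n_2 h_1^{-1}),\ h_1 h_2)) \\
&= n_1(h_1 n_2 h_1^{-1})h_1 h_2 = (n_1 h_1)(n_2 h_2) \\
&= \varphi((n_1, h_1))\,\varphi((n_2, h_2))
\end{aligned}$$

as needed. □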

When realized ‘within’ a group, as in the previous proposition, the semidirect
product is sometimes called an internal product.

Remark 5.12. If $N$ and $H$ commute, then the conjugation action of $H$ on $N$ is
trivial; therefore $\gamma$ is the trivial map, and the semidirect product $N \rtimes_\gamma H$ is the
direct product $N \times H$. Thus, Proposition 5.11 recovers the result of Proposition 5.3
in this case. ⌟

Example 5.13. The automorphism group of $C_3$ is isomorphic to the cyclic group
$C_2$: if $C_3 = \{e, y, y^2\}$, then the two automorphisms of $C_3$ are

$$\mathrm{id} : \begin{cases} e \mapsto e, \\ y \mapsto y, \\ y^2 \mapsto y^2, \end{cases} \qquad \sigma : \begin{cases} e \mapsto e, \\ y \mapsto y^2, \\ y^2 \mapsto y. \end{cases}$$

Therefore, there are two homomorphisms $C_2 \to \mathrm{Aut}_{\mathsf{Grp}}(C_3)$: the trivial map, and
the isomorphism sending the identity to id and the nonidentity element to $\sigma$. The
semidirect product corresponding to the trivial map is the direct product $C_3 \times C_2 \cong C_6$;
the other semidirect product $C_3 \rtimes C_2$ is isomorphic to $S_3$. This can of course
be checked by hand (and you should do it, for fun); but it also follows immediately
from Proposition 5.11, since $N = \langle (123) \rangle$, $H = \langle (12) \rangle \subseteq S_3$ satisfy the hypotheses
of this result. ⌟
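The check "by hand" can be mirrored in a few lines. In this sketch (an illustration, with all names chosen here) the abstract group $C_3 \rtimes C_2$ is modeled on pairs $(i, j)$ with $i \in \mathbb{Z}/3\mathbb{Z}$, $j \in \mathbb{Z}/2\mathbb{Z}$, the nonidentity element of $C_2$ acting by inversion, and the map of Proposition 5.11 sends $(i, j)$ to $(123)^i (12)^j$. Permutations are tuples of images on {0, 1, 2}, composed left to right.

```python
from itertools import product

def compose(p, q):
    """Apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))

def power(p, k):
    result = tuple(range(len(p)))
    for _ in range(k):
        result = compose(result, p)
    return result

r = (1, 2, 0)   # the 3-cycle (123), relabeled on {0, 1, 2}
s = (1, 0, 2)   # the transposition (12)

def theta(j, i):
    """Trivial action for j = 0, inversion (the automorphism sigma) for j = 1."""
    return i % 3 if j % 2 == 0 else (-i) % 3

def mul(x, y):
    """Multiplication in the semidirect product C_3 x| C_2."""
    (i1, j1), (i2, j2) = x, y
    return ((i1 + theta(j1, i2)) % 3, (j1 + j2) % 2)

def phi(x):
    """The map (n, h) -> nh of Proposition 5.11."""
    i, j = x
    return compose(power(r, i), power(s, j))

G = list(product(range(3), range(2)))
phi_is_bijection = len({phi(x) for x in G}) == 6
phi_is_homomorphism = all(phi(mul(x, y)) == compose(phi(x), phi(y)) for x in G for y in G)

print(phi_is_bijection, phi_is_homomorphism)  # True True
```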

The reader should contemplate carefully the slightly more general case of dihe- 
dral groups (Exercise 5.11); this enriches (and in a sense explains) the discussion 
presented in Claim 2.17. 

In fact, semidirect products shed light on all groups of order $pq$, for $p < q$
primes; the reader should be able to complete the classification of these groups
begun in §2.5.2 and show that if $q \equiv 1 \bmod p$, then there is exactly one such
noncommutative group up to isomorphism (Exercise 5.12).

The reader would in fact be well-advised to try to use semidirect products to
classify groups of small order: if a nontrivial normal subgroup $N$ is found (typically
by applying Sylow’s theorems), with some luck the classification is reduced to the
study of possible homomorphisms from known groups to $\mathrm{Aut}_{\mathsf{Grp}}(N)$ and can be
carried out. See Exercise 5.15 for a further illustration of this technique.


Exercises


5.1. ▷ Let $G$ be a finite group, and let $P_1, \dots, P_r$ be its nontrivial Sylow subgroups.
Assume all $P_i$ are normal in $G$.

• Prove that $G \cong P_1 \times \cdots \times P_r$. (Induction on $r$; use Proposition 5.3.)

• Prove that $G$ is nilpotent. (Hint: Mod out by the center, and work by induction
on $|G|$. What is the center of a direct product of groups?)

Together with Exercise 3.10, this shows that a finite group is nilpotent if and only
if each of its Sylow subgroups is normal. [3.12, §6.1]

5.2. Let G be an extension of H by N. Prove that the composition factors of G 
are the collection of the composition factors of H and those of N. 

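5.3. Let

$$G = G_0 \supsetneq G_1 \supsetneq \cdots \supsetneq G_r = \{e\}$$

be a normal series. Show how to ‘connect’ $\{e\}$ to $G$ by means of $r$ exact sequences
of groups, involving only $\{e\}$, $G$, and the quotients $H_i = G_i/G_{i+1}$.

5.4. ▷ Prove that the sequence

$$0 \longrightarrow \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \longrightarrow \mathbb{Z}/2\mathbb{Z} \longrightarrow 0$$

is exact but does not split. [§5.2]

5.5. In Proposition III.7.5 we have seen that if an exact sequence

$$0 \longrightarrow M \xrightarrow{\ \varphi\ } N \longrightarrow N/(\varphi(M)) \longrightarrow 0$$

of abelian groups splits, then $\varphi$ has a left-inverse. Is this necessarily the case for
split sequences of groups?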

5.6. Prove Lemma 5.8. 

5.7. Let $N$ be a group, and let $\alpha : N \to N$ be an automorphism of $N$. Prove
that $\alpha$ may be realized as conjugation, in the sense that there exists a group $G$
containing $N$ as a normal subgroup and such that $\alpha(n) = gng^{-1}$ for some $g \in G$.

5.8. Prove that any semidirect product of two solvable groups is solvable. Show
that semidirect products of nilpotent groups need not be nilpotent.

5.9. ▷ Prove that if $G = N \rtimes H$ is commutative, then $G \cong N \times H$. [§6.1]

5.10. Let $N$ be a normal subgroup of a finite group $G$, and assume that $|N|$
and $|G/N|$ are relatively prime. Assume there is a subgroup $H$ in $G$ such that
$|H| = |G/N|$. Prove that $G$ is a semidirect product of $N$ and $H$.

5.11. ▷ For all $n > 0$ express $D_{2n}$ as a semidirect product $C_n \rtimes_\theta C_2$, finding $\theta$
explicitly. [§5.3]

5.12. ▷ Classify groups $G$ of order $pq$, with $p < q$ prime: show that if $|G| = pq$,
then either $G$ is cyclic or $q \equiv 1 \bmod p$ and there is exactly one isomorphism class
of noncommutative groups of order $pq$ in this case. (You will likely have to use the
fact that $\mathrm{Aut}_{\mathsf{Grp}}(C_q) \cong C_{q-1}$ if $q$ is prime; cf. Exercise II.4.15.) [§2.5, §5.3]

5.13. ¬ Let $G = N \rtimes_\theta H$ be a semidirect product, and let $K$ be the subgroup of $G$
corresponding to $\ker \theta \subseteq H$. Prove that $K$ is the kernel of the action of $G$ on the
set $G/H$ of left-cosets of $H$. [5.14]

5.14. Recall that $S_3 \cong \mathrm{Aut}_{\mathsf{Grp}}(C_2 \times C_2)$ (Exercise II.4.13). Let $\iota$ be this
isomorphism. Prove that $(C_2 \times C_2) \rtimes_\iota S_3 \cong S_4$. (Hint: Exercise 5.13.)

5.15. ▷ Let $G$ be a group of order 28.

• Prove that $G$ contains a normal subgroup $N$ of order 7.

• Recall (or prove again) that, up to isomorphism, the only groups of order 4 are
$C_4$ and $C_2 \times C_2$. Prove that there are two homomorphisms $C_4 \to \mathrm{Aut}_{\mathsf{Grp}}(N)$
and two homomorphisms $C_2 \times C_2 \to \mathrm{Aut}_{\mathsf{Grp}}(N)$ up to the choice of generators
for the sources.

• Conclude that there are four groups of order 28 up to isomorphism: the two
direct products $C_4 \times C_7$, $C_2 \times C_2 \times C_7$, and two noncommutative groups.

• Prove that the two noncommutative groups are $D_{28}$ and $C_2 \times D_{14}$.

[§5.3]

5.16. Prove that the quaternionic group $Q_8$ (cf. Exercise III.1.12) cannot be written
as a semidirect product of two nontrivial subgroups.

5.17. Prove that the multiplicative group $\mathbb{H}^*$ of nonzero quaternions (cf.
Exercise III.1.12) is isomorphic to a semidirect product $\mathrm{SU}_2(\mathbb{C}) \rtimes \mathbb{R}^+$. (Hint:
Exercise III.2.5.) Is this semidirect product in fact direct?


6. Finite abelian groups 

We will end this chapter by treating in some detail the classification theorem for
finite abelian groups mentioned in §II.6.3.

6.1. Classification of finite abelian groups. Now that we have acquired more
familiarity with products, we are in a position to classify all finite abelian groups$^{26}$.

In due time (Proposition VI.2.11, Exercise VI.2.19) we will in fact be able to
classify all finitely generated abelian groups: as mentioned in Example II.6.3, all
such groups are products of cyclic groups$^{27}$. In particular, this is the case for finite
abelian groups: this is what we prove in this section.

$^{26}$ Of course fancier semidirect products will not be needed here; cf. Exercise 5.9.

Since in this section we exclusively deal with abelian groups, we revert to the
abelian style of notations: thus the operation will be denoted $+$; the identity will
be 0; direct products will be called direct sums (and denoted $\oplus$); and so on.

First of all, we will congeal into an explicit statement a simple observation that
has been with us in one form or another since at least as far back as$^{28}$ Exercise II.4.9.

Lemma 6.1. Let $G$ be an abelian group, and let $H$, $K$ be subgroups such that $|H|$,
$|K|$ are relatively prime. Then $H + K \cong H \oplus K$.

Proof. By Lagrange’s theorem (Corollary II.8.14), $H \cap K = \{0\}$. Since subgroups
of abelian groups are automatically normal, the statement follows from
Proposition 5.3. □

Now let $G$ be a finite abelian group. For each prime $p$, the $p$-Sylow subgroup
of $G$ is unique, since it is automatically normal in $G$. Since the distinct nontrivial
Sylow subgroups of $G$ are $p$-groups for different primes $p$, Lemma 6.1 immediately
implies the following result.

Corollary 6.2. Every finite abelian group is the direct sum of its nontrivial Sylow 
subgroups. 

(The diligent reader knew already that this had to be the case, since abelian groups 
are nilpotent; cf. Exercise 5.1.) Thus, we already know that every finite abelian 
group is a direct sum of p-groups, and our main task amounts to classifying abelian 
p-groups for a fixed prime p. This is somewhat technical; we will get there by a 
seemingly roundabout path. 
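Before moving on, the decomposition into Sylow subgroups can be watched in a small case. The sketch below assumes the example $G = \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/6\mathbb{Z}$, of order $24 = 2^3 \cdot 3$ (a choice made here for illustration), computes each $p$-Sylow subgroup as the set of elements whose order is a power of $p$, and checks that every element of $G$ is a sum of one element from each, in exactly one way.

```python
from itertools import product

moduli = (4, 6)   # G = Z/4Z + Z/6Z
G = list(product(*(range(mod) for mod in moduli)))

def add(x, y):
    return tuple((a + b) % mod for a, b, mod in zip(x, y, moduli))

def order(x):
    """Order of x: the least k >= 1 with k*x = 0."""
    k, y = 1, x
    while any(y):
        y, k = add(y, x), k + 1
    return k

def is_power_of(p, k):
    while k % p == 0:
        k //= p
    return k == 1

# In an abelian group the p-Sylow subgroup consists of the elements of p-power order.
sylow = {p: [x for x in G if is_power_of(p, order(x))] for p in (2, 3)}
sums = [add(x, y) for x in sylow[2] for y in sylow[3]]

print(len(sylow[2]), len(sylow[3]), sorted(sums) == sorted(G))  # 8 3 True
```

Since the 24 sums are pairwise distinct and exhaust $G$, the map $(x, y) \mapsto x + y$ is a bijection from the product of the two Sylow subgroups onto $G$.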

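Lemma 6.3. Let $G$ be an abelian $p$-group, and let $g \in G$ be an element of maximal
order. Then the exact sequence

$$0 \longrightarrow \langle g \rangle \longrightarrow G \longrightarrow G/\langle g \rangle \longrightarrow 0$$

splits.

Put otherwise, there is a subgroup $L$ of $G$ such that $L$ maps isomorphically to
$G/\langle g \rangle$ via the canonical projection, that is, such that $\langle g \rangle \cap L = \{0\}$ and $\langle g \rangle + L = G$.
Note that it will follow that $G \cong \langle g \rangle \oplus L$, by Proposition 5.3.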

The main technicality needed in order to prove this lemma is the following 
particular case: 

Lemma 6.4. Let $p$ be a prime integer and $r \ge 1$. Let $G$ be a noncyclic abelian
group of order $p^{r+1}$, and let $g \in G$ be an element of order $p^r$. Then there exists an
element $h \in G$, $h \notin \langle g \rangle$, such that $|h| = p$.


$^{27}$ The natural context to prove this more general result is that of modules over Euclidean
rings or even principal ideal domains.

$^{28}$ In fact, this observation will really find its most appropriate resting place when we prove
the Chinese Remainder Theorem, Theorem V.6.1.

Lemma 6.4 is a special case of Lemma 6.3 in the sense that, with notation as in
the statement, necessarily $\langle h \rangle \cong G/\langle g \rangle$, and in fact $G \cong \langle g \rangle \oplus \langle h \rangle$ (and the reader
is warmly encouraged to understand this before proceeding!). That is, we can split
off the ‘large’ cyclic subgroup $\langle g \rangle$ as a direct summand of $G$, provided that $G$ is
not cyclic and not much larger than $\langle g \rangle$. Lemma 6.3 claims that this can be done
whenever $\langle g \rangle$ is a maximal cyclic subgroup of $G$. We will be able to prove this more
general statement easily once the particular case is settled.

Proof of Lemma 6.4. Denote $\langle g \rangle$ by $K$, and let $h'$ be any element of $G$, $h' \notin K$.
The subgroup $K$ is normal in $G$ since $G$ is abelian; the quotient group $G/K$ has
order $p$. Since $h' \notin K$, the coset $h' + K$ has order $p$ in $G/K$; that is, $ph' \in K$. Let
$k = ph'$.

Note that $|k|$ divides $p^r$; hence it is a power of $p$. Also $|k| \neq p^r$, otherwise
$|h'| = p^{r+1}$ and $G$ would be cyclic, contrary to the hypothesis.

Therefore $|k| = p^s$ for some $s < r$; $k$ generates a subgroup $\langle k \rangle$ of the cyclic
group $K$, of order $p^s$. By Proposition II.6.11, $\langle k \rangle = \langle p^{r-s} g \rangle$. Since $s < r$,
$\langle k \rangle \subseteq \langle pg \rangle$; thus, $k = mpg$ for some $m \in \mathbb{Z}$.

Then let $h = h' - mg$: $h \notin K$ (since $h' \notin K$ and $mg \in K$), so in particular $h \neq 0$, and

$$ph = ph' - p(mg) = k - k = 0,$$

showing that $|h| = p$, as stated. □

Proof of Lemma 6.3. Argue by induction on the order of $G$; the case $|G| = p^0 = 1$
requires no proof. Thus we will assume that $G$ is nontrivial and that the statement
is true for every $p$-group smaller than $G$.

Let $g \in G$ be an element of maximal order, say $p^r$, and denote by $K$ the
subgroup $\langle g \rangle$ generated by $g$; this subgroup is normal, as $G$ is abelian. If $G = K$,
then the statement holds trivially. If not, $G/K$ is a nontrivial $p$-group, and hence it
contains an element of order $p$ by Cauchy’s theorem (Theorem 2.1). This element
generates a subgroup of order $p$ in $G/K$, corresponding to a subgroup $G'$ of $G$ of
order $p^{r+1}$, containing $K$. This subgroup is not cyclic (otherwise the order of $g$ is
not maximal).

That is, we are in the situation of Lemma 6.4: hence we can conclude that there
is an element $h \in G'$ (and hence $h \in G$) with $h \notin K$ and $|h| = p$. Let $H = \langle h \rangle \subseteq G$
be the subgroup generated by $h$, and note that $K \cap H = \{0\}$.

Now work modulo $H$. The quotient group $G/H$ has smaller size than $G$, and
$g + H$ generates a cyclic subgroup $K' = (K + H)/H \cong K/(K \cap H) \cong K$ of maximal
order in $G/H$. By the induction hypothesis, there is a subgroup $L'$ of $G/H$ such
that $K' + L' = G/H$ and $K' \cap L' = \{0_{G/H}\}$. This subgroup $L'$ corresponds to a
subgroup $L$ of $G$ containing $H$.

Now we claim that (i) $K + L = G$ and (ii) $K \cap L = \{0\}$. Indeed, we have the
following:

(i) For any $a \in G$, there exist $mg + H \in K'$, $\ell + H \in L'$ such that $a + H =
mg + \ell + H$ (since $K' + L' = G/H$). This implies $a - mg \in L$, and hence
$a \in K + L$ as needed.

(ii) If $a \in K \cap L$, then $a + H \in K' \cap L' = \{0_{G/H}\}$, and hence $a \in H$. In particular,
$a \in K \cap H = \{0\}$, forcing $a = 0$, as needed.

(i) and (ii) imply the lemma, as observed in the comments following the statement.

□


Now we are ready to state the classification theorem; the proof is quite straight- 
forward after all this preparation work. We first give the statement in a somewhat 
coarse form, as a corollary of the previous considerations: 

Corollary 6.5. Let $G$ be a finite abelian group. Then $G$ is a direct sum of cyclic
groups, which may be assumed to be cyclic $p$-groups.

Proof. As noted in Corollary 6.2, $G$ is a direct sum of $p$-groups (as a consequence
of the Sylow theorems). We claim that every abelian $p$-group $P$ is a direct sum of
cyclic $p$-groups.

To establish this, argue by induction on $|P|$. There is nothing to prove if $P$ is
trivial. If $P$ is not trivial, let $g$ be an element of $P$ of maximal order. By Lemma 6.3

$$P = \langle g \rangle \oplus P'$$

for some subgroup $P'$ of $P$; by the induction hypothesis $P'$ is a direct sum of cyclic
$p$-groups, concluding the proof. □


6.2. Invariant factors and elementary divisors. Here is a more precise version 
of the classification theorem. It is common to state the result in two equivalent 
forms. 


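Theorem 6.6. Let $G$ be a finite nontrivial abelian group. Then

• there exist prime integers $p_1, \dots, p_r$ and positive integers $n_{ij}$ such that
$|G| = \prod_{i,j} p_i^{n_{ij}}$ and

$$G \cong \bigoplus_{i,j} \frac{\mathbb{Z}}{p_i^{n_{ij}}\mathbb{Z}};$$

• there exist positive integers $1 < d_1 \mid \cdots \mid d_s$ such that $|G| = d_1 \cdots d_s$ and

$$G \cong \frac{\mathbb{Z}}{d_1\mathbb{Z}} \oplus \cdots \oplus \frac{\mathbb{Z}}{d_s\mathbb{Z}}.$$

Further, these decompositions are uniquely determined by $G$.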


The first form is nothing but a more explicit version of the statement of Corol- 
lary 6.5, so it has already been proven. We will explain how to obtain the second 
form from the first. The uniqueness statement 29 is left to the reader (Exercise 6.1). 

The prime powers appearing in the first form of Theorem 6.6 are called the
elementary divisors of $G$; the integers $d_i$ appearing in the second form are called
invariant factors. To go from elementary divisors to invariant factors, collect the
elementary divisors in a table, listing (for example) prime powers according to
increasing primes in the horizontal direction and decreasing exponents in the vertical
direction; then the invariant factors are obtained as products of the factors in each
row:

$$\begin{array}{rcllll}
d_r &=& p_1^{n_{11}} & p_2^{n_{21}} & \cdots & p_r^{n_{r1}} \\
d_{r-1} &=& p_1^{n_{12}} & p_2^{n_{22}} & \cdots & p_r^{n_{r2}} \\
d_{r-2} &=& p_1^{n_{13}} & p_2^{n_{23}} & \cdots & p_r^{n_{r3}} \\
\vdots & & \vdots & \vdots & & \vdots
\end{array}$$

29 Of course the ‘uniqueness’ statement only holds up to trivial manipulation such as a
permutation of the factors. The claim is that the factors themselves are determined by G, in the sense
that two direct sums of either form given in the statement are isomorphic only if their factors
match.

Conversely, given the invariant factors $d_i$, obtain the rows of this table by
factoring $d_i$ into prime powers: the condition $d_1 \mid \cdots \mid d_r$ guarantees that these will
be decreasing.

Repeated applications of Lemma 6.1 show that if $d = p_1^{n_1} \cdots p_r^{n_r}$ for distinct
primes $p_i$ and positive $n_i$ (as is the case in each row of the table), then

$$\frac{\mathbb{Z}}{d\mathbb{Z}} \cong \frac{\mathbb{Z}}{p_1^{n_1}\mathbb{Z}} \oplus \cdots \oplus \frac{\mathbb{Z}}{p_r^{n_r}\mathbb{Z}},$$

proving that the two decompositions given in Theorem 6.6 are indeed equivalent. 
This will likely be much clearer once the reader works through a few examples. 
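
The table procedure can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `factor`, `invariant_factors`, and `elementary_divisors` are made up for this sketch, and a group is represented simply by its list of elementary divisors or invariant factors.

```python
from collections import defaultdict
from math import prod


def factor(n):
    """Prime factorization of n >= 1 as {prime: exponent}, by trial division."""
    out, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            out[p] = out.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        out[n] = out.get(n, 0) + 1
    return out


def invariant_factors(elementary):
    """Elementary divisors (prime powers) -> invariant factors d_1 | ... | d_s."""
    columns = defaultdict(list)           # one column of the table per prime
    for q in elementary:
        (p, _), = factor(q).items()       # fails unless q is a power of one prime
        columns[p].append(q)
    for col in columns.values():
        col.sort(reverse=True)            # largest exponent goes in the top row
    rows = max(len(col) for col in columns.values())
    ds = [prod(col[i] for col in columns.values() if i < len(col))
          for i in range(rows)]           # product of each row of the table
    return ds[::-1]                       # listed so that d_1 | d_2 | ...


def elementary_divisors(invariants):
    """Invariant factors -> sorted list of elementary divisors."""
    return sorted(p ** e for d in invariants for p, e in factor(d).items())
```

For instance, `invariant_factors([2, 2, 2, 3, 3, 9, 9, 5])` builds the rows 90, 18, 6, 3 and returns them in increasing order, and `elementary_divisors` undoes the step by factoring each invariant factor.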


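Example 6.7. Here are the two decompositions for a (random) group of order
$29160 = 2^3 \cdot 3^6 \cdot 5$:

$$\left(\frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{2\mathbb{Z}}\right) \oplus \left(\frac{\mathbb{Z}}{3\mathbb{Z}} \oplus \frac{\mathbb{Z}}{3\mathbb{Z}} \oplus \frac{\mathbb{Z}}{3^2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{3^2\mathbb{Z}}\right) \oplus \frac{\mathbb{Z}}{5\mathbb{Z}} \;\cong\; \frac{\mathbb{Z}}{3\mathbb{Z}} \oplus \frac{\mathbb{Z}}{6\mathbb{Z}} \oplus \frac{\mathbb{Z}}{18\mathbb{Z}} \oplus \frac{\mathbb{Z}}{90\mathbb{Z}}$$

and here is the corresponding table of invariant factors/elementary divisors:

$$\begin{array}{rclll}
90 &=& 2 & 3^2 & 5 \\
18 &=& 2 & 3^2 & \\
6 &=& 2 & 3 & \\
3 &=& & 3 &
\end{array}$$
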

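Example 6.8. There are exactly 6 isomorphism classes of abelian groups of
order 360. Indeed, $360 = 2^3 \cdot 3^2 \cdot 5$; the six possible tables of elementary divisors
are shown below. In terms of invariant factors, the six distinct abelian groups of
order 360 (up to isomorphism, by the uniqueness part of Theorem 6.6) are therefore

$$\frac{\mathbb{Z}}{360\mathbb{Z}}, \quad \frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{180\mathbb{Z}}, \quad \frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{90\mathbb{Z}}, \quad \frac{\mathbb{Z}}{3\mathbb{Z}} \oplus \frac{\mathbb{Z}}{120\mathbb{Z}}, \quad \frac{\mathbb{Z}}{6\mathbb{Z}} \oplus \frac{\mathbb{Z}}{60\mathbb{Z}}, \quad \frac{\mathbb{Z}}{2\mathbb{Z}} \oplus \frac{\mathbb{Z}}{6\mathbb{Z}} \oplus \frac{\mathbb{Z}}{30\mathbb{Z}}.$$

$$\begin{array}{rclll}
360 &=& 2^3 & 3^2 & 5
\end{array}
\qquad
\begin{array}{rclll}
180 &=& 2^2 & 3^2 & 5 \\
2 &=& 2 & &
\end{array}
\qquad
\begin{array}{rclll}
90 &=& 2 & 3^2 & 5 \\
2 &=& 2 & & \\
2 &=& 2 & &
\end{array}$$

$$\begin{array}{rclll}
120 &=& 2^3 & 3 & 5 \\
3 &=& & 3 &
\end{array}
\qquad
\begin{array}{rclll}
60 &=& 2^2 & 3 & 5 \\
6 &=& 2 & 3 &
\end{array}
\qquad
\begin{array}{rclll}
30 &=& 2 & 3 & 5 \\
6 &=& 2 & 3 & \\
2 &=& 2 & &
\end{array}$$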
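
The count in Example 6.8 can be reproduced mechanically: a table of elementary divisors amounts to a choice, for each prime $p^e$ exactly dividing the order, of a partition of the exponent $e$. The following Python sketch enumerates these choices and reports each group by its invariant factors; the helper names (`partitions`, `factor`, `abelian_groups`) are assumptions of this illustration, not notation from the text.

```python
from itertools import product
from math import prod


def partitions(n, largest=None):
    """Yield the partitions of n as non-increasing tuples of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for k in range(min(n, largest), 0, -1):
        for rest in partitions(n - k, k):
            yield (k,) + rest


def factor(n):
    """Prime factorization of n >= 1 as {prime: exponent}, by trial division."""
    out, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            out[p] = out.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        out[n] = out.get(n, 0) + 1
    return out


def abelian_groups(n):
    """Invariant-factor tuples (d_1 | ... | d_s), one per abelian group of order n."""
    choices = [[(p, part) for part in partitions(e)] for p, e in factor(n).items()]
    groups = []
    for combo in product(*choices):       # one column of exponents per prime
        rows = max((len(part) for _, part in combo), default=0)
        ds = [prod(p ** part[i] for p, part in combo if i < len(part))
              for i in range(rows)]       # row products, largest first
        groups.append(tuple(reversed(ds)))
    return sorted(groups)
```

Calling `abelian_groups(360)` yields six tuples, matching the six tables of the example.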



6.3. Application: Finite subgroups of multiplicative groups of fields. Any 
classification theorem is useful in that it potentially reduces the proof of general 
facts to explicit verifications. Here is one example illustrating this strategy: 

Lemma 6.9. Let $G$ be a finite abelian group, and assume that for every integer $n$
the number of elements $g \in G$ such that $ng = 0$ is at most $n$. Then $G$ is cyclic.

The reader should try to prove this ‘by hand’, to appreciate the fact that
it is not entirely trivial. It does become essentially immediate once we take the
classification of finite abelian groups into account. Indeed, by Theorem 6.6

$$G \cong \frac{\mathbb{Z}}{d_1\mathbb{Z}} \oplus \cdots \oplus \frac{\mathbb{Z}}{d_s\mathbb{Z}}$$

for some positive integers $1 < d_1 \mid \cdots \mid d_s$. But if $s > 1$, then $|G| > d_s$ and $d_s g = 0$
for all $g \in G$ (so that the order of $g$ divides $d_s$), contradicting the hypothesis.
Therefore $s = 1$; that is, $G$ is cyclic.
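
As a sanity check of Lemma 6.9 on small cases, the hypothesis can be tested by brute force. The sketch below is illustrative only: it assumes the group is given as a product $\mathbb{Z}/m_1\mathbb{Z} \times \cdots \times \mathbb{Z}/m_k\mathbb{Z}$ encoded by the list of moduli, and the function names are invented for this purpose. It suffices to test $n \le |G|$, since for larger $n$ the bound holds automatically.

```python
from itertools import product
from math import prod


def torsion_count(moduli, n):
    """Number of g in Z/m_1 x ... x Z/m_k with n*g = 0."""
    return sum(
        all((n * x) % m == 0 for x, m in zip(g, moduli))
        for g in product(*(range(m) for m in moduli))
    )


def satisfies_hypothesis(moduli):
    """True if, for every n, at most n elements g of the group satisfy n*g = 0."""
    order = prod(moduli)
    return all(torsion_count(moduli, n) <= n for n in range(1, order + 1))
```

Here `satisfies_hypothesis([2, 3])` holds (the group is cyclic of order 6), while `satisfies_hypothesis([2, 6])` fails already at $n = 2$, where four elements are killed by 2.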

Lemma 6.9 is the key to a particularly nice proof of the following important
fact, a weak form of which 30 we ran across back in Example II.4.6. Recall that the
set $F^*$ of nonzero elements of a field $F$ is a commutative group under multiplication.
Also recall (Example III.4.7) that a polynomial $f(x) \in F[x]$ is divisible by $(x - a)$
if and only if $f(a) = 0$; since a nonzero polynomial of degree $n$ over a field can
have at most $n$ linear factors, this shows 31 that if $f(x) \in F[x]$ has degree $n$, then
$f(a) = 0$ for at most $n$ distinct elements $a \in F$.

Theorem 6.10. Let $F$ be a field, and let $G$ be a finite subgroup of the multiplicative
group $(F^*, \cdot)$. Then $G$ is cyclic.

Proof. By the considerations preceding the statement, for every $n$ there are at
most $n$ elements $a \in F$ such that $a^n - 1 = 0$, that is, at most $n$ elements $a \in G$
such that $a^n = 1$. Lemma 6.9 implies then that $G$ is cyclic. □

30 The diligent reader has proved that particular case in Exercise II.4.11. The proof hinted
at in that exercise upgrades easily to the general case presented here. My point is not that the
classification theorem is necessary in order to prove statements such as Theorem 6.10; my point
is that it makes such statements nearly evident.

31 Unique factorization in $F[x]$ is secretly needed here. We will deal with this issue more
formally later; cf. Lemma V.5.1.

As a (very) particular case, the multiplicative group $((\mathbb{Z}/p\mathbb{Z})^*, \cdot)$ is cyclic: this
is the fact pointed to in Example II.4.6.
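
For small primes the cyclicity of $(\mathbb{Z}/p\mathbb{Z})^*$ can be observed directly by searching for elements of order $p - 1$. The following is a naive sketch (function names are illustrative assumptions; the search is by brute force and only sensible for small $p$).

```python
def multiplicative_order(a, p):
    """Order of a in (Z/pZ)^*, assuming p is prime and p does not divide a."""
    k, x = 1, a % p
    while x != 1:
        x = (x * a) % p
        k += 1
    return k


def generators(p):
    """All classes a in 1..p-1 that generate (Z/pZ)^*, i.e., have order p - 1."""
    return [a for a in range(1, p) if multiplicative_order(a, p) == p - 1]
```

For example, `generators(7)` returns `[3, 5]`; Theorem 6.10 guarantees that the list is nonempty for every prime $p$.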

Preview of coming attractions: Finitely generated (as opposed to just finite)
abelian groups are also direct sums of cyclic groups. The only difference between
the classification of finitely generated abelian groups and the classification of finite
abelian groups explored here is the possible presence of a ‘free’ factor $\mathbb{Z}^{\oplus r}$ in the
decomposition. The reader will first prove this fact in Exercise VI.2.19, as a
consequence of ‘Gaussian elimination over integral domains’, and then recover it again
as a particular case of the classification theorem for finitely generated modules
over PIDs, Theorem VI.5.6. Neither Gaussian elimination nor the very general
Theorem VI.5.6 is any harder to prove than the particular case of finite abelian
groups laboriously worked out by hand in this section: a common benefit of finding
the right general point of view is that, as a rule, proofs simplify. Technical work
such as that performed in order to prove Lemma 6.3 is absorbed into the work
necessary to build up the more general apparatus; the need for such technicalities
evaporates in the process.


Exercises 


6.1. ▷ Prove that the decomposition of a finite abelian group $G$ as a direct sum
of cyclic $p$-groups is unique. (Hint: The prime factorization of $|G|$ determines the
primes, so it suffices to show that if

$$\frac{\mathbb{Z}}{p^{r_1}\mathbb{Z}} \oplus \cdots \oplus \frac{\mathbb{Z}}{p^{r_m}\mathbb{Z}} \cong \frac{\mathbb{Z}}{p^{s_1}\mathbb{Z}} \oplus \cdots \oplus \frac{\mathbb{Z}}{p^{s_n}\mathbb{Z}},$$

with $r_1 \ge \cdots \ge r_m$ and $s_1 \ge \cdots \ge s_n$, then $m = n$ and $r_i = s_i$ for all $i$. Do this by
induction, by considering the group $pG$ obtained as the image of the homomorphism
$G \to G$ defined by $g \mapsto pg$.) [§6.2, §VI.5.3, VI.5.12]

6.2. Complete the classification of groups of order 8 (cf. Exercise 2.16). 

6.3. Let $G$ be a noncommutative group of order $p^3$, where $p$ is a prime integer.
Prove that $Z(G) \cong \mathbb{Z}/p\mathbb{Z}$ and $G/Z(G) \cong \mathbb{Z}/p\mathbb{Z} \times \mathbb{Z}/p\mathbb{Z}$.

6.4. Classify abelian groups of order 400. 

6.5. Let $p$ be a prime integer. Prove that the number of distinct isomorphism classes
of abelian groups of order $p^r$ equals the number of partitions of the integer $r$.

6.6. ▷ How many abelian groups of order 1024 are there, up to isomorphism?
[§II.6.3]

6.7. ¬ Let $p > 0$ be a prime integer, $G$ a finite abelian group, and denote by
$\rho : G \to G$ the homomorphism defined by $\rho(g) = pg$.

• Let $A$ be a finite abelian group such that $pA = 0$. Prove that $A \cong \mathbb{Z}/p\mathbb{Z} \oplus \cdots \oplus
\mathbb{Z}/p\mathbb{Z}$.

• Prove that $p \ker\rho$ and $p(\operatorname{coker}\rho)$ are both 0.

• Prove that $\ker\rho \cong \operatorname{coker}\rho$.

• Prove that every subgroup of $G$ of order $p$ is contained in $\ker\rho$ and that every
subgroup of $G$ of index $p$ contains $\operatorname{im}\rho$.

• Prove that the number of subgroups of $G$ of order $p$ equals the number of
subgroups of $G$ of index $p$.

[6.8]

6.8. ¬ Let $G$ be a finite abelian $p$-group, with elementary divisors $p^{n_1}, \dots, p^{n_r}$
($n_1 \ge n_2 \ge \cdots$). Prove that $G$ has a subgroup $H$ with invariant divisors $p^{m_1}, \dots, p^{m_s}$
($m_1 \ge m_2 \ge \cdots$) if and only if $s \le r$ and $m_i \le n_i$ for $i = 1, \dots, s$. (Hint: One
direction is immediate. For the other, with notation as in Exercise 6.7, compare
$\ker\rho$ for $H$ and $G$ to establish $s \le r$; this also proves the statement if all $n_i = 1$.
For the general case use induction, noting that if $G \cong \bigoplus_i \mathbb{Z}/p^{n_i}\mathbb{Z}$, then
$\rho(G) \cong \bigoplus_i \mathbb{Z}/p^{n_i - 1}\mathbb{Z}$.)

Prove that the same description holds for the homomorphic images of $G$. [6.9]

6.9. Let $H$ be a subgroup of a finite abelian group $G$. Prove that $G$ contains a
subgroup isomorphic to $G/H$. (Reduce to the case of $p$-groups; then use Exercise 6.8.)
Show that both hypotheses ‘finite’ and ‘abelian’ are needed for this result. (Hint:
$Q_8$ has a unique subgroup of order 2.)

6.10. The dual of a finite group $G$ is the abelian group $G^\vee := \operatorname{Hom}_{\mathsf{Grp}}(G, \mathbb{C}^*)$,
where $\mathbb{C}^*$ is the multiplicative group of $\mathbb{C}$.

• Prove that the image of every $\sigma \in G^\vee$ consists of roots of 1 in $\mathbb{C}$, that is, roots
of polynomials $x^n - 1$ for some $n$.

• Prove that if $G$ is a finite abelian group, then $G \cong G^\vee$. (Hint: First prove this for
cyclic groups; then use the classification theorem to generalize to the arbitrary
case.)

In Example VIII.6.5 we will encounter another notion of ‘dual’ of a group.

6.11. • Use the classification theorem for finite abelian groups (Theorem 6.6) to
classify all finite modules over the ring $\mathbb{Z}/n\mathbb{Z}$.

• Prove that if $p$ is prime, all finite modules over $\mathbb{Z}/p\mathbb{Z}$ are free 32 .

6.12. Let $G$, $H$, $K$ be finite abelian groups such that $G \oplus H \cong G \oplus K$. Prove that
$H \cong K$.

6.13. ¬ Let $G$, $H$ be finite abelian groups such that, for all positive integers $n$, $G$
and $H$ have the same number of elements of order $n$. Prove that $G \cong H$. (Note:
The ‘abelian’ hypothesis is necessary! $C_4 \times C_4$ and $Q_8 \times C_2$ are nonisomorphic
groups both with 1 element of order 1, 3 elements of order 2, and 12 elements of
order 4.) [§II.4.3]

6.14. Let $G$ be a finite abelian $p$-group, and assume $G$ has only one subgroup of
order $p$. Prove that $G$ is cyclic. (This is in some sense a converse to Proposition II.6.11.
You are welcome to try to prove it ‘by hand’, but use of the classification theorem
will simplify the argument considerably.)

32 As we will see in Proposition VI.4.10, this property characterizes fields.

6.15. Let $G$ be a finite abelian group, and let $a \in G$ be an element of maximal
order in $G$. Prove that the order of every $b \in G$ divides $|a|$. (This essentially
reproduces the result of Exercise II.1.15.)

6.16. Let $G$ be an abelian group of order $n$, and assume that $G$ has at most one
subgroup of order $d$ for all $d \mid n$. Prove that $G$ is cyclic.



Chapter V 


Irreducibility and 
factorization in integral 
domains 


We move our attention back to rings and analyze several useful classes of integral 
domains. One guiding theme in this chapter is the issue of factorization: we will 
address the problem of existence and uniqueness of factorizations of elements in a 
ring, abstracting good factorization properties of rings such as $\mathbb{Z}$ or $k[x]$ (for $k$ a
field) to whole classes of integral domains. The reader may want to associate the 
following picture with the first part of this chapter: 



Blanket assumption: all rings considered in this chapter will be commutative 1 . In
fact, most of the special classes of rings we will consider will be integral domains,
that is, commutative rings with 1 and with no nonzero zero-divisors (cf. Definition III.1.10).

1 Also, recall that all our rings have 1; cf. Definition III.1.1.

1. Chain conditions and existence of factorizations 

1.1. Noetherian rings revisited. Let $R$ be a commutative ring. Recall that $R$
is said to be Noetherian if every ideal of $R$ is finitely generated (Definition III.4.2).
In fact, this is a special case of the corresponding definition for modules: a module
$M$ over a ring $R$ is Noetherian if every submodule of $M$ is finitely generated
(Definition III.6.6). In §III.6.4 we have verified that this condition is preserved
through exact sequences: if $M$, $N$, $P$ are $R$-modules and

$$0 \to N \to M \to P \to 0$$

is an exact sequence of $R$-modules, then $M$ is Noetherian if and only if both $N$
and $P$ are Noetherian (Proposition III.6.7). An easy and useful consequence of this
fact is that every finitely generated module over a Noetherian ring is Noetherian
(Corollary III.6.8).

The Noetherian condition may be expressed in alternative ways, and it is useful
to acquire some familiarity with them.

Proposition 1.1. Let $R$ be a commutative ring, and let $M$ be an $R$-module. Then
the following are equivalent:

(1) $M$ is Noetherian; that is, every submodule of $M$ is finitely generated.

(2) Every ascending chain of submodules of $M$ stabilizes; that is, if

$$N_1 \subseteq N_2 \subseteq N_3 \subseteq \cdots$$

is a chain of submodules of $M$, then $\exists i$ such that $N_i = N_{i+1} = N_{i+2} = \cdots$.

(3) Every nonempty family of submodules of $M$ has a maximal element w.r.t.
inclusion.

The second condition listed here is called the ascending chain condition (a.c.c.) 
for submodules. For M = R, Proposition 1.1 tells us (among other things) that a 
ring is Noetherian if and only if the ascending chain condition holds for its ideals. 
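
The ascending chain condition is easy to watch in the ring $\mathbb{Z}$, where the ideal $(a_1, \dots, a_k)$ is generated by the greatest common divisor of $a_1, \dots, a_k$. The sketch below (the name `ideal_chain` is an assumption of this illustration) records a generator for each ideal in the chain $(a_1) \subseteq (a_1, a_2) \subseteq \cdots$; each generator divides the previous one, so the chain can strictly increase only finitely many times.

```python
from math import gcd


def ideal_chain(elements):
    """Generators g_k >= 0 with (a_1, ..., a_k) = (g_k) in Z, for k = 1, 2, ..."""
    chain, g = [], 0
    for a in elements:
        g = gcd(g, a)        # adding a generator can only enlarge the ideal
        chain.append(g)
    return chain


chain = ideal_chain([360, 84, 30, 12, 7, 100])
```

In this run the chain of generators is 360, 12, 6, 6, 1, 1: once the ideal becomes $(1) = \mathbb{Z}$, it has stabilized.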

Proof. (1) $\Rightarrow$ (2): Assume that $M$ is Noetherian, and let

$$N_1 \subseteq N_2 \subseteq N_3 \subseteq \cdots$$

be a chain of submodules of $M$. Consider the union

$$N = \bigcup_i N_i:$$

the reader will verify that $N$ is a submodule of $M$. Since $M$ is Noetherian, $N$ is
finitely generated, say $N = \langle n_1, \dots, n_r \rangle$. Now $n_k \in N \Rightarrow n_k \in N_i$ for some $i$;
by picking the largest such $i$, we see that $\exists i$ such that all $n_1, \dots, n_r$ are contained
in $N_i$. But then $N \subseteq N_i$, and since $N_i \subseteq N_{i+1} \subseteq \cdots$ are all contained in $N = N_i$,
it follows that $N_i = N_{i+1} = N_{i+2} = \cdots$ as needed.

(2) $\Rightarrow$ (3): Arguing contrapositively, assume that $M$ admits a family $\mathscr{F}$ of
submodules that does not have a maximal element. Construct an infinite ascending
chain as follows: let $N_1$ be any element of $\mathscr{F}$; since $N_1$ is not maximal in $\mathscr{F}$, there
exists an element $N_2$ of $\mathscr{F}$ such that $N_1 \subsetneq N_2$; since $N_2$ is not maximal in $\mathscr{F}$, there
exists an element $N_3$ of $\mathscr{F}$ such that $N_2 \subsetneq N_3$; etc. The chain

$$N_1 \subsetneq N_2 \subsetneq N_3 \subsetneq \cdots$$

does not stabilize, showing that (2) does not hold.

(3) $\Rightarrow$ (1): Assume (3) holds, and let $N$ be a submodule of $M$. Then the
family $\mathscr{F}$ of finitely generated submodules of $N$ is nonempty (as $(0) \in \mathscr{F}$); hence it has
a maximal element $N'$. Say that $N' = \langle n_1, \dots, n_r \rangle$. Now we claim that $N' = N$:
indeed, let $n \in N$; the submodule $\langle n_1, \dots, n_r, n \rangle$ is finitely generated, and therefore
it is in $\mathscr{F}$; as it contains $N'$ and $N'$ is maximal, necessarily $\langle n_1, \dots, n_r, n \rangle = N'$; in
particular $n \in N'$, as needed.

This shows that $N = N'$ is finitely generated, and since $N \subseteq M$ was arbitrary,
this implies that $M$ is Noetherian. □

Noetherian rings are a very useful and flexible class of rings. In §III.6.5 we
mentioned the important fact that every finite-type algebra over a Noetherian ring is
Noetherian. ‘Finite-type (commutative) algebra’ is just a fancy name for a quotient
of a polynomial ring (§III.6.5), so this is what the fact states:

Theorem 1.2. Let $R$ be a Noetherian ring, and let $J$ be an ideal of the polynomial
ring $R[x_1, \dots, x_n]$. Then the ring $R[x_1, \dots, x_n]/J$ is Noetherian.

Note that finite-type $R$-algebras are (in general) very far from being finitely
generated as modules over $R$ (cf. again §III.6.5), so it would be foolish to expect
them to be Noetherian as $R$-modules. The fact that they turn out to be Noetherian
as rings (that is, as modules over themselves) provides us with a huge class of
examples of Noetherian rings, among which are the rings of (classical) algebraic
geometry and number theory. Thus, entire fields of mathematics are a little more
manageable thanks to Theorem 1.2.

The proof of this deep fact is surprisingly easy. By Exercise 1.1, it suffices to
prove that

$$R \text{ Noetherian} \implies R[x_1, \dots, x_n] \text{ Noetherian};$$

and an immediate induction reduces the statement to the following particular case,
which carries a distinguished name:

Lemma 1.3 (Hilbert’s basis theorem). $R$ Noetherian $\implies$ $R[x]$ Noetherian.

Proof. Assume $R$ is Noetherian, and let $I$ be an ideal of $R[x]$. We have to prove
that $I$ is finitely generated.

Recall that if $f(x) = a_d x^d + a_{d-1} x^{d-1} + \cdots + a_0 \in R[x]$ and $a_d \ne 0$, then $a_d$ is
called the leading coefficient of $f(x)$. Consider the following subset of $R$:

$$A = \{0\} \cup \{a \in R \mid a \text{ is a leading coefficient of an element of } I\}.$$

It is clear that $A$ is an ideal of $R$ (Exercise 1.6); since $R$ is Noetherian, $A$ is finitely
generated. Thus there exist elements $f_1(x), \dots, f_r(x) \in I$ whose leading coefficients
$a_1, \dots, a_r$ generate $A$ as an ideal of $R$.

Now let $d_i$ be the degree of $f_i(x)$, and let $d$ be the maximum among these
degrees. Consider the sub-$R$-module

$$M = \langle 1, x, x^2, \dots, x^{d-1} \rangle \subseteq R[x],$$

that is, the $R$-module consisting of polynomials of degree $< d$. Since $M$ is finitely
generated as a module over $R$, it is Noetherian as an $R$-module (by Corollary III.6.8).
Therefore, the submodule

$$M \cap I$$

of $M$ is finitely generated over $R$, say by $g_1(x), \dots, g_s(x) \in I$.

Claim 1.4.

$$I = (f_1(x), \dots, f_r(x), g_1(x), \dots, g_s(x)).$$

This claim implies the statement of the theorem. To prove the claim, we only
need to prove the $\subseteq$ inclusion; to this end, let $\alpha(x) \in I$ be an arbitrary polynomial
in $I$. If $\deg \alpha(x) \ge d$, let $a$ be the leading coefficient of $\alpha(x)$. Then $a \in A$, so
$\exists b_1, \dots, b_r \in R$ such that

$$a = b_1 a_1 + \cdots + b_r a_r.$$

Letting $e = \deg \alpha(x)$, so that $e \ge d_i$ for all $i$, this says that

$$\alpha(x) - b_1 x^{e - d_1} f_1(x) - \cdots - b_r x^{e - d_r} f_r(x)$$

has degree $< e$. Iterating this procedure, we obtain a finite list of polynomials
$\beta_1(x), \dots, \beta_r(x) \in R[x]$ such that

$$\alpha(x) - \beta_1(x) f_1(x) - \cdots - \beta_r(x) f_r(x)$$

has degree $< d$. But this places this element in $M \cap I$; therefore $\exists c_1, \dots, c_s \in R$
such that

$$\alpha(x) - \beta_1(x) f_1(x) - \cdots - \beta_r(x) f_r(x) = c_1 g_1(x) + \cdots + c_s g_s(x),$$

and we are done, since this verifies that

$$\alpha(x) = \beta_1(x) f_1(x) + \cdots + \beta_r(x) f_r(x) + c_1 g_1(x) + \cdots + c_s g_s(x)
\in (f_1(x), \dots, f_r(x), g_1(x), \dots, g_s(x)),$$

completing the proof of Claim 1.4, hence of Lemma 1.3, hence of Theorem 1.2. □
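
The degree-lowering step at the heart of this proof can be illustrated concretely over $R = \mathbb{Z}$. The sketch below is a toy illustration under explicit assumptions: polynomials are coefficient lists `[c_0, c_1, ...]`, the coefficients $b_i$ expressing the leading coefficient of $\alpha$ in terms of those of the $f_i$ are supplied by the caller rather than computed, and the names `degree` and `reduce_step` are invented here.

```python
def degree(f):
    """Degree of a coefficient list [c_0, c_1, ...]; -1 stands for the zero polynomial."""
    for i in range(len(f) - 1, -1, -1):
        if f[i] != 0:
            return i
    return -1


def reduce_step(alpha, fs, bs):
    """Return alpha - sum_i b_i * x^(e - d_i) * f_i, where e = deg alpha, d_i = deg f_i.

    Assumes e >= d_i for all i and that sum_i b_i * lead(f_i) = lead(alpha).
    """
    e = degree(alpha)
    assert sum(b * f[degree(f)] for f, b in zip(fs, bs)) == alpha[e]
    out = list(alpha[: e + 1])
    for f, b in zip(fs, bs):
        d = degree(f)
        for j in range(d + 1):
            out[j + e - d] -= b * f[j]    # subtract b * x^(e-d) * f termwise
    return out


f1 = [1, 2]            # 2x + 1, leading coefficient 2
f2 = [0, 0, 3]         # 3x^2, leading coefficient 3
alpha = [0, 0, 4, 2]   # 2x^3 + 4x^2 = x^2 * f1 + f2

step1 = reduce_step(alpha, [f1, f2], [1, 0])   # leading coefficient 2 = 1*2 + 0*3
step2 = reduce_step(step1, [f1, f2], [0, 1])   # leading coefficient 3 = 0*2 + 1*3
```

Each call strictly lowers the degree, as in the proof: here the first step leaves $3x^2$ and the second leaves 0, exhibiting $\alpha = x^2 f_1 + f_2$.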

1.2. Prime and irreducible elements. Let $R$ be a (commutative) ring, and let
$a, b \in R$. We say that $a$ divides $b$, or that $a$ is a divisor of $b$, or that $b$ is a multiple
of $a$, if $b \in (a)$, that is,

$$(\exists c \in R), \quad b = ac.$$

We use the notation $a \mid b$.

Two elements $a$, $b$ are associates if $(a) = (b)$, that is, if $a \mid b$ and $b \mid a$.

Lemma 1.5. Let $a$, $b$ be nonzero elements of an integral domain $R$. Then $a$ and
$b$ are associates if and only if $a = ub$, for $u$ a unit in $R$.

Proof. Assume $a$ and $b$ are associates. Then $\exists c, d \in R$ such that

$$b = ac, \quad a = bd;$$

therefore $a = bd = acd$, i.e.,

$$a(1 - cd) = 0.$$

Since cancellation by nonzero elements holds in integral domains, this implies $cd = 1$.
Thus $c$ is a unit, as needed.

The converse is left to the reader. □

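Incidentally, here the reader sees why it is convenient to restrict our attention
to integral domains: for this argument to work, $a$ must be a non-zero-divisor.
For example, in $\mathbb{Z}/6\mathbb{Z}$ the classes $[2]_6$, $[4]_6$ of 2 and 4 are associates according to
our definition, since $[4]_6 = [2]_6 \cdot [2]_6$ and $[2]_6 = [4]_6 \cdot [2]_6$; yet with $c = d = [2]_6$ we
get $cd = [4]_6 \ne 1$, so the cancellation step breaks down and the factor $c$ it produces
is not a unit. (In this particular ring one can still write $[4]_6 = [5]_6 \cdot [2]_6$ with $[5]_6$ a
unit; in other commutative rings with zero-divisors no such unit need exist.) Away
from the comfortable environment of integral domains, even harmless-looking statements
such as Lemma 1.5 may fail.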
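
A direct computation in $\mathbb{Z}/6\mathbb{Z}$ shows what happens to the argument of Lemma 1.5 in the presence of zero-divisors. The sketch below uses ad hoc names (`units`, `ideal`, `cs`, `ds`) chosen for this illustration. It finds every $c$ with $[4] = [2]c$ and every $d$ with $[2] = [4]d$, and shows that $cd = 1$ is not forced: the choice $c = d = [2]$ is not a unit, although the unit $[5]$ also satisfies $[5][2] = [4]$.

```python
n = 6
ring = range(n)

# Units of Z/6Z: classes with a multiplicative inverse.
units = [u for u in ring if any((u * v) % n == 1 for v in ring)]


def ideal(a):
    """The principal ideal (a) of Z/nZ, as a set of residues."""
    return frozenset((a * x) % n for x in ring)


are_associates = ideal(2) == ideal(4)            # (2) = (4) = {0, 2, 4}
cs = [c for c in ring if (2 * c) % n == 4]       # all c with 4 = 2c
ds = [d for d in ring if (4 * d) % n == 2]       # all d with 2 = 4d
non_unit_pairs = [(c, d) for c in cs for d in ds if (c * d) % n != 1]
```

The pair $(c, d) = ([2], [2])$ appears in `non_unit_pairs`: here $a(1 - cd) = 0$ holds without $cd = 1$, because $a = [2]$ is a zero-divisor.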

The notions reviewed above generalize directly the corresponding notions in Z. 
We are going to explore analogs of other common notions in Z, such as ‘primality’ 
and ‘irreducibility’, in more general integral domains. 

Definition 1.6. Let $R$ be an integral domain.

• An element $a \in R$ is prime if the ideal $(a)$ is prime; that is, $a$ is not a unit
and (cf. Proposition III.4.11)

$$a \mid bc \implies (a \mid b \text{ or } a \mid c).$$

• An element $a \in R$ is irreducible if $a$ is not a unit and

$$a = bc \implies (b \text{ is a unit or } c \text{ is a unit}).$$

There are useful alternative ways to think about the notion of ‘irreducible’: a
nonunit $a$ is irreducible if and only if

• $a = bc$ implies that $a$ is an associate of $b$ or of $c$;

• $a = bc$ implies that $(a) = (b)$ or $(a) = (c)$ (Lemma 1.5);

• $(a) \subseteq (b) \implies (b) = (a)$ or $(b) = (1)$ (Exercise 1.12);

• $(a)$ is maximal among proper principal ideals (just rephrasing the previous
point!).

It is important to realize that primality and irreducibility are not equivalent;
this is somewhat counterintuitive since they are equivalent in $\mathbb{Z}$, as the reader
should verify 2 (Exercise 1.13). What is true in general is that prime is stronger
than irreducible:

Lemma 1.7. Let $R$ be an integral domain, and let $a \in R$ be a nonzero prime
element. Then $a$ is irreducible.


2 This fact will be fully explained by the general theory, so the reader should work this out
right away.

Proof. Since $(a)$ is prime, $(a) \ne (1)$: hence $a$ is not a unit. If $a = bc$, then
$bc = a \in (a)$; therefore $b \in (a)$ or $c \in (a)$ since $(a)$ is prime. Assuming without
loss of generality $b \in (a)$, we have $(b) \subseteq (a)$. On the other hand $a = bc$ implies
$(a) \subseteq (b)$: hence $(a) = (b)$, that is, $a$ and $b$ are associates, as needed. □

We will soon see under what circumstances the converse statement holds. 

1.3. Factorization into irreducibles; domains with factorizations. 

Definition 1.8. Let $R$ be an integral domain. An element $r \in R$ has a factorization
(or decomposition) into irreducibles if there exist irreducible elements $q_1, \dots, q_n$
such that $r = q_1 \cdots q_n$.

This factorization is unique if the elements $q_i$ are determined by $r$ up to order
and associates, that is, if whenever

$$r = q'_1 \cdots q'_m$$

is another factorization of $r$ into irreducibles, then $m = n$ and $q'_i$ is an associate of
$q_i$ after (possibly) shuffling the factors.

Definition 1.9. An integral domain $R$ is a domain with factorizations (or
‘factorizations exist in $R$’) if every nonzero, nonunit element $r \in R$ has a factorization
into irreducibles.

Definition 1.10. An integral domain $R$ is factorial, or a unique factorization
domain (abbreviated UFD), if every nonzero, nonunit element $r \in R$ has a unique
factorization into irreducibles.

The terminology introduced in Definition 1.9 does not appear to be too
standard; by contrast, UFDs are famous.

We will study the unique factorization condition in §2. For now, it seems
worth spending a little time contemplating the mere existence of factorizations.
Interestingly, this condition is implied by an ascending chain condition, for a special
class of ideals.

Proposition 1.11. Let $R$ be an integral domain, and let $r$ be a nonzero, nonunit
element of $R$. Assume that every ascending chain of principal ideals

$$(r) \subseteq (r_1) \subseteq (r_2) \subseteq (r_3) \subseteq \cdots$$

stabilizes. Then $r$ has a factorization into irreducibles.

Proof. Assume that $r$ does not have a factorization into irreducible elements. In
particular, $r$ is itself not irreducible; thus $\exists r_1, s_1 \in R$ such that $r = r_1 s_1$ and
$(r) \subsetneq (r_1)$, $(r) \subsetneq (s_1)$. If both $r_1$, $s_1$ have factorizations into irreducibles, the
product of these factorizations gives a factorization of $r$; thus we may assume that
(e.g.) $r_1$ does not have a factorization into irreducibles. Therefore we have

$$(r) \subsetneq (r_1)$$

and $r_1$ does not have a factorization; iterating this argument constructs an infinitely
increasing chain

$$(r) \subsetneq (r_1) \subsetneq (r_2) \subsetneq (r_3) \subsetneq \cdots,$$

contradicting our hypothesis. □

Thus, factorizations exist in integral domains in which the ascending chain con- 
dition holds for principal ideals. This has the following immediate and convenient 
consequence: 

Corollary 1.12. Let $R$ be a Noetherian domain. Then factorizations exist in $R$.

Proof. By Proposition 1.1, Noetherian domains satisfy the ascending chain condi- 
tion for all ideals. □ 
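
In the Noetherian domain $\mathbb{Z}$, the argument of Proposition 1.11 turns into a terminating procedure: keep splitting a reducible element into two nonunit factors, and note that each split produces a strictly larger principal ideal. The sketch below is a naive illustration (trial division; the names `split`, `factorization`, and `chain` are assumptions of this example), returning positive representatives of the irreducible factors, that is, factors up to associates.

```python
from math import isqrt


def split(r):
    """Return (r1, s1) with |r| = r1 * s1 and neither factor a unit, or None if r is irreducible."""
    r = abs(r)
    for d in range(2, isqrt(r) + 1):
        if r % d == 0:
            return d, r // d
    return None


def factorization(r):
    """Irreducible factors (up to sign) of an integer r with |r| > 1, by repeated splitting."""
    parts = split(r)
    if parts is None:
        return [abs(r)]
    r1, s1 = parts
    return factorization(r1) + factorization(s1)


def chain(r):
    """Generators of a strictly ascending chain (r) < (s1) < (s2) < ... of principal ideals."""
    out = [abs(r)]
    while (parts := split(out[-1])) is not None:
        out.append(parts[1])     # follow the second factor of each split
    return out
```

For `r = 360` the chain of generators is 360, 180, 90, 45, 15, 5, and it must stop because each generator is a proper divisor of the previous one.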

Corollary 1.12 verifies part of the picture presented at the beginning of the
chapter: the class of Noetherian domains is contained in the class of domains with
factorization. This inclusion is proper: for instance, the standard example of a
non-Noetherian ring,

$$\mathbb{Z}[x_1, x_2, x_3, \dots],$$

(Example III.6.5) does have factorizations. Indeed, every given polynomial $f \in
\mathbb{Z}[x_1, x_2, \dots]$ involves only finitely many variables 3 , so it belongs to a subring
$\mathbb{Z}[x_1, \dots, x_n]$ isomorphic to an ordinary polynomial ring over $\mathbb{Z}$; further, this
subring contains every divisor of $f$. It follows easily (cf. Exercise 1.15) that the
ascending chain condition for principal ideals holds in $\mathbb{Z}[x_1, x_2, x_3, \dots]$, because it holds
in $\mathbb{Z}[x_1, \dots, x_n]$ (since this ring is Noetherian, by Hilbert’s basis theorem).


Exercises 


Remember that in this section all rings are taken to be commutative. 

1.1. > Let R be a Noetherian ring, and let / be an ideal of R. Prove that R/I is a 
Noetherian ring. [§1.1] 

1.2. Prove that if f?jx] is Noetherian, so is R. (This is a ‘converse’ to Hilbert’s 
basis theorem.) 

1.3. Let k he a field, and let / e fc[a:], / f k. For every subring R of fcfs] containing 
k and /, define a homomorphism v? : fe[t] — » R. by extending the identity on k and 
mapping t to /. This makes every such R. a fc[f]-algehra (Example III.5.6). 

• Prove that &[a;] is finitely generated as a fc[f]-module. 

• Prove that eveiy subring R as above is finitely generated as a &[t]-module. 

• Prove that eveiy subring of fc[«] containing & is a Noetherian ring. 

1.4. Let R. be the ring of real- valued continuous functions on the interval [0,1]. 
Prove that R is not Noetherian. 

1.5. Determine for which sets S the power set ring &{S) is Noetherian. (Cf. Ex- 
ercise m.3.16.) 

3 Remember that polynomials are finite linear combinations of monomials; cf. §111.1.3. 
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1.6. t> Let I be an Ideal of #[*], and let A C R. be the set defined in the proof of 
Theorem 1.2. Prove that A is an ideal of R. [§1.1] 

1.7. Prove that if R is a Noetherian ring, then the ring of power series f?[[a;]] 
(cf. §111.1.3) is also Noetherian. (Hint: The order of a power series ICSo is the 
smallest i for which o» 0; the dominant coefficient is then a*. Let A; C R be the 
set of dominant coefficients of series of order i in I, together with 0. Prove that 
A, is an ideal of R and Ao C Ai C At C • • • . This sequence stabilizes since R is 
Noetherian, and each A, is finitely generated for the same reason. Now adapt the 
proof of Lemma 1.3.) 

1.8. Prove that every ideal in a Noetherian ring R. contains a finite product of prime 
ideals. (Hint: Let & be the family of ideals that do not contain finite products of 
prime ideals. If & is nonempty, it has a maximal element M since R is Noetherian. 
Since M € M is not itself prime, so 3a, b £ R s.t. a g M, b g M, yet ab € M. 
What’s wrong with this?) 

1.9. -i Let R. be a commutative ring, and let / C R. be a proper ideal. The reader 
will prove in Exercise 3.12 that the set of prime ideals containing I has minimal 
elements (the minimal primes of I). Prove that if R. is Noetherian, then the set 
of minimal primes of I is finite. (Hint: Let & be the family of ideals that do not 
have finitely many minimal primes. If & ^ 0, note that & must have a maximal 
element /, and I is not prime itself. Find ideals Ji , Ji strictly larger than /, such 
that J x J 2 C I, and deduce a contradiction.) [VIA. 10] 

1.10. -i By Proposition 1.1, a ring R is Noetherian if and only if it satisfies the 
a.c.c. for ideals. A ring is Artinian if it satisfies the d.c.c. (descending chain con- 
dition) for ideals. Prove that if R is Artinian and I C R. is an ideal, then R/I is 
Artinian. Prove that if R is an Artinian integral domain, then it is a field. (Hint: 
Let r € R, r ^ 0. The ideals (r") form a descending sequence; hence (r n ) = (r n+1 ) 
for some n. Therefore. . . .) Prove that Artinian rings have Krull dimension 0 (that 
is, prime ideals are maximal in Artinian rings 4 ). [2.11] 

1.11. Prove that the ‘associate’ relation is an equivalence relation. 

1.12. t> Let R be an integral domain. Prove that o e R is irreducible if and only 
if (a) is maximal among proper principal ideals of R. [§1.2, §2.3] 

1.13. t> Prove that prime 4=> irreducible in Z. (§1.2, §2.3] 

1.14. For a, 6 in a commutative ring R, prove that the class of a in R/(b) is prime 
if and only if the class of 6 in R/(a) is prime. 

1.15. ▷ Identify S = Z[x_1, . . . , x_n] in the natural way with a subring of the
polynomial ring in countably infinitely many variables R = Z[x_1, x_2, x_3, . . .]. Prove
that if f ∈ S and (f) ⊆ (g) in R, then g ∈ S as well. Conclude that the ascending
chain condition for principal ideals holds in R, and hence R is a domain with
factorizations. [§1.3, §4.3]

4 One can prove that Artinian rings are necessarily Noetherian; in fact, a ring is Artinian if
and only if it is Noetherian and has Krull dimension 0. Thus, the d.c.c. implies the a.c.c., while
the a.c.c. implies the d.c.c. if and only if all prime ideals are maximal.
1.16. Let

R = Z[x_1, x_2, x_3, . . .] / (x_1 − x_2^2, x_2 − x_3^2, . . .).

Does the ascending chain condition for principal ideals hold in R?

1.17. ▷ Consider the subring of C:

Z[√−5] := {a + bi√5 | a, b ∈ Z}.

• Prove that this ring is isomorphic to Z[t]/(t^2 + 5).

• Prove that it is a Noetherian integral domain.

• Define a ‘norm’ N on Z[√−5] by setting N(a + bi√5) = a^2 + 5b^2. Prove that
N(zw) = N(z)N(w). (Cf. Exercise III.4.10.)

• Prove that the units in Z[√−5] are ±1. (Use the preceding point.)

• Prove that 2, 3, 1 + i√5, 1 − i√5 are all irreducible nonassociate elements of
Z[√−5].

• Prove that no element listed in the preceding point is prime. (Prove that the
rings obtained by mod-ing out the ideals generated by these elements are not
integral domains.)

• Prove that Z[√−5] is not a UFD.

[§2.2, 2.18, 6.14]
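Before attempting the proofs in Exercise 1.17, it may help to experiment numerically. The following Python sketch is only a spot check on a small box of elements, not a proof; the pair representation (a, b) for a + bi√5 and the helper names `mul` and `norm` are assumptions made for illustration.

```python
from itertools import product

def mul(z, w):
    """Multiply a + b*sqrt(-5) and c + d*sqrt(-5), each stored as a pair of integers."""
    a, b = z
    c, d = w
    return (a * c - 5 * b * d, a * d + b * c)

def norm(z):
    """N(a + b*sqrt(-5)) = a^2 + 5 b^2."""
    a, b = z
    return a * a + 5 * b * b

box = range(-4, 5)
elements = list(product(box, box))

# N(zw) = N(z) N(w), checked on a small box of elements only.
multiplicative = all(norm(mul(z, w)) == norm(z) * norm(w)
                     for z in elements for w in elements)

# Two factorizations of 6.
six_a = mul((2, 0), (3, 0))
six_b = mul((1, 1), (1, -1))

# a^2 + 5 b^2 <= 3 forces b = 0 and |a| <= 1, so the box contains every
# candidate element of norm 2 or 3.
attained = {norm(z) for z in elements}
no_norm_2_or_3 = 2 not in attained and 3 not in attained
```

The norms of 2, 3, and 1 ± i√5 are 4, 9, and 6; a proper factor of any of them would need norm 2 or 3, and the last check suggests why no such element exists.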


2. UFDs, PIDs, Euclidean domains 

2.1. Irreducible factors and greatest common divisor. An integral domain
R is a UFD if factorizations exist in R and are unique in the sense of Definition 1.8.

Thus, in a UFD all elements (other than 0 and the units) determine a multiset
(a set of elements ‘with multiplicity’; cf. §I.1.1) of irreducible factors, determined
up to the associate relation. We can also agree that units have no factors; that is,
the corresponding multiset is ∅.

The following trivial remark is at the root of most elementary facts about 
UFDs, such as the characterization of Theorem 2.5: 

Lemma 2.1. Let R be a UFD, and let a, b, c be nonzero elements of R. Then

• (a) ⊆ (b) ⟺ the multiset of irreducible factors of b is contained in the
multiset of irreducible factors of a;

• a and b are associates (that is, (a) = (b)) ⟺ the two multisets coincide;

• the irreducible factors of a product bc are the collection of all irreducible factors
of b and of c.

The proof is left to the reader (Exercise 2.1). The advantage of working in 
a UFD resides in the fact that ring-theoretic statements about elements of the 
ring often reduce to straightforward set-theoretic statements about multisets of 
irreducible elements, by means of Lemma 2.1. 
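To see Lemma 2.1 at work in the most familiar UFD, here is a small Python sketch for positive integers. It is an illustration rather than a proof: the helpers `factor_multiset` and `contained` are names introduced here, and factoring by trial division is only reasonable for small inputs.

```python
from collections import Counter

def factor_multiset(n):
    """Multiset of prime factors of a positive integer n, by trial division."""
    factors = Counter()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] += 1
            n //= p
        p += 1
    if n > 1:
        factors[n] += 1
    return factors

def contained(small, big):
    """True when the multiset `small` is contained in the multiset `big`."""
    return all(big[p] >= k for p, k in small.items())

# (a) ⊆ (b) means that b divides a; compare with containment of multisets.
agree = all((a % b == 0) == contained(factor_multiset(b), factor_multiset(a))
            for a in range(1, 200) for b in range(1, 200))

# The factors of a product are the factors of both terms, collected together.
product_rule = factor_multiset(84 * 90) == factor_multiset(84) + factor_multiset(90)
```

The unit 1 corresponds to the empty multiset, in agreement with the convention that units have no factors.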
One important instance of this mechanism is the existence of greatest common
divisors. We have liberally used this notion (at least for integers) in previous 
chapters; now we can appreciate it from a more technical perspective. 

Definition 2.2. Let R be an integral domain, and let a, b ∈ R. An element d ∈ R
is a greatest common divisor (often abbreviated ‘gcd’) of a and b if (a, b) ⊆ (d) and
(d) is the smallest principal ideal in R with this property. ⌟

In other words, d is a gcd of a and b if d | a, d | b, and
c | a, c | b ⟹ c | d.

This definition is immediately extended to any finite number of elements. 

Note that greatest common divisors are not defined uniquely by this prescrip- 
tion: if d is a greatest common divisor of a and b, so is every associate of d. Thus, 
the notation ‘gcd(a, b)’ should only be used for the associate class formed by all
greatest common divisors of a, b. Of course, language is often (harmlessly) abused 
on this point. For example, the fact that we can talk about the greatest common 
divisor of two integers is due to the fact that in Z there is a convenient way to choose 
a distinguished element in each class of associate integers (that is, the nonnegative 
one). 

Also note that greatest common divisors need not exist (cf. Exercise 2.5); but 
they do exist in UFDs: 

Lemma 2.3. Let R be a UFD, and let a,b be nonzero elements of R. Then a, b 
have a greatest common divisor. 

Proof. We can write

a = u q_1^{α_1} ⋯ q_r^{α_r},    b = v q_1^{β_1} ⋯ q_r^{β_r},

where u and v are units, the elements q_i are irreducible, q_i is not an associate of q_j
for i ≠ j, and α_i ≥ 0, β_i ≥ 0 (so that the multisets of irreducible factors of a,
resp., b, consist of those q_i for which α_i > 0, resp., β_i > 0; the units u, v are
included since the irreducible factors are only defined up to the associate relation).
We claim that

d = q_1^{min(α_1,β_1)} ⋯ q_r^{min(α_r,β_r)}

is a gcd of a and b. Indeed, d is clearly a divisor of a and b; and if c also divides
a and b, then the multiset of factors of c must be contained in both multisets of
factors for a and b (by Lemma 2.1); that is,

c = w q_1^{γ_1} ⋯ q_r^{γ_r}

with w a unit and γ_i ≤ α_i, γ_i ≤ β_i. This implies γ_i ≤ min(α_i, β_i), and hence c | d
(again by Lemma 2.1), as needed. □

Of course the argument given in the proof generalizes one of the standard 
ways to compute greatest common divisors in Z: find smallest exponents in prime 
factorizations. But note that this is not the only way to compute the gcd in Z; 
we will come back to this point in a moment. In fact, greatest common divisors 
in Z have properties that should not be expected in a more general domain: for 
example, the result of Exercise II.2.13 does not generalize to arbitrary domains
(and not even to arbitrary UFDs), as the reader will check in Exercise 2.4. 
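The recipe in the proof of Lemma 2.3, specialized to positive integers, can be sketched in Python as follows. This is an illustrative sketch, not an efficient method: `factor_multiset` and `gcd_by_factoring` are names introduced here, and the comparison with `math.gcd` is only a check over a small range.

```python
from collections import Counter
from math import gcd, prod

def factor_multiset(n):
    """Multiset of prime factors of a positive integer n, by trial division."""
    factors = Counter()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] += 1
            n //= p
        p += 1
    if n > 1:
        factors[n] += 1
    return factors

def gcd_by_factoring(a, b):
    """Product of q^min(alpha, beta) over the common prime factors q of a and b."""
    # Counter & Counter keeps each key with the minimum of the two multiplicities.
    common = factor_multiset(a) & factor_multiset(b)
    return prod(p ** k for p, k in common.items())

example = gcd_by_factoring(360, 84)   # 360 = 2^3 3^2 5 and 84 = 2^2 3 7
matches = all(gcd_by_factoring(a, b) == gcd(a, b)
              for a in range(1, 150) for b in range(1, 150))
```

Here the distinguished representative of the associate class is the positive one, as discussed above for Z.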

2.2. Characterization of UFDs. It is easy to construct integral domains where 
unique factorization fails. The diligent reader has already analyzed one example 
(Exercise 1.17); for another, in the domain⁵

R = C[x, y, z, w] / (xw − yz)

the (classes of the) elements x, y, z, w are irreducible and not associates of one
another; since xw — yz = 0 in R, the element r = xw has two distinct factorizations 
into irreducibles: r = xw = yz. 

Note that this ring is Noetherian, by Theorem 1.2 (and in particular
factorizations do exist in R). Thus, there are Noetherian integral domains that are not
UFDs. 

Also note that this ring provides an example in which the converse to Lemma 1.7
does not hold: indeed, (the class of) x is irreducible, but the quotient

(C[x, y, z, w] / (xw − yz)) / (x) ≅ C[x, y, z, w] / (x, xw − yz) = C[x, y, z, w] / (x, yz)

is not an integral domain (because y ≠ 0, z ≠ 0, and yet yz = 0 in this ring); that
is, x is not prime.

In fact, and maybe a little surprisingly, the issue of unique factorization is
inextricably linked with the relation between primality and irreducibility. Indeed, 
the ‘converse’ to Lemma 1.7 does hold in UFDs: 

Lemma 2.4. Let R be a UFD, and let a be an irreducible element of R. Then a 
is prime. 

Proof. The element a is not a unit, by definition of irreducible. Assume bc ∈ (a):
thus (bc) ⊆ (a), and by Lemma 2.1 the irreducible factors of a, that is, a itself,
must be among the factors of b or of c. We have b ∈ (a) in the first case and c ∈ (a)
in the second. This shows that (a) is a prime ideal, as needed. □

In fact, more is true. Provided that the ascending chain condition for principal 
ideals holds, then UFDs are characterized by the equivalence between irreducibility 
and primality. 

Theorem 2.5. An integral domain R is a UFD if and only if

• the a.c.c. for principal ideals holds in R and

• every irreducible element of R is prime.

Proof. (⟹) Assume that R is a UFD. Lemma 2.4 shows that irreducible elements
of R are prime. To prove that the a.c.c. for principal ideals holds, consider
an ascending chain

(r_1) ⊆ (r_2) ⊆ (r_3) ⊆ ⋯ .

5 In algebraic geometry, this is the ring of a ‘quadric cone in A^4’. The vertex of this cone (at
the origin) is a singular point, and this has to do with the fact that R is not a UFD.
By Lemma 2.1, this chain determines a corresponding descending chain of multisets
of irreducible factors. A descending chain of finite multisets clearly stabilizes, and it
follows (by Lemma 2.1 again) that (r_i) = (r_{i+1}) = (r_{i+2}) = . . . for large enough i.

(⟸) Now assume that R satisfies the a.c.c. for principal ideals and irreducibles
are prime. Proposition 1.11 implies that factorizations exist in R; we have
to verify uniqueness. Let q_1, . . . , q_m and q'_1, . . . , q'_n be irreducible elements of R,
and assume

q_1 ⋯ q_m = q'_1 ⋯ q'_n.

Then q'_1 ⋯ q'_n ∈ (q_1), and (q_1) is a prime ideal (by hypothesis); thus q'_i ∈ (q_1)
for some i, which we may assume to be 1 after changing the order of the factors.
Therefore q'_1 = u q_1 for some u ∈ R. Since q'_1 is irreducible and q_1 is not a unit,
necessarily u is a unit. Thus q_1 and q'_1 are associates. Canceling q_1 and replacing
q'_2 by u q'_2, we find

q_2 ⋯ q_m = q'_2 ⋯ q'_n.

Repeating this process matches all factors one by one. It is clear that m = n,
because otherwise we will obtain 1 = a product of irreducibles, contradicting the
fact that irreducibles are not units. □

Theorem 2.5 is a pleasant statement, but it does not make it particularly easy to 
check whether a given ring is in fact a UFD. The situation is not unlike that with 
Noetherian rings: the a.c.c. is a sharp equivalent formulation of the Noetherian 
condition, but in practice one relies more often on other tools, such as Hilbert’s
basis theorem, in order to establish that a given ring is Noetherian. Does an analog 
of Hilbert’s basis theorem hold for unique factorization domains? 

Answering this question requires some preparatory work, and we will come back 
to it in §4. 

2.3. PID ⟹ UFD. There are simple ways to produce examples of UFDs.
Recall (from §III.4) that a principal ideal domain (abbreviated PID) is an integral
domain in which every ideal is principal. In §III.4 we have observed that

• PIDs are Noetherian.

• Z and k[x] (where k is a field) are PIDs.

• If R is a PID and a, b ∈ R, then d is a greatest common divisor for a and b if
and only if (a, b) = (d). In particular, if d is a greatest common divisor of a
and b in a PID R, then d is a linear combination of a and b: ∃r, s ∈ R such
that d = ra + sb.

• If I is a nonzero ideal in a PID, then I is prime if and only if it is maximal.

We now add one important point to this list:

Proposition 2.6. If R is a PID, then it is a UFD. 

Proof. Let R be a PID. Since PIDs are Noetherian, the a.c.c. holds in R (for all
ideals, hence in particular for principal ideals); we verify that irreducible elements
are prime in R, which implies that R is a UFD, by Theorem 2.5.

Let a ∈ R be an irreducible element, and assume bc ∈ (a); we have to show
that either b or c is in (a). If b ∈ (a), we are done; thus, assume b ∉ (a). Then
(a, b) ⊋ (a); as R is a PID, ∃d ∈ R such that (d) = (a, b), and hence (d) ⊋ (a).
This implies that (a, b) = (d) = (1), because a is irreducible and ideals generated
by irreducible elements are maximal among principal ideals (cf. Exercise 1.12).

Hence ∃r, s ∈ R such that ra + sb = 1. Multiplying by c, we obtain that

c = rac + sbc ∈ (a)

as needed. □

In particular, k[x] is a UFD if k is a field, and Z is a UFD, which hopefully
will not come as a big surprise to our readers⁶. Note that at this point we have
recovered the fact stated in Exercise 1.13 in full glory.

Proposition 2.6 justifies another feature of the picture given at the beginning
of this chapter: the class of PIDs is contained in the class of UFDs. We will soon
see that the inclusion is proper, that is, that there are UFDs which are not PIDs;
for example, Z[x] is not a PID (Exercise 2.12), yet

Z[x] is a unique factorization domain

as we will soon be able to prove (Theorem 4.14). In fact, there are UFDs which 
are not Noetherian, as represented in the picture; however, these examples are best 
discussed after more material has been developed (Example 4.18). 

The reader can already get a feel for the gap between UFD and PID, by con- 
templating the fact (recalled above and essentially tautological) that, in a PID, 
greatest common divisors of a, b are linear combinations of a and b. This is a
very strong requirement: for example, it characterizes PIDs among Noetherian do- 
mains, as the reader will check (Exercise 2.7); it does not hold in general UFDs 
(Exercise 2.4). 
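In the PID Z, the coefficients r, s with d = ra + sb can be computed explicitly. The sketch below uses the extended form of the Euclidean algorithm (the Euclidean algorithm itself is discussed in §2.4); the function name `bezout` is an assumption made for illustration, and the inputs are assumed to be nonnegative integers.

```python
from math import gcd

def bezout(a, b):
    """Return (d, r, s) with d = r*a + s*b and d a gcd of the integers a, b."""
    # Invariant: d0 = r0*a + s0*b and d1 = r1*a + s1*b at every step.
    r0, s0, d0 = 1, 0, a
    r1, s1, d1 = 0, 1, b
    while d1 != 0:
        q = d0 // d1
        r0, r1 = r1, r0 - q * r1
        s0, s1 = s1, s0 - q * s1
        d0, d1 = d1, d0 - q * d1
    return d0, r0, s0

d, r, s = bezout(240, 46)

# Compare with the standard library on a small range of positive integers.
consistent = all(bezout(a, b)[0] == gcd(a, b)
                 for a in range(1, 80) for b in range(1, 80))
```

No such pair of coefficients can exist for x and y in Z[x, y], which is the content of Exercise 2.4.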

2.4. Euclidean domain ⟹ PID. The excellent properties of Z and k[x] (where
k is a field) make these rings even more special than PIDs: they are Euclidean
domains.

Informally, Euclidean domains are rings in which one may perform a ‘division
with remainder’: this is the case for Z and for k[x], as observed in §III.4. The point
is that in both Z and k[x] one can define a notion of ‘size’ of an element: |n| for an
integer n and deg f(x) for a polynomial f(x). In both cases, one has control over
the size of the ‘remainder’ in a division. The definition of Euclidean domain simply
abstracts this mechanism.

For the purpose of this discussion⁷, a valuation on an integral domain R is any
function v : R ∖ {0} → Z^{≥0}.


6 This fact is known as the fundamental theorem of arithmetic.

7 Entire libraries have been written on the subject of valuations, studying a more precise
notion than what is needed here. 
Definition 2.7. A Euclidean valuation on an integral domain R is a valuation
satisfying the following property⁸: for all a ∈ R and all nonzero b ∈ R there exist
q, r ∈ R such that

a = qb + r,

with either r = 0 or v(r) < v(b). An integral domain R is a Euclidean domain if it
admits a Euclidean valuation. ⌟

We say that q is the quotient of the division and r is the remainder. Division
with remainder in Z and in k[x] (where k is a field) provide examples, so that Z
and k[x] are Euclidean domains.
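For k = Q, division with remainder in k[x] can be sketched in Python with exact rational arithmetic. This is a minimal illustration under stated assumptions: polynomials are stored as coefficient lists [c_0, c_1, . . .] (a convention chosen here), and the helper names `trim`, `divide`, and `evaluate` are hypothetical.

```python
from fractions import Fraction

def trim(f):
    """Drop trailing zero coefficients; the zero polynomial becomes []."""
    f = list(f)
    while f and f[-1] == 0:
        f.pop()
    return f

def divide(a, b):
    """Return (q, r) with a = q*b + r and either r = 0 or deg r < deg b, over Q."""
    a = trim([Fraction(c) for c in a])
    b = trim([Fraction(c) for c in b])
    if not b:
        raise ZeroDivisionError("division by the zero polynomial")
    q = [Fraction(0)] * max(len(a) - len(b) + 1, 0)
    r = a
    while len(r) >= len(b):
        shift = len(r) - len(b)
        coeff = r[-1] / b[-1]      # possible because the coefficients form a field
        q[shift] = coeff
        for i, c in enumerate(b):
            r[shift + i] -= coeff * c
        r = trim(r)                # the leading term cancels, so the degree drops
    return trim(q), r

def evaluate(f, t):
    """Value of the polynomial f at t."""
    return sum(c * t ** i for i, c in enumerate(f))

a = [1, 2, 0, 1]                   # x^3 + 2x + 1
b = [1, 2]                         # 2x + 1
q, r = divide(a, b)

# a = q*b + r, checked at more points than the degrees involved.
identity_holds = all(evaluate(a, t) == evaluate(q, t) * evaluate(b, t) + evaluate(r, t)
                     for t in range(-5, 6))
```

The division by the leading coefficient of b is the only place where the field hypothesis is used; over Z the same loop would fail for b = 2x + 1.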

Proposition 2.8. Let R be a Euclidean domain. Then R is a PID. 

The proof is modeled after the instances encountered for Z (Proposition III.4.4)
and k[x] (which the reader has hopefully worked out in Exercise III.4.4).

Proof. Let I be an ideal of R; we have to prove that I is principal. If I = {0},
there is nothing to show; therefore, assume I ≠ {0}. The valuation maps the
nonzero elements of I to a subset of Z^{≥0}; let b ∈ I be an element with the smallest
valuation. Then we claim that I = (b); therefore I is principal, as needed. Since
clearly (b) ⊆ I, we only need to verify that I ⊆ (b).

For this, let a ∈ I and apply division with remainder: we have

a = qb + r

for some q, r in R, with r = 0 or v(r) < v(b). But

r = a − qb ∈ I:

by the minimality of v(b) among nonzero elements of I, we cannot have v(r) < v(b).
Therefore r = 0, showing that a = qb ∈ (b), as needed. □
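The argument can be watched in action for an ideal of Z. The Python sketch below samples finitely many elements of I = (12, 18, 30), so it is only an illustration of the proof and not a computation with the whole ideal; the generators and the coefficient range are arbitrary choices made here.

```python
from itertools import product

generators = (12, 18, 30)
coefficients = range(-6, 7)

# A finite sample of the ideal I = (12, 18, 30) in Z: all combinations
# c_1*12 + c_2*18 + c_3*30 with small integer coefficients.
sample = {sum(c * g for c, g in zip(cs, generators))
          for cs in product(coefficients, repeat=len(generators))}

# The valuation on Z is the absolute value; pick a nonzero element minimizing it.
b = min((x for x in sample if x != 0), key=abs)

# As in the proof, every sampled element is a multiple of b.
every_element_is_a_multiple = all(x % b == 0 for x in sample)
```

The element found has absolute value 6, in agreement with (12, 18, 30) = (6).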

Proposition 2.8 justifies one more feature of our picture at the beginning of the 
chapter: the class of Euclidean domains is contained in the class of principal ideal 
domains. 

This inclusion is proper, as suggested in the picture. Producing an explicit 
example of a PID which is not a Euclidean domain is not so easy, but the gap 
between PIDs and Euclidean domains can in fact be described very sharply: PIDs
may be characterized as domains satisfying a weaker requirement than ‘division
with remainder’. 

More precisely, a ‘Dedekind-Hasse valuation’ is a valuation v such that ∀a, b,
either (a, b) = (b) (that is, b divides a) or there exists r ∈ (a, b) such that v(r) < v(b).
This latter condition amounts to requiring that there exist q, s ∈ R such that
as = bq + r with v(r) < v(b); hence a Euclidean valuation (for which we may in fact
choose s = 1) is a Dedekind-Hasse valuation. It is not hard to show that an integral
domain is a PID if and only if it admits a Dedekind-Hasse valuation (Exercise 2.21).
For example, this can be used to show that the ring Z[(1 + √−19)/2] is a PID: the

8 It is not uncommon to also require that v(ab) ≥ v(b) for all nonzero a, b ∈ R; but this is not
needed in the considerations that follow, and cf. Exercise 2.15.






norm considered in Exercise 2.18 in order to prove that this ring is not a Euclidean
domain turns out to be a Dedekind-Hasse valuation⁹. Thus, this ring gives an
example of a PID that is not a Euclidean domain.

One excellent feature of Euclidean domains, and the one giving them their
name, is the presence of an effective algorithm computing greatest common
divisors: the Euclidean algorithm. As Euclidean domains are PIDs, and hence UFDs,
we know that they do have greatest common divisors. However, the ‘algorithm’ 
obtained by distilling the proof of Lemma 2.3 is highly impractical; if we had to 
factor two integers a, b in order to compute their gcd, this would make it essentially 
impossible (with current technologies and factorization algorithms) for integers of 
a few hundred digits. The Euclidean algorithm bypasses the necessity of factoriza- 
tion: greatest common divisors of thousand-digit integers may be computed in a 
fraction of a second. 

The key lemma on which the algorithm is based is the following trivial general 
fact: 

Lemma 2.9. Let a = bq + r in a ring R. Then (a, b) = (b, r).

Proof. Indeed, r = a − bq ∈ (a, b), proving (b, r) ⊆ (a, b); and a = bq + r ∈ (b, r),
proving (a, b) ⊆ (b, r). □

In particular,

(∀c ∈ R), (a, b) ⊆ (c) ⟺ (b, r) ⊆ (c);

that is, the set of common divisors of a, b and the set of common divisors of b, r
coincide. Therefore,

Corollary 2.10. Assume a = bq + r. Then a, b have a gcd if and only if b, r have
a gcd, and in this case gcd(a, b) = gcd(b, r).

Of course ‘gcd(a, b) = gcd(b, r)’ means that the two classes of associate elements
coincide.

These considerations hold over any integral domain; assume now that R is a
Euclidean domain. Then we can use division with remainder to gain some control
over the remainders r. Given two elements a, b in R, with b ≠ 0, we can apply
division with remainder repeatedly:

a = b q_1 + r_1,
b = r_1 q_2 + r_2,
r_1 = r_2 q_3 + r_3,
⋯

as long as the remainder r_i is nonzero.

Claim 2.11. This process terminates: that is, r_N = 0 for some N.

9 This boils down to a case-by-case analysis, which we are happily leaving to our most patient
readers.
Proof. Each line in the table is a division with remainder. If no r_i were zero, we
would have an infinite decreasing sequence

v(b) > v(r_1) > v(r_2) > v(r_3) > ⋯

of nonnegative integers, which is nonsense. □

Thus the table of divisions with remainders must be as follows: letting r_0 = b,

a = r_0 q_1 + r_1,
b = r_1 q_2 + r_2,
r_1 = r_2 q_3 + r_3,
⋯
r_{N−3} = r_{N−2} q_{N−1} + r_{N−1},
r_{N−2} = r_{N−1} q_N

with r_{N−1} ≠ 0.

Proposition 2.12. With notation as above, r_{N−1} is a gcd of a, b.

Proof. By Corollary 2.10,

gcd(a, b) = gcd(b, r_1) = gcd(r_1, r_2) = ⋯ = gcd(r_{N−2}, r_{N−1}).

But r_{N−2} = r_{N−1} q_N gives r_{N−2} ∈ (r_{N−1}); hence (r_{N−2}, r_{N−1}) = (r_{N−1}). Therefore
r_{N−1} is a gcd for r_{N−2}, r_{N−1} and hence for a and b, as needed. □
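For R = Z, with valuation the absolute value, the table of divisions can be produced mechanically. The following Python sketch records each row; the function name `euclid_table` and the sample inputs are assumptions made for illustration, and b is assumed to be a positive integer.

```python
def euclid_table(a, b):
    """Rows (dividend, divisor, quotient, remainder) of repeated division in Z."""
    rows = []
    while b != 0:
        q, r = divmod(a, b)        # a = q*b + r with 0 <= r < b when b > 0
        rows.append((a, b, q, r))
        a, b = b, r                # by Lemma 2.9, (a, b) = (b, r)
    return rows

rows = euclid_table(1071, 462)
remainders = [row[3] for row in rows]

# The last divisor is the last nonzero remainder (or b itself if b divides a).
last_nonzero_remainder = rows[-1][1]

# The remainders strictly decrease, which is why the process terminates.
strictly_decreasing = all(x > y for x, y in zip(remainders, remainders[1:]))
```

For these inputs the table reads 1071 = 462·2 + 147, 462 = 147·3 + 21, 147 = 21·7, so 21 is a gcd, obtained without factoring either number.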

The ring of integers and the polynomial ring over a field are both Euclidean 
domains. Fields are Euclidean domains (as represented in the picture at the be- 
ginning of the chapter), but not for a very interesting reason: the remainder of the 
division by a nonzero element in a field is always zero, so every function qualifies 
as a ‘Euclidean valuation’ for trivial reasons. 

We will study another interesting Euclidean domain later in this chapter (§6.2). 


Exercises 


2.1. ▷ Prove Lemma 2.1. [§2.1]

2.2. Let R be a UFD, and let a, b, c be elements of R such that a | bc and gcd(a, b) = 1.
Prove that a divides c.

2.3. Let n be a positive integer. Prove that there is a one-to-one correspondence 
preserving multiplicities between the irreducible factors of n (as an integer) and 
the composition factors of Z/nZ (as a group). (In fact, the Jordan-Hölder theorem
may be used to prove that Z is a UFD.) 

2.4. ▷ Consider the elements x, y in Z[x, y]. Prove that 1 is a gcd of x and y, and
yet 1 is not a linear combination of x and y. (Cf. Exercise II.2.13.) [§2.1, §2.3]

2.5. ▷ Let R be the subring of Z[t] consisting of polynomials with no term of
degree 1: a_0 + a_2 t^2 + ⋯ + a_d t^d.

• Prove that R is indeed a subring of Z[t], and conclude that R is an integral
domain.

• List all common divisors of t^5 and t^6 in R.

• Prove that t^5 and t^6 have no gcd in R.

[§2.1]

2.6. Let R be a domain with the property that the intersection of any family of
principal ideals in R is necessarily a principal ideal. 

• Show that greatest common divisors exist in R. 

• Show that UFDs satisfy this property. 

2.7. ▷ Let R be a Noetherian domain, and assume that for all nonzero a, b in R,
the greatest common divisors of a and b are linear combinations of a and b. Prove
that R is a PID. [§2.3]

2.8. Let R be a UFD, and let I ≠ (0) be an ideal of R. Prove that every descending
chain of principal ideals containing I must stabilize. 

2.9. ¬ The height of a prime ideal P in a ring R is (if finite) the maximum length h
of a chain of prime ideals P_0 ⊊ P_1 ⊊ ⋯ ⊊ P_h = P in R. (Thus, the Krull dimension
of R, if finite, is the maximum height of a prime ideal in R.) Prove that if R is a
UFD, then every prime ideal of height 1 in R is principal. [2.10]

2.10. ¬ It is a consequence of a theorem known as Krull’s Hauptidealsatz that every
nonzero element in a Noetherian domain is contained in a prime ideal of height 1.
Assuming this, prove a converse to Exercise 2.9, and conclude that a Noetherian
domain R is a UFD if and only if every prime ideal of height 1 in R is principal.
[4.16]

2.11. Let R be a PID, and let I be a nonzero ideal of R. Show that R/I is an
Artinian ring (cf. Exercise 1.10), by proving explicitly that the d.c.c. holds in R/I.

2.12. ▷ Prove that if R[x] is a PID, then R is a field. [§2.3, §VI.7.1]

2.13. ▷ For a, b, c positive integers with c > 1, prove that c^a − 1 divides c^b − 1 if
and only if a | b. Prove that x^a − 1 divides x^b − 1 in Z[x] if and only if a | b. (Hint:
For the interesting implications, write b = ad + r with 0 ≤ r < a, and take ‘size’
into account.) [§VII.5.1, VII.5.13]

2.14. ▷ Prove that if k is a field, then k[[x]] is a Euclidean domain. [§4.3]

2.15. ▷ Prove that if R is a Euclidean domain, then R admits a Euclidean valuation
v̄ such that v̄(ab) ≥ v̄(b) for all nonzero a, b ∈ R. (Hint: Since R is a Euclidean
domain, it admits a valuation v as in Definition 2.7. For a ≠ 0, let v̄(a) be the
minimum of all v(ab) as b ∈ R, b ≠ 0. To see that R is a Euclidean domain with
respect to v̄ as well, let a, b be nonzero in R, with b ∤ a; choose q, r so that a = bq + r,
with v(r) minimal; assume that v̄(r) ≥ v̄(b), and get a contradiction.) [§2.4, 2.16]
2.16. Let R be a Euclidean domain with Euclidean valuation v; assume that
v(ab) ≥ v(b) for all nonzero a, b ∈ R (cf. Exercise 2.15). Prove that associate
elements have the same valuation and that units have minimum valuation.

2.17. ¬ Let R be a Euclidean domain that is not a field. Prove that there exists a
nonzero, nonunit element c in R such that ∀a ∈ R, ∃q, r ∈ R with a = qc + r and
either r = 0 or r a unit. [2.18]

2.18. ▷ For an integer d, denote by Q(√d) the smallest subfield of C containing
Q and √d, with norm N defined as in Exercise III.4.10. See Exercise 1.17 for the
case d = −5; in this problem, you will take d = −19.

Let δ = (1 + i√19)/2, and consider the following subring of Q(√−19):

Z[δ] := {a + b(1 + i√19)/2 | a, b ∈ Z}.

• Prove that the smallest values of N(z) for z = a + bδ ∈ Z[δ] are 0, 1, 4, 5. Prove
that N(a + bδ) ≥ 5 if b ≠ 0.

• Prove that the units in Z[δ] are ±1.

• If c ∈ Z[δ] satisfies the condition specified in Exercise 2.17, prove that c must
divide 2 or 3 in Z[δ], and conclude that c = ±2 or c = ±3.

• Now show that ∄q ∈ Z[δ] such that δ = qc + r with c = ±2, ±3 and r = 0, ±1.
Conclude that Z[(1 + √−19)/2] is not a Euclidean domain. [§2.4, 6.14]

2.19. ¬ A discrete valuation on a field k is a homomorphism of abelian groups
v : (k*, ·) → (Z, +) such that v(a + b) ≥ min(v(a), v(b)) for all a, b ∈ k* such that
a + b ∈ k*.

• Prove that the set R := {a ∈ k* | v(a) ≥ 0} ∪ {0} is a subring of k.

• Prove that R is a Euclidean domain.

Rings arising in this fashion are called discrete valuation rings, abbreviated DVR.
They arise naturally in number theory and algebraic geometry. Note that the
Krull dimension of a DVR is 1 (Example III.4.14); in algebraic geometry, DVRs
correspond to particularly nice points on a ‘curve’.

• Prove that the ring of rational numbers a/b with b not divisible by a fixed prime
integer p is a DVR.

[2.20, VIII.1.19]

2.20. ¬ As seen in Exercise 2.19, DVRs are Euclidean domains. In particular, they
must be PIDs. Check this directly, as follows. Let R be a DVR, and let t ∈ R be an
element such that v(t) = 1. Prove that if I ⊆ R is any nonzero ideal, then I = (t^k)
for some k ≥ 0. (The element t is called a ‘local parameter’ of R.) [4.13, VII.2.18]

2.21. ▷ Prove that an integral domain is a PID if and only if it admits a Dedekind-Hasse
valuation. (Hint: For the ⟸ implication, adapt the argument in Proposition 2.8;
for ⟹, let v(a) be the size of the multiset of irreducible factors of a.)
[§2.4]
2.22. ¬ Suppose R ⊆ S is an inclusion of integral domains, and assume that R is
a PID. Let a, b ∈ R, and let d ∈ R be a gcd for a and b in R. Prove that d is also
a gcd for a and b in S. [5.2]

2.23. Compute d = gcd(5504227617645696, 2922476045110123). Further, find a, b
such that d = 5504227617645696 a + 2922476045110123 b.

2.24. ▷ Prove that there are infinitely many prime integers. (Hint: Assume by
contradiction that p_1, . . . , p_N is a complete list of all positive prime integers. What
can you say about p_1 ⋯ p_N + 1? This argument was already known to Euclid,
more than 2,000 years ago.) [2.25, §5.2, 5.11]

2.25. ¬ Variation on the theme of Euclid from Exercise 2.24: Let f(x) ∈ Z[x] be
a nonconstant polynomial such that f(0) = 1. Prove that infinitely many primes
divide the numbers f(n), as n ranges in Z. (If p_1, . . . , p_N were a complete list of
primes dividing the numbers f(n), what could you say about f(p_1 ⋯ p_N x)?)

Once you are happy with this, show that the hypothesis f(0) = 1 is unnecessary.
(If f(0) = a ≠ 0, consider f(p_1 ⋯ p_N a x). Finally, note that there is nothing special
about 0.) [VII.5.18]


3. Intermezzo: Zorn’s lemma 

3.1. Set theory, reprise. We leave ring theory for a moment and take a little 
detour to contemplate an issue from set theory. As remarked at the very outset, 
only naive set theory is used in this book; all set-theoretic operations we have used 
so far are nothing more than a formalization of intuitive ideas regarding collections 
of objects. However, we will occasionally need to refer to a less ‘intuitively obvious’ 
set-theoretic statement: for example, this statement is needed in order to show that 
every ideal in a ring is contained in a maximal ideal (Proposition 3.5). 

This set-theoretic fact is Zorn’s lemma. An order relation on a set Z is a
relation ⪯ which is reflexive, transitive, and antisymmetric: the first two terms are
familiar to the reader, and the third means that

(∀a, b ∈ Z), a ⪯ b and b ⪯ a ⟹ a = b.

Typical prototypes are the ≤ relation on ℤ or the inclusion relation ⊆ among subsets
of a given set. We use a ≺ b to denote a ⪯ b and a ≠ b.

A pair (Z, ⪯), consisting of a set Z and an order relation ⪯ on Z, is called
a poset, for partially ordered set. The qualifier ‘partially’ is not necessary, but
convenient, as it reminds us that if a, b ∈ Z, then it is not necessarily the case
that a ⪯ b or b ⪯ a: for example, ⊆ does not satisfy this additional requirement
in general (while (ℤ, ≤) does). An order is total if it does satisfy this additional
requirement. A totally ordered set is not called a ‘toset’, as would seem reasonable,
but rather a chain.

An element m of a poset Z is maximal if nothing comes properly ‘after’ it in 
the order: 

(∀a ∈ Z), m ⪯ a ⟹ m = a.
For example, maximal ideals in a ring are maximal elements in the set of proper
ideals, that is, ideals ≠ (1) (by Proposition III.4.11). An upper bound for a subset
S of a poset Z is an element u ∈ Z coming after every element of S:

(∀a ∈ S), a ⪯ u.

Notions of ‘smallest’ (or, rather, ‘least’) or ‘largest’ are defined in the evident 
way. 

Posets may or may not have maximal elements, upper bounds, etc.: for example,
the family of finite subsets of an infinite set does not have maximal elements.
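On a finite poset, all of this terminology can be checked by brute force. The Python sketch below is only meant to make the definitions concrete (Zorn’s lemma is of interest for infinite posets, which no such enumeration can reach); the function names `maximal_elements`, `is_chain`, and `upper_bounds` are assumptions made for illustration.

```python
from itertools import combinations

def maximal_elements(elements, leq):
    """Elements m such that m ⪯ a forces m = a."""
    return [m for m in elements
            if all(a == m or not leq(m, a) for a in elements)]

def is_chain(subset, leq):
    """True when any two elements of the subset are comparable."""
    return all(leq(a, b) or leq(b, a) for a, b in combinations(subset, 2))

def upper_bounds(subset, elements, leq):
    """Elements u of the poset with a ⪯ u for every a in the subset."""
    return [u for u in elements if all(leq(a, u) for a in subset)]

# The proper subsets of {1, 2, 3}, ordered by inclusion.
ground = {1, 2, 3}
proper_subsets = [frozenset(c) for k in range(len(ground))
                  for c in combinations(sorted(ground), k)]

def inclusion(a, b):
    return a <= b

maximal = maximal_elements(proper_subsets, inclusion)
chain = [frozenset(), frozenset({1}), frozenset({1, 2})]
bounds = upper_bounds(chain, proper_subsets, inclusion)
```

This poset has three maximal elements and no largest element, and the two singletons {1}, {2} do not form a chain.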

All this terminology comes together in the following statement: 

Lemma 3.1 (Zorn’s lemma). Let Z be a nonempty poset. Assume that every chain 
in Z has an upper bound in Z; then there exists a maximal element in Z. 

The status of Zorn’s lemma is peculiar: on the one hand, it is complex enough 
that no one we know of finds it ‘intuitively clear’; on the other hand, it turns out to 
be logically equivalent¹⁰ to the axiom of choice, which does look reasonable to most
people, and to the ‘well-ordering theorem’, which we find intuitively unreasonable.
We will not prove all these equivalences (the diligent reader will prove them in the
exercises and should in any case have no trouble locating more detailed proofs);
but we will attempt to describe the situation in general terms and explain why the 
unreasonable statement implies the other two. 

The axiom of choice states that if ℱ is a family of disjoint nonempty subsets of
a set Z, then we can form a new set by selecting one element x from each X ∈ ℱ.
This may sound reasonable, but it raises rather subtle points: for example, it can
be shown that this axiom is independent of the other axioms of (Zermelo-Fraenkel)
set theory. Also, it has disturbingly counterintuitive consequences, such as the
Banach-Tarski paradox¹¹.

The subtlety of the axiom of choice boils down to the following question: how
does one choose the element x? This is not controversial if Z (and hence ℱ) is
finite, but, not surprisingly, it becomes an issue if Z is infinite.

A suitable order relation on Z would come in handy here: if ⪯ is an order
relation on Z such that every nonempty subset of Z has a least element, then we
could simply let x be the least element of X for each X ∈ ℱ. We say that Z
is well-ordered by ⪯, or that ⪯ is a well-ordering on Z, if this is the case. (The
abbreviation woset is also used in the literature, but not very often.) For example,
the set ℤ^{>0} of positive numbers is well-ordered by ≤; this fact is called the
well-ordering principle.
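For subsets of the positive integers, the rule ‘take the least element’ is an explicit choice. A toy Python sketch follows; the particular family is an arbitrary example chosen here, and a finite family of finite sets is of course not where the axiom of choice is at issue.

```python
# A family of disjoint nonempty sets of positive integers.
family = [frozenset({7, 3, 12}), frozenset({40, 41}),
          frozenset({5}), frozenset({100, 2, 9})]

# Since the positive integers are well-ordered by <=, each X has a least element,
# and X -> min(X) selects one element from each X with no arbitrary decisions.
choice = {X: min(X) for X in family}

selected = set(choice.values())
```

The same rule works verbatim for any family of nonempty subsets of a well-ordered set.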

Thus, the statement of the axiom of choice is really ‘clear’ if the set Z is the
set ℤ^{>0} of positive numbers. It may seem somewhat less so for the set ℤ, since ℤ
is not well-ordered by ≤; however, it takes a moment (Exercise 3.4) to construct a


10 Equivalent in the sense that each can be derived from the other together with the other
axioms of ‘Zermelo-Fraenkel’ set theory.

11 One can subdivide a solid ball of radius 1 into finitely many pieces, then reassemble them
after rotations and translations in 3-space and obtain two balls of radius 1.
different relation on ℤ, making ℤ a well-ordered set. Thus, the axiom of choice is
also completely transparent for Z = ℤ or any countable set (like Q) for that matter.

The well-ordering principle is at the basis of proofs by induction: in fact (cf. Ex- 
ercise 3.5) it is equivalent to the so-called principle of induction. More is true, 
however. If (Z, ⪯) is any woset, then we can consider the following ‘principle of
induction’ on Z: 

Let S C Z be a subset such that Va € Z 

((Vb € Z), b -< a =h b € S) =h a € S\ 

then S = Z. 

That is, Z is the only set S with the property that a€Sifallfc-<a are in S. 
(Note that the least element o of Z is automatically in S, since the condition on 
all 6 -< o is vacuously true in this case.) 

The reader will recognize that for Z = Z >0 with the relation <, this is the 
ordinary principle of induction. 

Claim 3.2. Let Z be any woset. Then the principle of induction holds for Z. 

Proof. Let $S \subseteq Z$ be a subset with the property given above, and assume $S \ne Z$. 
Then the complement T of S in Z is nonempty; hence it has a least element a. 
Now if $b \prec a$, then necessarily $b \in S$ (otherwise $b \in T$, contradicting the fact that a 
is the least element in T). But the property of S then forces $a \in S$, a contradiction. 
Therefore T is empty; i.e., S = Z. □ 

This is remarkable, since it extends induction to uncountable sets 12 , provided 
a well-ordering is available. 

For example, if we had a well-ordering on $\mathbb{R}$, then we could prove a statement 
about all reals ‘by induction’. Many people find this somewhat counterintuitive, 
and hence so seems the following amazing claim: 

Theorem 3.3 (Well-ordering theorem). Every set admits a well-ordering. 

As argued above, this implies that the axiom of choice holds on every set: if you 
accept the well-ordering theorem, the statement of the axiom of choice becomes as 
evident for every set as it is for the set of positive integers. The catch is of course 
that the axiom of choice is used in the proof of the well-ordering theorem; in fact, 
the statements are equivalent. 

In any case, the statement of the well-ordering theorem is easy to absorb (even 
if perhaps less ‘intuitively clear’ than the axiom of choice). The well-ordering 
theorem is equivalent to Zorn’s lemma, since they are both equivalent to the axiom 
of choice. The good news is that the derivation of Zorn’s lemma from the well- 
ordering theorem is reasonably straightforward: 

Well-ordering theorem $\implies$ Zorn’s lemma. Let $(Z, \le)$ be a nonempty poset such 
that every chain in Z has an upper bound in Z. By the well-ordering theorem, there 
is a well-ordering 13 $\preceq$ on Z. Define a function f from Z to the power set of Z as 
follows: 

$$f(a) = \begin{cases} \{a\} & \text{if } \{a\} \cup \bigcup_{b \prec a} f(b) \text{ is totally ordered by } \le; \\ \emptyset & \text{if } \{a\} \cup \bigcup_{b \prec a} f(b) \text{ is not totally ordered by } \le. \end{cases}$$ 

The fact that Z is well-ordered by $\preceq$ implies that f is defined for every $a \in Z$. 
Indeed, let T be the set of elements of Z for which f is not defined; if $T \ne \emptyset$, 
T has a least element a w.r.t. $\preceq$; but then f(b) is defined for all $b \prec a$, and the 
prescription given for f(a) defines f at a, a contradiction. 

12 Even beyond; in the context of ordinals this induction principle is called transfinite 
induction. 

Let $S = \bigcup_{a \in Z} f(a)$. It is clear that S is totally ordered by $\le$: if $a, b \in S$, and 
(say) $b \prec a$, then both a and b belong to 

$$\{a\} \cup \bigcup_{c \prec a} f(c),$$ 

which is totally ordered by $\le$ by construction. 

We claim that S is a maximal totally ordered subset 14 of Z: 

Claim 3.4. If $S \subseteq S' \subseteq Z$ and S' is totally ordered by $\le$, then S = S'. 

Indeed, let T be the complement of S in S'. If T is nonempty, let $a \in T$ and 
observe that 

$$\{a\} \cup \bigcup_{b \prec a} f(b)$$ 

is totally ordered by $\le$, since it is a subset of $\{a\} \cup S \subseteq S'$. But then f(a) = {a}, 
that is, $a \in S$, a contradiction. 

Since S is a chain, by the hypothesis of Zorn’s lemma S has an upper bound, 
m. It is now clear that m must be maximal in Z w.r.t. $\le$, verifying the statement 
of Zorn’s lemma. Indeed, if $m' \ge m$, then $S \cup \{m'\}$ is totally ordered; hence 
$S = S \cup \{m'\}$ by the claim. This means $m' \in S$, and hence m' = m since m is an 
upper bound for S. □ 

Zorn’s lemma is the key to several basic results in algebra, the first of which we 
are about to encounter; the reader will sample a few more results in the exercises. 
The reader will also encounter equally basic applications of Zorn’s lemma (or of 
other manifestations of the axiom of choice) in other fields: for example Tychonoff’s 
theorem in topology and the Hahn-Banach theorem in functional analysis. 


3.2. Application: Existence of maximal ideals. Recall (§III.4.3) that an ideal 
$\mathfrak{m}$ of a ring R is maximal if and only if $R/\mathfrak{m}$ is a field, if and only if no other ideal 
stands between $\mathfrak{m}$ and R = (1), that is, if and only if $\mathfrak{m}$ is maximal with respect to 
inclusion, in the family of proper ideals of R. It is not obvious from this definition 
that maximal ideals exist, but they do: 

Proposition 3.5. Let $I \ne (1)$ be a proper ideal of a commutative ring R. Then 
there exists a maximal ideal $\mathfrak{m}$ of R containing I. 

13 Watch out: we are considering two orderings on Z: $\le$ (about which we want to say 
something) and $\preceq$ (about which we know something already). 

14 The existence of maximal totally ordered subsets is known as the Hausdorff maximal 
principle; it is also equivalent to the axiom of choice. 

This is immediate if R is Noetherian, by applying condition (3) from 
Proposition 1.1 to the family of proper ideals of R containing I. The argument for 
arbitrary rings is a classic application of Zorn’s lemma. In fact, this is the first 
application given by M. Zorn in his 1935 article introducing a ‘maximum principle’ 
(now known as Zorn’s lemma). The result had earlier been proven by Krull, using 
the well-ordering theorem. 

Proof. The set $\mathscr{I}$ of proper ideals of R containing I is ordered by inclusion. Then 
let $\mathscr{C} \subseteq \mathscr{I}$ be a chain of proper ideals, and consider 

$$U := \bigcup_{J \in \mathscr{C}} J.$$ 

We claim that U is a proper ideal containing I; hence it is an upper bound for $\mathscr{C}$ 
in $\mathscr{I}$. This proves that every chain in $\mathscr{I}$ has an upper bound, and it follows that 
$\mathscr{I}$ has maximal elements, by Zorn’s lemma. 

To verify our claim, it is clear that U contains I and that it is an ideal (for 
example, if $a, b \in U$, then $\exists J \in \mathscr{C}$ such that $a, b \in J$; hence $a \pm b \in J$, and therefore 
$a \pm b \in U$). We have to check that U is proper. But if U = (1), then $1 \in J$ for some 
$J \in \mathscr{C}$, contradicting the fact that $\mathscr{C}$ consists of proper ideals. □ 
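
For a finite ring the conclusion of Proposition 3.5 can be checked by brute force, with no appeal to Zorn’s lemma. The following Python sketch does this for $\mathbb{Z}/n\mathbb{Z}$, using the standard fact that every ideal of $\mathbb{Z}/n\mathbb{Z}$ is generated by a divisor of n. The function names are ours and purely illustrative. 

```python
def principal_ideal(n, g):
    """The ideal of Z/nZ generated by g, as a frozenset of residues."""
    return frozenset((g * k) % n for k in range(n))


def maximal_ideals_containing(n, g):
    """Generators d | n of the maximal ideals of Z/nZ (n >= 2) that contain (g)."""
    I = principal_ideal(n, g)
    # Every ideal of Z/nZ is (d) for a divisor d of n; d >= 2 gives the proper ones.
    proper = {d: principal_ideal(n, d) for d in range(2, n + 1) if n % d == 0}
    # Keep only the proper ideals containing I.
    proper = {d: J for d, J in proper.items() if I <= J}
    # An ideal is maximal if no other proper ideal strictly contains it.
    return sorted(d for d, J in proper.items()
                  if not any(J < J2 for J2 in proper.values()))


print(maximal_ideals_containing(12, 4))  # (2) and (4) contain (4); only (2) is maximal
print(maximal_ideals_containing(12, 0))  # maximal ideals of Z/12Z: (2) and (3)
```

If g is a unit, then (g) = (1) is not proper and the function returns an empty list, in agreement with the hypothesis $I \ne (1)$ in the proposition. 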

Note that this argument relies crucially on the fact that R has a multiplicative 
unit 1, and indeed the stated fact is not true in general for rings without 1 (Exer- 
cise 3.9). Also, the use of Zorn’s lemma is not just a convenient trick; the statement 
is known to be equivalent to the axiom of choice, by work of W. Hodges. 


Exercises 


3.1. Prove that every well-ordering is total. 


3.2. Prove that a totally ordered set $(Z, \preceq)$ is a woset if and only if every descending 
chain 

$$z_1 \succeq z_2 \succeq z_3 \succeq \cdots$$ 

in Z stabilizes. 


3.3. Prove that the axiom of choice is equivalent to the statement that a set-function 
is surjective if and only if it has a right-inverse (cf. Exercise I.2.2). 

3.4. ▷ Construct explicitly a well-ordering on $\mathbb{Z}$. Explain why you know that $\mathbb{Q}$ 
can be well-ordered, even without performing an explicit construction. [§3.1] 

3.5. ▷ Prove that the (ordinary) principle of induction is equivalent to the statement 
that $\le$ is a well-ordering on $\mathbb{Z}^{>0}$. (To prove by induction that $(\mathbb{Z}^{>0}, \le)$ 
is well-ordered, assume it is known that 1 is the least element of $\mathbb{Z}^{>0}$ and that 
$\forall n \in \mathbb{Z}^{>0}$ there are no integers between n and n + 1.) [§3.1] 
3.6. In this exercise assume the truth of Zorn’s lemma and the conventional 
set-theoretic constructions; you will be proving the well-ordering theorem. 

Let Z be a nonempty set, and let $\mathscr{Z}$ be the set of pairs $(S, \le)$ consisting of a 
subset S of Z and of a well-ordering $\le$ on S. Note that $\mathscr{Z}$ is not empty (singletons 
can be well-ordered). Define a relation $\preceq$ on $\mathscr{Z}$ by prescribing 

$$(S, \le) \preceq (T, \le')$$ 

if and only if $S \subseteq T$, $\le$ is the restriction of $\le'$ to S, and every element of S precedes 
every element of $T \smallsetminus S$ w.r.t. $\le'$. 

• Prove that $\preceq$ is an order relation in $\mathscr{Z}$. 

• Prove that every chain in $\mathscr{Z}$ has an upper bound in $\mathscr{Z}$. 

• Use Zorn’s lemma to obtain a maximal element $(M, \le)$ in $\mathscr{Z}$. Prove that M = Z. 
Thus every set admits a well-ordering, as stated in Theorem 3.3. 

3.7. In this exercise assume the truth of the axiom of choice and the conventional 
set-theoretic constructions; you will be proving the well-ordering theorem 15. 

Let Z be a nonempty set. Use the axiom of choice to choose an element 
$\gamma(S) \notin S$ for each proper subset $S \subsetneq Z$. Call a pair $(S, \le)$ a $\gamma$-woset if $S \subseteq Z$, $\le$ 
is a well-ordering on S, and for every $a \in S$, $a = \gamma(\{b \in S,\ b < a\})$. 

• Show how to begin constructing a $\gamma$-woset, and show that all $\gamma$-wosets must 
begin in the same way. 

Define an ordering on $\gamma$-wosets by prescribing that $(U, \le'') \preceq (T, \le')$ if and only if 
$U \subseteq T$ and $\le''$ is the restriction of $\le'$. 

• Prove that if $(U, \le'') \prec (T, \le')$, then $\gamma(U) \in T$. 

• For two $\gamma$-wosets $(S, \le)$ and $(T, \le')$, prove that there is a maximal $\gamma$-woset 
$(U, \le'')$ preceding both w.r.t. $\preceq$. (Note: There is no need to use Zorn’s lemma!) 

• Prove that the maximal $\gamma$-woset found in the previous point in fact equals $(S, \le)$ 
or $(T, \le')$. Thus, $\preceq$ is a total ordering. 

• Prove that there is a maximal $\gamma$-woset $(M, \le)$ w.r.t. $\preceq$. (Again, Zorn’s lemma 
need not and should not be invoked.) 

• Prove that M = Z. 

Thus every set admits a well-ordering, as stated in Theorem 3.3. 

3.8. Prove that every nontrivial finitely generated group has a maximal proper 
subgroup. Prove that $(\mathbb{Q}, +)$ has no maximal proper subgroup. 

3.9. ▷ Consider the rng (= ring without 1; cf. §III.1.1) consisting of the abelian 
group $(\mathbb{Q}, +)$ endowed with the trivial multiplication qr = 0 for all $q, r \in \mathbb{Q}$. Prove 
that this rng has no maximal ideals. [§3.2] 

3.10. ¬ As shown in Exercise III.4.17, every maximal ideal in the ring of continuous 
real-valued functions on a compact topological space K consists of the functions 
vanishing at a point of K. 

Prove that there are maximal ideals in the ring of continuous real-valued functions 
on the real line that do not correspond to points of the real line in the same 
fashion. (Hint: Produce a proper ideal that is not contained in any maximal ideal 
corresponding to a point, and apply Proposition 3.5.) [III.4.17] 

15 We learned this proof from notes of Dan Grayson. 

3.11. Prove that a UFD R is a PID if and only if every nonzero prime ideal in R is 
maximal. (Hint: One direction is Proposition III.4.13. For the other, assume that 
every nonzero prime ideal in a UFD R is maximal, and prove that every maximal 
ideal in R is principal; then use Proposition 3.5 to relate arbitrary ideals to maximal 
ideals, and prove that every ideal of R is principal.) 

3.12. ¬ Let R be a commutative ring, and let $I \subseteq R$ be a proper ideal. Prove that 
the set of prime ideals containing I has minimal elements. (These are the minimal 
primes of I.) [1.9] 

3.13. ¬ Let R be a commutative ring, and let N be its nilradical (Exercise III.3.12). 
Let $r \notin N$. 

• Consider the family $\mathscr{F}$ of ideals of R that intersect the ideal (r) only at 0. Prove 
that $\mathscr{F}$ has maximal elements. 

• Let I be a maximal element of $\mathscr{F}$. Prove that I is prime. 

• Conclude $r \notin N \implies$ r is not in the intersection of all prime ideals of R. 

Together with Exercise III.4.18, this shows that the nilradical of a commutative 
ring R equals the intersection of all prime ideals of R. [III.4.18, VII.2.8] 

3.14. ¬ The Jacobson radical of a commutative ring R is the intersection of the 
maximal ideals in R. (Thus, the Jacobson radical contains the nilradical.) Prove 
that r is in the Jacobson radical if and only if 1 + rs is invertible for every $s \in R$. 
[VI.3.8] 

3.15. Recall that a (commutative) ring R is Noetherian if every ideal of R is finitely 
generated. Assume the seemingly weaker condition that every prime ideal of R is 
finitely generated. Let $\mathscr{F}$ be the family of ideals that are not finitely generated in 
R. You will prove $\mathscr{F} = \emptyset$. 

• If $\mathscr{F} \ne \emptyset$, prove that it has a maximal element I. 

• Prove that R/I is Noetherian. 

• Prove that there are ideals $J_1$, $J_2$ containing I, such that $J_1 J_2 \subseteq I$. 

• Give a structure of R/I-module to $I/J_1J_2$ and $J_1/J_1J_2$. 

• Prove that $I/J_1J_2$ is a finitely generated R/I-module. 

• Prove that I is finitely generated, thereby reaching a contradiction. 

Thus, a ring is Noetherian if and only if its prime ideals are finitely generated. 


4. Unique factorization in polynomial rings 

We now return to regular programming and study unique factorization in polynomial 
rings; we will finally establish the fact (already hinted at) that R[x] is a UFD 
if R is a UFD. 







Among the necessary preparatory work we also discuss the field of fractions of 
an integral domain; this is a particular instance of the process of localization of a 
ring (or a module) at a multiplicative subset, which the diligent reader will explore 
a little more in the exercises. 

4.1. Primitivity and content; Gauss’s lemma. One issue that we have 
encountered already, and will encounter again, is the description of the ideals of 
the polynomial ring R[x], given enough information about R. For example, we 
have proved that the ideals of k[x] are principal if k is a field, and Hilbert’s basis 
theorem shows that all ideals of R[x] are finitely generated if all ideals of R are 
(Exercise III.4.4 and Lemma 1.3). 

An even more naive observation is simply that every ideal I of R generates an 
ideal of R[x]: 

$$IR[x] := \{a_0 + a_1 x + \cdots + a_d x^d \in R[x] \mid \forall i,\ a_i \in I\}.$$ 

Lemma 4.1. Let R be a ring, and let I be an ideal of R. Then 

$$\frac{R[x]}{IR[x]} \cong \frac{R}{I}[x].$$ 

The proof of this lemma is a standard application of the first isomorphism 
theorem and is left to the reader (Exercise 4.1). 

Corollary 4.2. If I is a prime ideal of R, then IR[x] is prime in R[x]. 

Proof. If I is prime in R, then R/I is an integral domain; hence so is 
$R[x]/IR[x] \cong (R/I)[x]$, and therefore IR[x] is prime in R[x]. □ 
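
For $R = \mathbb{Z}$ and I = (n), the isomorphism of Lemma 4.1 is induced by reducing coefficients modulo n, and IR[x] consists of the polynomials that reduce to zero. The Python sketch below makes this concrete and illustrates why primality is needed in Corollary 4.2. Polynomials are represented as coefficient lists [a_0, a_1, ...], and all function names are our own illustrative choices. 

```python
def poly_mul(f, g):
    """Product of two polynomials given as coefficient lists [a_0, a_1, ...]."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out


def reduce_mod(f, n):
    """Image of f under Z[x] -> (Z/nZ)[x]: reduce each coefficient mod n."""
    return [a % n for a in f]


def in_extended_ideal(f, n):
    """f lies in nZ[x] exactly when its reduction mod n is the zero polynomial."""
    return all(c == 0 for c in reduce_mod(f, n))


f = [2, 2]                                   # 2 + 2x
# (4) is not prime in Z, and 4Z[x] is not prime in Z[x]:
print(in_extended_ideal(f, 4))               # False
print(in_extended_ideal(poly_mul(f, f), 4))  # True: (2 + 2x)^2 = 4 + 8x + 4x^2
```

For a prime such as n = 5 no pair of this kind can exist, since $(\mathbb{Z}/5\mathbb{Z})[x]$ is an integral domain. 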

We will use this fact in a moment. 

The following definitions can be studied for every commutative ring; our main 
application will be to UFDs. 

Definition 4.3. Let R be a commutative ring, and let 

$$f = a_0 + a_1 x + \cdots + a_d x^d \in R[x]$$ 

be a polynomial. 

• f is very primitive if for all prime ideals $\mathfrak{p}$ of R, $f \notin \mathfrak{p}R[x]$. 

• f is primitive if for all principal prime ideals $\mathfrak{p}$ of R, $f \notin \mathfrak{p}R[x]$. 

The notion of ‘primitive’ polynomial is standard, but it is not usually pre- 
sented in this way; the standard definition is the equivalent formulation given in 
Lemma 4.5. As for ‘very primitive’, this term is essentially a joke — our excuse for 
bringing up this notion is that it is very natural and that unfortunately some ref- 
erences blur the distinction between this notion and the notion of ‘primitive’. We 
hope to ward off any possible confusion by being rather explicit on this point. Very 
primitive polynomials are primitive, but the converse does not hold in general, even 
for UFDs (cf. Exercise 4.3). 

Perhaps the most important fact about ‘primitivity’ is the following easy re- 
mark. 






Lemma 4.4. Let R be a commutative ring. Then for $f, g \in R[x]$ 

fg is primitive $\iff$ both f and g are primitive. 

Proof. This is an easy consequence of Corollary 4.2: 

$$\begin{aligned} fg \text{ primitive} &\iff \forall \mathfrak{p} \text{ prime and principal in } R,\ fg \notin \mathfrak{p}R[x] \\ &\iff \forall \mathfrak{p} \text{ prime and principal in } R,\ f \notin \mathfrak{p}R[x] \text{ and } g \notin \mathfrak{p}R[x] \\ &\iff f \text{ is primitive and } g \text{ is primitive} \end{aligned}$$ 

since $\mathfrak{p}R[x]$ is prime if $\mathfrak{p}$ is a prime ideal. □ 

The analogous equivalence holds for very primitive polynomials (Exercise 4.4). 
Lemma 4.4 will be ultimately responsible for the fact that R[x] is a UFD if R is 
a UFD, which is our main objective in this section. If R is a UFD, the notion 
of ‘primitive’ has the following interpretation; we accompany it with the ‘very 
primitive’ case, for comparison. 

Lemma 4.5. Let R be a commutative ring and $f = a_0 + a_1 x + \cdots + a_d x^d \in R[x]$ 
as above. 

• f is very primitive if and only if $(a_0, \dots, a_d) = (1)$. 

• If R is a UFD, then f is primitive if and only if $\gcd(a_0, \dots, a_d) = 1$. 

Proof. If $(a_0, \dots, a_d) = (1)$, then no prime ideal can contain all coefficients $a_i$, 
and it follows that f is very primitive. Conversely, if f is very primitive, then the 
coefficients of f are not all contained in any one prime ideal, and in particular they 
are not all contained in any maximal ideal (since maximal ideals are prime). Thus, 
$(a_0, \dots, a_d) = (1)$ in this case, since every proper ideal is contained in a maximal 
ideal by Proposition 3.5. This proves the first point. 

For the second point note that, in a UFD, $\gcd(a_0, \dots, a_d) \ne 1$ if and only if 
there exists an irreducible element $q \in R$ such that $(a_0, \dots, a_d) \subseteq (q)$. As (q) is 
then prime (by Lemma 2.4), the second point follows as well. □ 

With this in mind, in order to capitalize on Lemma 4.4, it is convenient to give 
a name to the gcd of the coefficients of a polynomial. We now assume that R is a 
UFD, since this is the case of greatest interest in the applications. 

Definition 4.6. Let R be a UFD. The content of a nonzero polynomial $f \in R[x]$, 
denoted $\mathrm{cont}_f$, is the gcd of its coefficients. 

Of course the content of a polynomial is only defined up to units. We find this 
ambiguity distasteful. What is uniquely determined is the principal ideal generated 
by the content of f, which we will denote 

$$(\mathrm{cont}_f);$$ 

thus f is primitive precisely when $(\mathrm{cont}_f) = (1)$. We will take a somewhat stubborn 
approach and deal with principal ideals throughout, rather than with individual (but only 
defined up to unit) elements. The reader should keep in mind that ideals may be 
multiplied (cf. §III.4.1), and if (a), (b) are principal ideals, then their product (a)(b) 
is the principal ideal (ab). 
The proof of the following remarks is immediate from the definition; hence it 
is left to the reader (Exercise 4.5): 

Lemma 4.7. Let R be a UFD, and let $f \in R[x]$. Then 

• $(f) = (\mathrm{cont}_f)(\underline{f})$, where $\underline{f}$ is primitive; 

• if $(f) = (c)(g)$, with $c \in R$ and g primitive, then $(c) = (\mathrm{cont}_f)$. 
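
Over $R = \mathbb{Z}$ the decomposition in the first point of Lemma 4.7 is easy to compute. The following Python sketch is one way to do it, under the assumption that a nonzero polynomial is stored as its list of integer coefficients [a_0, ..., a_d]. We choose the nonnegative gcd as generator of the content ideal; the other generator is its negative. The function names are ours. 

```python
from functools import reduce
from math import gcd


def content(f):
    """Nonnegative generator of (cont_f) for a nonzero f in Z[x]."""
    return reduce(gcd, f, 0)


def primitive_part(f):
    """The primitive polynomial obtained by dividing f by its content."""
    c = content(f)
    return [a // c for a in f]


def is_primitive(f):
    """f is primitive precisely when (cont_f) = (1)."""
    return content(f) == 1


f = [6, -10, 4]                  # 6 - 10x + 4x^2
print(content(f))                # 2
print(primitive_part(f))         # [3, -5, 2]
print(is_primitive(primitive_part(f)))  # True
```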

In view of its consequences, the following fact is rather profound, and it there- 
fore deserves a name. Some references call the result of Lemma 4.4, or its main 
consequence (Theorem 4.14), Gauss’s lemma. We prefer to use this name for the 
following statement. 

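Proposition 4.8 (Gauss’s lemma). Let R be a UFD, and let $f, g \in R[x]$. Then 

$$(\mathrm{cont}_{fg}) = (\mathrm{cont}_f)(\mathrm{cont}_g).$$ 

Proof. This follows easily from our preparatory work. Write 

$$(fg) = \bigl((\mathrm{cont}_f)(\underline{f})\bigr)\bigl((\mathrm{cont}_g)(\underline{g})\bigr) = (\mathrm{cont}_f)(\mathrm{cont}_g)(\underline{f}\,\underline{g}),$$ 

with $\underline{f}$, $\underline{g}$ primitive. By Lemma 4.4, $\underline{f}\,\underline{g}$ is primitive; by Lemma 4.7 it follows that 
$(\mathrm{cont}_f)(\mathrm{cont}_g)$ is the content of fg, as needed. □ 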
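
Gauss’s lemma can be spot-checked numerically over $R = \mathbb{Z}$. The Python sketch below compares the two sides for all pairs of nonzero polynomials of degree at most 2 with coefficients in a small range. This is of course evidence and not a proof, and the helper names are our own. 

```python
from functools import reduce
from itertools import product
from math import gcd


def content(f):
    """Nonnegative gcd of the coefficients of a nonzero f in Z[x]."""
    return reduce(gcd, f, 0)


def poly_mul(f, g):
    """Product of two polynomials given as coefficient lists [a_0, a_1, ...]."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out


def check_gauss(bound=2, degree=2):
    """Check cont(fg) == cont(f) * cont(g) for all nonzero f, g with
    degree <= `degree` and coefficients in [-bound, bound]."""
    coeffs = range(-bound, bound + 1)
    polys = [list(c) for c in product(coeffs, repeat=degree + 1) if any(c)]
    return all(content(poly_mul(f, g)) == content(f) * content(g)
               for f in polys for g in polys)


print(check_gauss())  # True
```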

Note the following immediate consequence: 

Corollary 4.9. Let R be a UFD, and let $f, g \in R[x]$. Assume $(f) \subseteq (g)$. Then 
$(\mathrm{cont}_f) \subseteq (\mathrm{cont}_g)$. 

4.2. The field of fractions of an integral domain. Gauss’s lemma is the key 
to the important observation that if R is a UFD, then so is R[x]. However, we need 
one more tool before we can prove this fact; this is one instance of the important 
process of localization. 

In the case we need for our immediate goal, the process starts from any integral 
domain and produces a field, in precisely the same way the field Q of rational 
numbers may be obtained from the integral domain Z. This construction is known 
as the field of fractions (or field of quotients) of the integral domain R. It satisfies 
the following beautiful universal property. 

Given an integral domain R, consider the category $\mathscr{R}$ whose objects are pairs 

$$(i, K),$$ 

where K is a field and $i : R \hookrightarrow K$ is an injective ring homomorphism. Morphisms 
are defined in the style of every analogous construction we have encountered: that is, 
a morphism $(i, K) \to (j, L)$ is determined by a homomorphism of fields $\alpha : K \to L$ 
making the following diagram 

[Diagram: triangle with $i : R \hookrightarrow K$, $j : R \hookrightarrow L$, and $\alpha : K \to L$] 

commute, that is, $\alpha \circ i = j$. 

Note that $\alpha$, as every homomorphism of fields, is necessarily injective 
(Exercise III.3.10); this is compatible with the requirement that $i : R \hookrightarrow K$ be injective 
to begin with. The injectivity of $R \hookrightarrow K$ also forces R to be an integral domain, 
since subrings of fields are necessarily integral domains. 

Definition 4.10. The field of fractions K(R) of R is an initial object of the 
category $\mathscr{R}$. 

Thus, K(R) is the ‘smallest field containing R’. 

The usual caveats apply to any such definition: the initial object of $\mathscr{R}$ carries 
not only the information of a field K (the field of fractions proper), but also of a 
specific realization of R as a subring of K; this distinction is blurred in common use 
of the language. Also, of course this prescription only defines the field of fractions 
up to isomorphism (as is true of any universal object; cf. Proposition I.5.4). 

Finally, just stating the universal property does not guarantee that the sought-for 
initial object exists. Thus, our next task is the construction of an initial object 
for $\mathscr{R}$. Fortunately, the construction is essentially straightforward: we just need to 
formalize a notion of ‘fraction’ of elements of R. 

Consider the set $R \times (R^*)$ of pairs (a, r) of elements of R, where $r \ne 0$. The 
pair (a, r) will be associated with a ‘fraction’ $\frac{a}{r}$; the reader would be well-advised 
to put this book away now and carry out the construction of K(R) on his/her own, 
profiting from this major hint. 

Here is the construction. Denote by $\frac{a}{r}$ the equivalence class of $(a, r) \in R \times (R^*)$ 
with respect to the following equivalence relation: 

$$(a, r) \sim (b, s) \iff as - br = 0.$$ 

The reader will verify that this is indeed an equivalence relation. As a set, K(R) 
is defined by 

$$K(R) := \left\{ \frac{a}{r} \,\middle|\, a \in R,\ r \in R,\ r \ne 0 \right\}.$$ 

It is clear that these ‘fractions’ behave much as ordinary fractions. For example, 

$$\frac{as}{rs} = \frac{a}{r}$$ 

if $s \ne 0$: indeed, 

$$(as)r = a(rs)$$ 

by associativity and commutativity in R, and this shows 

$$(as, rs) \sim (a, r)$$ 

as needed. 

We define operations on K(R) as follows: 

$$\frac{a}{r} + \frac{b}{s} := \frac{as + br}{rs}, \qquad \frac{a}{r} \cdot \frac{b}{s} := \frac{ab}{rs}.$$ 

Of course one must verify that these operations are well-defined; this is also left to 
the reader. The fact that R is an integral domain is used in these definitions: it 
guarantees that $rs \ne 0$ if $r \ne 0$ and $s \ne 0$. 

Now we claim that K(R) is made into a field by these operations. This is a 
straightforward verification; for example, distributivity amounts to the following 
computation: 

$$\frac{a}{r}\left(\frac{b}{s} + \frac{c}{t}\right) = \frac{a}{r} \cdot \frac{bt + cs}{st} = \frac{a(bt + cs)}{r(st)} = \frac{a(bt)}{r(st)} + \frac{a(cs)}{r(st)} = \frac{ab}{rs} + \frac{ac}{rt} = \frac{a}{r} \cdot \frac{b}{s} + \frac{a}{r} \cdot \frac{c}{t}.$$ 

The zero element is the fraction $\frac{0}{1}$ (or in fact $\frac{0}{r}$ for any nonzero $r \in R$); the 
multiplicative identity is the fraction $\frac{1}{1}$ (or in fact $\frac{r}{r}$ for any nonzero $r \in R$). Since 

$$\frac{r}{s} \cdot \frac{s}{r} = \frac{rs}{rs} = \frac{1}{1}$$ 

for all $\frac{r}{s} \ne \frac{0}{1}$ (that is, for all $r \ne 0$), every nonzero element in K(R) has an inverse, 
as promised. 
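
The construction can be transcribed almost verbatim for $R = \mathbb{Z}$. The Python class below is a minimal sketch: a fraction is stored as an unreduced pair (a, r), and equality is tested by the relation as - br = 0, so no reduction to lowest terms is ever needed. The class name `Frac` and its methods are our own choices. Python's standard `fractions.Fraction` implements the same field $\mathbb{Q}$ with normalized representatives. 

```python
class Frac:
    """Formal fraction a/r over Z with r != 0, up to (a, r) ~ (b, s) iff as - br = 0."""

    def __init__(self, a, r):
        if r == 0:
            raise ZeroDivisionError("denominator must be nonzero")
        self.a, self.r = a, r

    def __eq__(self, other):
        # Equality of classes: as - br = 0.
        return self.a * other.r - other.a * self.r == 0

    def __add__(self, other):
        # a/r + b/s := (as + br)/(rs)
        return Frac(self.a * other.r + other.a * self.r, self.r * other.r)

    def __mul__(self, other):
        # a/r * b/s := (ab)/(rs)
        return Frac(self.a * other.a, self.r * other.r)

    def inverse(self):
        # (r/s)^(-1) = s/r, defined when the numerator is nonzero.
        return Frac(self.r, self.a)

    def __repr__(self):
        return f"{self.a}/{self.r}"


a, b, c = Frac(1, 2), Frac(1, 3), Frac(3, 5)
print(Frac(2, 4) == a)                # True: (2, 4) ~ (1, 2)
print(a + b)                          # 5/6
print(c * (a + b) == c * a + c * b)   # True: distributivity
print(c * c.inverse() == Frac(1, 1))  # True
```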

An ‘inclusion map’ $i : R \hookrightarrow K(R)$ is defined by 

$$a \mapsto \frac{a}{1};$$ 

it is immediately checked that this is an injective ring homomorphism. It is common 
to use this map to identify R with its isomorphic copy inside K(R) and simply 
view R as a subring of K(R). 

Claim 4.11. (i, K(R)) is initial in $\mathscr{R}$. 


Proof. Let $j : R \hookrightarrow L$ be any injective ring homomorphism from R to a field L. 
We need to define an induced homomorphism $\hat{\jmath} : K(R) \to L$ so that the diagram 

[Diagram: triangle with $i : R \hookrightarrow K(R)$, $j : R \hookrightarrow L$, and $\hat{\jmath} : K(R) \to L$] 

commutes, and we must show that $\hat{\jmath}$ is unique. Now, the definition of $\hat{\jmath}$ is in fact 
forced upon us: if $\hat{\jmath}$ exists as a homomorphism, then necessarily 

$$\hat{\jmath}\left(\frac{a}{r}\right) = \hat{\jmath}\left(\frac{a}{1}\right)\hat{\jmath}\left(\frac{1}{r}\right) = \hat{\jmath}\left(\frac{a}{1}\right)\hat{\jmath}\left(\frac{r}{1}\right)^{-1} = (\hat{\jmath} \circ i(a))(\hat{\jmath} \circ i(r))^{-1} = j(a)j(r)^{-1}.$$ 

Thus $\hat{\jmath}$ is indeed unique, if it exists. On the other hand, the prescription 

$$\hat{\jmath}\left(\frac{a}{r}\right) := j(a)j(r)^{-1}$$ 

does define a function $K(R) \to L$: indeed, if $(a, r) \sim (b, s)$, then 

$$as = br$$ 

in R, hence 

$$j(a)j(s) = j(b)j(r)$$ 

in L, and (note that j(r), j(s) are nonzero in L since r, s are nonzero in R and j is 
injective) 

$$j(a)j(r)^{-1} = j(b)j(s)^{-1},$$ 

showing that the proposed $\hat{\jmath}$ is well-defined. The reader will verify that it is a ring 
homomorphism, concluding the proof of the claim. □ 


Example 4.12. With the notation introduced above, $K(\mathbb{Z}) = \mathbb{Q}$. 

The universal property implies immediately that $F \hookrightarrow K(F)$ is an isomorphism 
if F is itself a field. Thus, the construction adds nothing to $\mathbb{Q}$, $\mathbb{R}$, $\mathbb{C}$, $\mathbb{Z}/p\mathbb{Z}$, etc. 

If R is any integral domain, so that R[x] is also an integral domain, K(R[x]) is 
a famous field: 

Definition 4.13. The field of rational functions with coefficients in R is the field 
of fractions of the ring R[x]. This field is denoted R(x). 

Elements of R(x) are fractions of polynomials 

$$\frac{p(x)}{q(x)}$$ 

with $p(x), q(x) \in R[x]$ and $q(x) \ne 0$. The term function is inaccurate, since the 
‘function’ $R \to R$ given by $a \mapsto \frac{p(a)}{q(a)}$ is not defined for all $a \in R$ (not for those for 
which q(a) = 0, that is); and the function itself does not suffice to determine the 
element in R(x) (cf. Exercise III.2.7). 


4.3. R UFD $\implies$ R[x] UFD. We are now in a position to prove the analogue of 
Hilbert’s basis theorem for unique factorization domains, that is, 

Theorem 4.14. Let R be a UFD; then R[x] is a UFD. 

For example, this result (and an immediate induction) shows that the rings 
$\mathbb{Z}[x_1, \dots, x_n]$ and $k[x_1, \dots, x_n]$ (for k a field) are UFDs. Theorem 4.14 is also often 
called Gauss’s lemma. 

As a measure of how delicate the statement of Theorem 4.14 is, note that the 
power series ring R[[x]] is not necessarily a UFD if R is a UFD; examples of this 
phenomenon are however not easy to construct. Of course k[[x]] is a UFD if k is a 
field, since k[[x]] is a Euclidean domain in this case (Exercise 2.14). 

By Theorem 2.5, in order to prove Theorem 4.14, we have to verify that R[x] 
satisfies the a.c.c. for principal ideals and that every irreducible element in R[x] 
is prime, provided that R is itself a UFD. The general idea is to reduce these 
questions to matters in K[x], where K = K(R) is the field of fractions of R: 
as we know, K[x] is a UFD (in fact it is a Euclidean domain, and Euclidean 
domain $\implies$ PID $\implies$ UFD, as shown in §2). 

The following lemma captures the most crucial ingredient of the interaction 
between R[x] and K[x]: 
Lemma 4.15. Let R be a UFD, and let K = K(R) be its field of fractions. For 
nonzero $f, g \in R[x]$, denote by (f), (g) the principal ideals fR[x], gR[x] in R[x], 
and denote by $(f)_K$, $(g)_K$ the principal ideals fK[x], gK[x] in K[x]. Assume 

• $(\mathrm{cont}_g) \subseteq (\mathrm{cont}_f)$ and 

• $(g)_K \subseteq (f)_K$. 

Then $(g) \subseteq (f)$. 

Proof. Since $(g)_K \subseteq (f)_K$, we have g = fh, where $h \in K[x]$. Write $h = \frac{a}{b}\underline{h}$, where 
$a, b \in R$ and $\underline{h} \in R[x]$ is a primitive polynomial: this can be done by collecting 
common denominators in h, then applying the first point of Lemma 4.7. We then 
have 

$$bg = af\underline{h}$$ 

in R[x]. By Gauss’s lemma and since $\underline{h}$ is primitive, 

$$(a\,\mathrm{cont}_f) = (b\,\mathrm{cont}_g);$$ 

and since $(\mathrm{cont}_g) \subseteq (\mathrm{cont}_f)$ by hypothesis, we obtain 

$$(a\,\mathrm{cont}_f) \subseteq (b\,\mathrm{cont}_f).$$ 

Since R is an integral domain and $(\mathrm{cont}_f) \ne (0)$, this implies 

$$a = bc$$ 

for some $c \in R$. But then $h = \frac{a}{b}\underline{h} = c\underline{h} \in R[x]$, and $g = fh \in (f)$; that is, $(g) \subseteq (f)$ 
in R[x], as needed. □ 
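
The first step of the proof, writing $h = \frac{a}{b}\underline{h}$ with $\underline{h}$ primitive, is entirely algorithmic when $R = \mathbb{Z}$ and $K = \mathbb{Q}$. The Python sketch below carries it out using the standard `fractions` module; the function name `split_rational_poly` is ours, and the input is assumed to be a nonzero polynomial given by its list of rational coefficients. 

```python
from fractions import Fraction
from functools import reduce
from math import gcd, lcm  # math.lcm requires Python 3.9 or later


def split_rational_poly(h):
    """Write a nonzero h in Q[x] as (a/b) * h_prim with h_prim primitive in Z[x].

    Returns the pair (a/b as a Fraction, h_prim as a list of integers).
    """
    h = [Fraction(c) for c in h]
    # Collect a common denominator b, so that b*h has integer coefficients.
    b = reduce(lcm, (c.denominator for c in h), 1)
    ints = [int(c * b) for c in h]
    # Factor out the content a of b*h (first point of Lemma 4.7).
    a = reduce(gcd, ints, 0)
    return Fraction(a, b), [n // a for n in ints]


scalar, h_prim = split_rational_poly([Fraction(3, 2), Fraction(9, 4)])
print(scalar, h_prim)  # 3/4 [2, 3], i.e. 3/2 + (9/4)x = (3/4)(2 + 3x)
```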

The first application is the following description of the irreducible elements 
of R[x]; this will also be used in the proof of Theorem 4.14 and is independently 
interesting. 

Proposition 4.16. Let R be a UFD, and let K be its field of fractions. Let $f \in R[x]$ 
be a nonconstant, irreducible polynomial. Then f is irreducible as an element of 
K[x]. 

Proof. First note that f is primitive: otherwise we could factor out its content, 
and f would not be irreducible. 

Next, assume f = gh, with $g, h \in K[x]$; we have to prove that either g or h is 
a unit in K[x]. Let $c, d \in K$ be such that 

$$g = c\underline{g}, \quad h = d\underline{h},$$ 

and $\underline{g}$, $\underline{h}$ are primitive polynomials in R[x]. By Lemma 4.4, $\underline{g}\,\underline{h}$ is also primitive; 
thus $(\mathrm{cont}_{\underline{g}\,\underline{h}}) = (1) = (\mathrm{cont}_f)$; further, 

$$(f)_K = (\underline{g}\,\underline{h})_K$$ 

as $cd \ne 0$ is a unit in K. By Lemma 4.15 we obtain 

$$(f) = (\underline{g}\,\underline{h})$$ 

as ideals of R[x]; that is, $f = u\underline{g}\,\underline{h}$ with $u \in R[x]$ a unit. As f is irreducible in R[x], 
this implies that either $\underline{g}$ or $\underline{h}$ is a unit in R[x]. But then g or h is a unit in K[x], 
verifying that f is irreducible in K[x]. □ 
We have always found this fact almost counterintuitive: K is ‘larger’ than R, 
so one may expect that it should be ‘easier’ to factor polynomials over K than over 
R. Proposition 4.16 tells us that this is not the case: with due attention to special 
cases, irreducibility in R[x] is ‘the same as’ irreducibility in K[x]. To be precise, 

Corollary 4.17. Let R be a UFD and K the field of fractions of R. Let $f \in R[x]$ 
be a nonconstant polynomial. Then f is irreducible in R[x] if and only if it is 
irreducible in K[x] and primitive. 

The proof amounts to tying up loose ends, and we leave it to the reader (Ex- 
ercise 4.21). 
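
To see Corollary 4.17 at work for $R = \mathbb{Z}$, $K = \mathbb{Q}$, here is a small Python sketch. It is restricted to polynomials of degree 1, 2, or 3, for which irreducibility over $\mathbb{Q}$ can be decided by the rational root test: a polynomial of degree 2 or 3 over a field is reducible if and only if it has a root. Polynomials are coefficient lists [a_0, ..., a_d] with $a_d \ne 0$, and all function names are illustrative choices of ours. 

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]


def has_rational_root(f):
    """Rational root test: any root p/q in lowest terms has p | a_0 and q | a_d."""
    if f[0] == 0:
        return True  # 0 is a root
    candidates = {Fraction(sign * p, q) for p in divisors(f[0])
                  for q in divisors(f[-1]) for sign in (1, -1)}
    return any(sum(a * x**i for i, a in enumerate(f)) == 0 for x in candidates)


def irreducible_over_Q(f):
    """Irreducibility in Q[x]; this sketch only handles degrees 1, 2, 3."""
    d = len(f) - 1
    if d not in (1, 2, 3):
        raise ValueError("this sketch only handles degrees 1 to 3")
    return d == 1 or not has_rational_root(f)


def irreducible_over_Z(f):
    """Corollary 4.17: irreducible in Z[x] iff irreducible in Q[x] and primitive."""
    return irreducible_over_Q(f) and reduce(gcd, f, 0) == 1


print(irreducible_over_Q([2, 2]), irreducible_over_Z([2, 2]))  # True False: 2x + 2 = 2(x + 1)
print(irreducible_over_Z([-2, 0, 1]))                          # True: x^2 - 2
```

The polynomial 2x + 2 is one of the ‘special cases’ alluded to above: it is irreducible in $\mathbb{Q}[x]$, where 2 is a unit, but not in $\mathbb{Z}[x]$, where it is not primitive. 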

We can now prove the main result of this section. We will make systematic use 
of the characterization of UFDs found in Theorem 2.5. 

Proof of Theorem 4.14. We begin by verifying the a.c.c. for principal ideals 
in R[x]. Let 

$$(f_1) \subseteq (f_2) \subseteq (f_3) \subseteq \cdots$$ 

be an ascending chain of principal ideals of R[x]. By Corollary 4.9, this induces an 
ascending chain of principal ideals 

$$(\mathrm{cont}_{f_1}) \subseteq (\mathrm{cont}_{f_2}) \subseteq (\mathrm{cont}_{f_3}) \subseteq \cdots$$ 

in R; since R is a UFD, this chain stabilizes: that is, $(\mathrm{cont}_{f_i}) = (\mathrm{cont}_{f_{i+1}})$ for 
$i \gg 0$ (see footnote 16). On the other hand, with notation as in Lemma 4.15 we have 

$$(f_1)_K \subseteq (f_2)_K \subseteq (f_3)_K \subseteq \cdots$$ 

as a sequence of ideals in K[x]; since K[x] is a UFD (because it is a PID; cf. §2.3) 
this sequence stabilizes. Therefore, $(f_i)_K = (f_{i+1})_K$ for $i \gg 0$. 

By Lemma 4.15, $(f_i) = (f_{i+1})$ for $i \gg 0$; that is, the given chain of principal 
ideals stabilizes, as needed. 

Next, we consider an irreducible element f of R[x]. We verify that (f) is a 
prime ideal; by Theorem 2.5 it then follows that R[x] is a UFD, as stated. 

If f is irreducible and constant, then f is prime in R as R is a UFD, and it 
follows that f is prime in R[x] (by Corollary 4.2). Thus we may assume that f is 
nonconstant and irreducible (and in particular primitive) in R[x]. 

By Proposition 4.16, f is irreducible as an element of K[x]; since K[x] is a PID, 
$(f)_K$ is prime in K[x]. Consider the composition 

$$\rho : R[x] \hookrightarrow K[x] \twoheadrightarrow \frac{K[x]}{(f)_K}.$$ 

We claim that $\ker \rho = (f)$. Indeed, the inclusion $\supseteq$ is trivial; for the other inclusion, 
note that $\rho(g) = 0$ implies that g is divisible by f in K[x]: that is, $(g)_K \subseteq (f)_K$; and 
we have $(\mathrm{cont}_g) \subseteq (\mathrm{cont}_f)$ since $(\mathrm{cont}_f) = (1)$ as f is primitive. By Lemma 4.15, we 
obtain that $(g) \subseteq (f)$, i.e., g is divisible by f in R[x], as needed. Since $\ker \rho = (f)$, 
we find that $\rho$ induces an injective homomorphism 

$$\frac{R[x]}{(f)} \hookrightarrow \frac{K[x]}{(f)_K}.$$ 

Since the ring on the right is an integral domain (as $(f)_K$ is prime in K[x]), so is 
the ring on the left. This proves that (f) is prime in R[x], and we are done. □ 

16 ‘For $i \gg 0$’ is shorthand for $(\exists N \ge 0)(\forall i \ge N)$ …, that is, ‘for all sufficiently large i…’. 

Summarizing, if R is a UFD, then factorization in R[x] is ‘the same as’ 
factorization in the polynomial ring K[x] over the field of quotients of R. If $f(x) \in R[x]$, 
then f(x) has a prime factorization in K[x] for the simpler reason that K[x] is a 
PID, hence a UFD; but if R itself is a UFD, then we know that each of the factors 
may be assumed to be in R[x] to begin with (cf. Exercise 4.23). 

Example 4.18. As mentioned already, Theorem 4.14 implies that several important 
rings, such as $\mathbb{Z}[x_1, \dots, x_n]$ or $\mathbb{C}[x_1, \dots, x_n]$, are UFDs; for example $\mathbb{Z}[x]$ is a 
UFD, as announced in §2.3. Further, arguing as in Exercise 1.15 to reduce to the 
case of finitely many indeterminates, it follows that $\mathbb{Z}[x_1, x_2, \dots]$ is a UFD: this is 
an example of a non-Noetherian UFD, promised a while back (and illustrating the 
last missing feature of the picture presented at the beginning of the chapter). 


Exercises 


4.1. ▷ Prove Lemma 4.1. [§4.1] 

4.2. Let R be a ring, and let I be an ideal of R. Prove or disprove that if I is 
maximal in R, then IR[x] is maximal in R[x]. 

4.3. ▷ Let R be a PID, and let $f \in R[x]$. Prove that f is primitive if and only if it 
is very primitive. Prove that this is not necessarily the case in an arbitrary UFD. 
[§4.1] 

4.4. ▷ Let R be a commutative ring, and let $f, g \in R[x]$. Prove that 

fg is very primitive $\iff$ both f and g are very primitive. 

[§4.1] 

4.5. ▷ Prove Lemma 4.7. [§4.1] 


4.6. Let R be a PID, and let K be its field of fractions. 

• Prove that every element c ∈ K can be written as a finite sum 

\[ c = \sum_i \frac{a_i}{p_i^{r_i}} \]

where the p_i are nonassociate irreducible elements in R, r_i ≥ 0, and a_i, p_i are 
relatively prime. 

• If \sum_i a_i/p_i^{r_i} = \sum_j b_j/q_j^{s_j} are two such expressions, prove that 
(up to reshuffling) 

\[ p_i = q_i, \quad r_i = s_i, \quad\text{and}\quad a_i \equiv b_i \bmod p_i^{r_i}. \]

Relate this to the process of integration by ‘partial fractions’ you learned about 
when you took calculus. 
4.7. ▷ A subset S of a commutative ring R is a multiplicative subset (or multiplicatively 
closed) if (i) 1 ∈ S and (ii) s, t ∈ S ⟹ st ∈ S. Define a relation on the set 
of pairs (a, s) with a ∈ R, s ∈ S as follows: 

(a, s) ∼ (a', s') ⟺ (∃t ∈ S), t(s'a - sa') = 0. 

Note that if R is an integral domain and S = R ∖ {0}, then S is a multiplicative 
subset, and the relation agrees with the relation introduced in §4.2. 

• Prove that the relation ∼ is an equivalence relation. 

• Denote by a/s the equivalence class of (a, s), and define the same operations +, · 
on such ‘fractions’ as the ones introduced in the special case of §4.2. Prove that 
these operations are well-defined. 

• The set S^{-1}R of fractions, endowed with the operations +, ·, is the localization¹⁷ 
of R at the multiplicative subset S. Prove that S^{-1}R is a commutative ring and 
that the function a ↦ a/1 defines a ring homomorphism ℓ : R → S^{-1}R. 

• Prove that ℓ(s) is invertible for every s ∈ S. 

• Prove that R → S^{-1}R is initial among ring homomorphisms f : R → R' such 
that f(s) is invertible in R' for every s ∈ S. 

• Prove that S^{-1}R is an integral domain if R is an integral domain. 

• Prove that S^{-1}R is the zero-ring if and only if 0 ∈ S. 

[4.8, 4.9, 4.11, 4.15, VII.2.16, VIII.1.4, VIII.2.5, VIII.2.6, VIII.2.12, §IX.9.1] 

4.8. ¬ Let S be a multiplicative subset of a commutative ring R, as in Exercise 4.7. 
For every R-module M, define a relation ∼ on the set of pairs (m, s), where m ∈ M 
and s ∈ S: 

(m, s) ∼ (m', s') ⟺ (∃t ∈ S), t(s'm - sm') = 0. 

Prove that this is an equivalence relation, and define an S^{-1}R-module structure 
on the set S^{-1}M of equivalence classes, compatible with the R-module structure 
on M. The module S^{-1}M is the localization of M at S. [4.9, 4.11, 4.14, VIII.1.4, 
VIII.2.5, VIII.2.6] 

4.9. ¬ Let S be a multiplicative subset of a commutative ring R, and consider the 
localization operation introduced in Exercises 4.7 and 4.8. 

• Prove that if I is an ideal of R such that I ∩ S = ∅, then¹⁸ I^e := S^{-1}I is a proper 
ideal of S^{-1}R. 

• If ℓ : R → S^{-1}R is the natural homomorphism, prove that if J is a proper ideal 
of S^{-1}R, then J^c := ℓ^{-1}(J) is an ideal of R such that J^c ∩ S = ∅. 

• Prove that (J^c)^e = J, while (I^e)^c = {a ∈ R | (∃s ∈ S) sa ∈ I}. 

• Find an example showing that (I^e)^c need not equal I, even if I ∩ S = ∅. (Hint: 
Let S = {1, x, x^2, ...} in R = C[x, y]. What is (I^e)^c for I = (xy)?) 

[4.10, 4.14] 

17 The terminology is motivated by applications to algebraic geometry; see Exercise VII.2.17. 
18 The superscript e stands for ‘extension’ (of the ideal from a smaller ring to a larger ring); 
the superscript c in the next point stands for ‘contraction’. 
4.10. ¬ With notation as in Exercise 4.9, prove that the assignment 𝔭 ↦ S^{-1}𝔭 
gives an inclusion-preserving bijection between the set of prime ideals of R disjoint 
from S and the set of prime ideals of S^{-1}R. (Prove that (𝔭^e)^c = 𝔭 if 𝔭 is a prime 
ideal disjoint from S.) [4.16] 

4.11. (Notation as in Exercise 4.7 and 4.8.) A ring is said to be local if it has a 
single maximal ideal. 

Let R be a commutative ring, and let 𝔭 be a prime ideal of R. Prove that the 
set S = R ∖ 𝔭 is multiplicatively closed. The localizations S^{-1}R, S^{-1}M are then 
denoted R_𝔭, M_𝔭. 

Prove that there is an inclusion-preserving bijection between the prime ideals 
of R_𝔭 and the prime ideals of R contained in 𝔭. Deduce that R_𝔭 is a local ring¹⁹. 
[4.12, 4.13, VI.5.5, VII.2.17, VIII.2.21] 

4.12. ¬ (Notation as in Exercise 4.11.) Let R be a commutative ring, and let M 
be an R-module. Prove that the following are equivalent²⁰: 

• M = 0. 

• M_𝔭 = 0 for every prime ideal 𝔭. 

• M_𝔪 = 0 for every maximal ideal 𝔪. 

(Hint: For the interesting implication, suppose that M ≠ 0; then the ideal {r ∈ 
R | (∀m ∈ M), rm = 0} is proper. By Proposition 3.5, it is contained in a maximal 
ideal 𝔪. What can you say about M_𝔪?) [VIII.1.26, VIII.2.21] 

4.13. ¬ Let k be a field, and let v be a discrete valuation on k. Let R be the 
corresponding DVR, with local parameter t (see Exercise 2.20). 

• Prove that R is local (Exercise 4.11), with maximal ideal 𝔪 = (t). (Hint: Note 
that every element of R ∖ 𝔪 is invertible.) 

• Prove that k is the field of fractions of R. 

• Now let A be a PID, and let 𝔭 be a prime ideal in A. Prove that the localization 
A_𝔭 (cf. Exercise 4.11) is a DVR. (Hint: If 𝔭 = (p), define a valuation on the field 
of fractions of A in terms of ‘divisibility by p’.) 

[VII.2.18] 

19 We have the following picture in mind associated with the two operations R/P, R_P for a 
prime ideal P: 

[Picture not reproduced.] 

Can you make any sense of this? 

20 The way to think of this type of results is that a module M over R is zero if and only if it 
is zero ‘at every point of Spec R’. Working in the localization M_𝔭 amounts to looking ‘near’ the 
point 𝔭 of Spec R; this is what is local about localization. As in this exercise, one can often detect 
‘global’ features by looking locally at every point. 
4.14. With notation as in Exercise 4.8, define operations N ↦ N^e and N ↦ N^c 
for submodules N ⊆ M, N ⊆ S^{-1}M, respectively, analogously to the operations 
defined in Exercise 4.9. Prove that (N^c)^e = N. Prove that every localization of a 
Noetherian module is Noetherian. 

In particular, all localizations S^{-1}R of a Noetherian ring are Noetherian. 

4.15. ¬ Let R be a UFD, and let S be a multiplicatively closed subset of R (cf. Exercise 4.7). 

• Prove that if q is irreducible in R, then q/1 is either irreducible or a unit in 
S^{-1}R. 

• Prove that if a/s is irreducible in S^{-1}R, then a/s is an associate of q/1 for some 
irreducible element q of R. 

• Prove that S^{-1}R is also a UFD. 

[4.16] 

4.16. Let R be a Noetherian integral domain, and let s ∈ R, s ≠ 0, be a prime 
element. Consider the multiplicatively closed subset S = {1, s, s^2, ...}. Prove that 
R is a UFD if and only if S^{-1}R is a UFD. (Hint: By Exercise 2.10, it suffices to 
show that every prime of height 1 is principal. Use Exercise 4.10 to relate prime 
ideals in R to prime ideals in the localization.) 

On the basis of results such as this and of Exercise 4.15, one might suspect 
that being factorial is a local property, that is, that R is a UFD if and only if R_𝔭 
is a UFD for all primes 𝔭, if and only if R_𝔪 is a UFD for all maximals 𝔪. This 
is regrettably not the case. A ring R is locally factorial if R_𝔪 is a UFD for all 
maximal ideals 𝔪; factorial implies locally factorial by Exercise 4.15, but locally 
factorial rings that are not factorial do exist. 

4.17. ▷ Let F be a field, and recall the notion of characteristic of a ring 
(Definition III.3.7); the characteristic of a field is either 0 or a prime integer 
(Exercise III.3.14). 

• Show that F has characteristic 0 if and only if it contains a copy of Q and that 
F has characteristic p if and only if it contains a copy of the field Z/pZ. 

• Show that (in both cases) this determines the smallest subfield of F; it is called 
the prime subfield of F. 

[§5.2, §VII.1.1] 

4.18. ¬ Let R be an integral domain. Prove that the invertible elements in R[x] 
are the units of R, viewed as constant polynomials. [4.20] 

4.19. ▷ An element a ∈ R in a ring is said to be nilpotent if a^n = 0 for some n > 0. 
Prove that if a is nilpotent, then 1 + a is a unit in R. [VI.7.11, §VII.2.3] 

4.20. Generalize the result of Exercise 4.18 as follows: let R be a commutative 
ring, and let f = a_0 + a_1 x + ⋯ + a_d x^d ∈ R[x]; prove that f is a unit in R[x] if and 
only if a_0 is a unit in R and a_1, ..., a_d are nilpotent. (Hint: If b_0 + b_1 x + ⋯ + b_e x^e 
is the inverse of f, show by induction that a_d^{i+1} b_{e-i} = 0 for all i ≥ 0, and deduce 
that a_d is nilpotent.) 
4.21. ▷ Establish the characterization of irreducible polynomials over a UFD given 
in Corollary 4.17. [§4.3] 

4.22. Let k be a field, and let f, g be two polynomials in k[x, y] = k[x][y]. Prove that 
if f and g have a nontrivial common factor in k(x)[y], then they have a nontrivial 
common factor in k[x, y]. 

4.23. ▷ Let R be a UFD, K its field of fractions, f(x) ∈ R[x], and assume f(x) = 
α(x)β(x) with α(x), β(x) in K[x]. Prove that there exists a c ∈ K such that 
cα(x) ∈ R[x], c^{-1}β(x) ∈ R[x], so that 

\[ f(x) = (c\,\alpha(x))(c^{-1}\beta(x)) \]

splits f(x) as a product of factors in R[x]. 

Deduce that if α(x)β(x) = f(x) ∈ R[x] is monic and α(x) ∈ K[x] is monic, 
then α(x), β(x) are both in R[x] and β(x) is also monic. [§4.3, 4.24, §VII.5.2] 

4.24. In the same situation as in Exercise 4.23, prove that the product of any 
coefficient of α with any coefficient of β lies in R. 

4.25. Prove Fermat’s last theorem for polynomials: the equation 

\[ f^n + g^n = h^n \]

has no solutions in C[t] for n > 2 and f, g, h not all constant. (Hint: First, prove 
that f, g, h may be assumed to be relatively prime. Next, the polynomial 1 - t^n 
factorizes in C[t] as \prod_{i=1}^n (1 - ζ^i t) for ζ = e^{2πi/n}; deduce that 
f^n = \prod_{i=1}^n (h - ζ^i g). 
Use unique factorization in C[t] to conclude that each of the factors h - ζ^i g is an 
n-th power. Now let h - g = a^n, h - ζg = b^n, h - ζ^2 g = c^n (this is where the n > 2 
hypothesis enters). Use this to obtain a relation (λa)^n + (μb)^n = (νc)^n, where λ, 
μ, ν are suitable complex numbers. What’s wrong with this?) 

The same pattern of proof would work in any environment where unique factorization 
is available; if adjoining to Z an n-th root of 1 led to a unique factorization 
domain, the full-fledged Fermat’s last theorem would be as easy to prove as indicated 
in this exercise. This is not the case, a fact famously missed by G. Lamé as 
he announced a ‘proof’ of Fermat’s last theorem to the Paris Academy on March 1, 
1847. 


5. Irreducibility of polynomials 

Our work in §4.3, especially Corollary 4.17, links the irreducibility of polynomi- 
als over a UFD with their irreducibility over the corresponding field of fractions. 
For example, irreducibility of polynomials in Z[x] is ‘essentially the same’ as irre- 
ducibility in Q[x]. This can be useful in both directions, provided we have ways 
to effectively verify irreducibility of polynomials over one kind of ring or the other. 
We collect here several remarks aimed at establishing whether a given polynomial 
is irreducible and briefly discuss the notion of algebraically closed field. 
5.1. Roots and reducibility. Let R be a ring and f ∈ R[x]. An element a ∈ R 
is a root if f(a) = 0. Recall (Example III.4.7) that a polynomial f(x) ∈ R[x] is 
divisible by (x - a) if and only if a is a root of f. More generally, we say that a is a 
root of f with multiplicity r if (x - a)^r divides f and (x - a)^{r+1} does not divide f. 
The reader is probably familiar with this notion from experience with calculus. For 
example, the graphs of polynomials with roots of multiplicities 1, 2, and 3 at 0 look 
(near the origin), respectively, like 

[Figure: graphs near the origin of polynomials with a root of multiplicity 1, 2, and 3 at 0.] 

Lemma 5.1. Let R be an integral domain, and let f ∈ R[x] be a polynomial of 
degree n. Then the number of roots of f, counted with multiplicity, is at most n. 

Proof. The number of roots of f in R is less than or equal to the number of roots 
of f viewed as a polynomial over the field of fractions K of R; so we may replace R 
by K. 

Now, K[x] is a UFD, and the roots of f correspond to the irreducible factors 
of f of degree 1. Since the product of all irreducible factors of f has degree n, the 
number of factors of degree 1 can be at most n, as claimed. □ 

It is worth remembering the trick of replacing an integral domain R by its field 
of fractions K, used in this proof: one is often able to use convenient properties 
due to the fact that K is a field (the fact that K[x] is a UFD, in this case), even 
if R is very far from satisfying such properties (R[x] may not be a UFD, since R 
itself need not be a UFD). 

Also note that the statement of Lemma 5.1 may fail if R is not an integral 
domain. For example, the degree 2 polynomial x^2 + x has four roots over Z/6Z. 
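
The failure over Z/6Z can be checked by brute force. The following is a minimal sketch (the helper name `roots_mod` is ours, not part of the text); it lists the roots of a polynomial in Z/nZ by evaluating it at every residue.

```python
def roots_mod(coeffs, n):
    """Return all r in Z/nZ with f(r) = 0; coeffs[i] is the coefficient of x^i."""
    def evaluate(r):
        # Horner's rule, reducing modulo n at every step
        value = 0
        for c in reversed(coeffs):
            value = (value * r + c) % n
        return value
    return [r for r in range(n) if evaluate(r) == 0]

# x^2 + x has coefficient list [0, 1, 1]
print(roots_mod([0, 1, 1], 6))  # [0, 2, 3, 5]: four roots, since Z/6Z is not a domain
print(roots_mod([0, 1, 1], 7))  # [0, 6]: Z/7Z is a field, so at most 2 roots
```

Over Z/6Z the factorization x(x + 1) vanishes at 2 and 3 because 2 · 3 = 0 there, which is exactly the presence of zero-divisors that Lemma 5.1 excludes.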
The innocent Lemma 5.1 has important applications: for example, we used 
it (without much fanfare) in the proof of Theorem IV.6.10. Also, recall that a 
polynomial f ∈ R[x] determines an ‘evaluation function’ (cf. Example III.2.3) 
R → R, namely r ↦ f(r). The reader checked in Exercise III.2.7 that in general 
the function does not determine the polynomial; but polynomials are determined 
by the corresponding functions over infinite integral domains: 

Corollary 5.2. Let R be an infinite integral domain, and let f, g ∈ R[x] be 
polynomials. Then f = g if and only if the evaluation functions r ↦ f(r), r ↦ g(r) 
agree. 

Proof. Indeed, the two functions agree if and only if every a ∈ R is a root of 
f - g; but a nonzero polynomial over R cannot have infinitely many roots, by 
Lemma 5.1. □ 

Clearly, an irreducible polynomial of degree ≥ 2 can have no roots. The converse 
holds for degree 2 and 3, over a field: 

Proposition 5.3. Let k be a field. A polynomial f ∈ k[x] of degree 2 or 3 is 
irreducible if and only if it has no roots. 

Proof. Exercise 5.5. □ 

Example 5.4. Let F_2 be the field Z/2Z. 

The polynomial f(t) = t^2 + t + 1 ∈ F_2[t] is irreducible, since it has no roots: 
f(0) = f(1) = 1. Therefore the ideal (t^2 + t + 1) is prime in F_2[t], hence maximal 
(because F_2[t] is a PID: don’t forget Proposition III.4.13). This gives a one-second 
construction of a field with four elements: 

\[ \frac{F_2[t]}{(t^2 + t + 1)}. \]

The reader (hopefully) constructed this field in Exercise III.1.11, by the tiresome 
process of searching by hand for a suitable multiplication table. The reader will 
now have no difficulty constructing much larger examples (cf. Exercise 5.6). ⌟ 
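
To make the four-element field concrete, here is a small sketch of its arithmetic. The representation is an assumption of ours: the coset of a + bt is stored as the pair (a, b) with a, b ∈ {0, 1}, and products are reduced using t^2 = t + 1, which holds in the quotient.

```python
from itertools import product

# An element a + b*t of F_2[t]/(t^2 + t + 1) is stored as the pair (a, b).
def add(u, v):
    return ((u[0] + v[0]) % 2, (u[1] + v[1]) % 2)

def mul(u, v):
    a, b = u
    c, d = v
    # (a + b t)(c + d t) = ac + (ad + bc) t + bd t^2, and t^2 = t + 1 in the quotient
    return ((a * c + b * d) % 2, (a * d + b * c + b * d) % 2)

elements = list(product((0, 1), repeat=2))
one = (1, 0)

# Every nonzero element has a multiplicative inverse, as it must in a field.
inverses = {u: next(v for v in elements if mul(u, v) == one)
            for u in elements if u != (0, 0)}
print(inverses)
```

The dictionary pairs t with 1 + t, reflecting the identity t(1 + t) = t^2 + t = 1 in the quotient.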

Proposition 5.3 is pleasant and can help to decide irreducibility over more 
general rings: for example, a primitive polynomial f ∈ Z[x] of degree 2 or 3 is 
irreducible if and only if it has no roots in Q. Indeed, f is irreducible in Z[x] 
if and only if it is irreducible in Q[x] (by Corollary 4.17). However, note that 
e.g. 4x^2 - 1 = (2x - 1)(2x + 1) is primitive and reducible in Z[x], although it has 
no integer roots. Also, keep in mind that the statement of Proposition 5.3 may fail 
for polynomials of degree ≥ 4: for example, x^4 + 2x^2 + 1 = (x^2 + 1)^2 is reducible in 
Q[x], but it has no rational roots. 

Looking for rational roots of a polynomial in Z[x] is in principle a finite en- 
deavor, due to the following observation (which holds over every UFD). This is 
often called the ‘rational root test’. 

Proposition 5.5. Let R be a UFD, and let K be its field of fractions. Let 

\[ f(x) = a_0 + a_1 x + \cdots + a_n x^n \in R[x], \]

and let c = p/q ∈ K be a root of f, with p, q ∈ R, gcd(p, q) = 1. Then p | a_0 and 
q | a_n in R. 

Proof. By hypothesis, 

\[ a_0 + a_1 \frac{p}{q} + \cdots + a_n \frac{p^n}{q^n} = 0; \]

that is, 

\[ a_0 q^n + a_1 p q^{n-1} + \cdots + a_n p^n = 0. \]

Therefore 

\[ a_0 q^n = -p\,(a_1 q^{n-1} + \cdots + a_n p^{n-1}), \]

proving that p | (a_0 q^n). Since the gcd of p and q is one, this implies that the 
multiset of factors of p is contained in the multiset of irreducible factors of a_0, that 
is, p | a_0. 

An entirely similar argument proves that q | a_n. □ 

Example 5.6. Looking for rational roots of the polynomial 

\[ 3 - 2x + 3x^2 - 2x^3 + 3x^4 - 2x^5 \]

is therefore reduced to trying fractions p/q with q = ±1, ±2, p = ±1, ±3. As it 
happens, 3/2 is the only root found among these possibilities, and it follows that it 
is the only rational root of the polynomial. ⌟ 
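
The search in Example 5.6 is mechanical, so it can be sketched in a few lines for R = Z. The function names below are ours, and the sketch assumes integer coefficients with a_0 ≠ 0 and a_n ≠ 0; it enumerates the candidates p/q allowed by Proposition 5.5 and tests each one exactly with `fractions.Fraction`.

```python
from fractions import Fraction

def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]

def rational_roots(coeffs):
    """Rational roots of a_0 + a_1 x + ... + a_n x^n; coeffs[i] = a_i, a_0 and a_n nonzero."""
    a0, an = coeffs[0], coeffs[-1]
    # Proposition 5.5: a root p/q in lowest terms has p | a_0 and q | a_n
    candidates = {Fraction(sign * p, q)
                  for p in divisors(a0) for q in divisors(an) for sign in (1, -1)}
    def evaluate(c):
        value = Fraction(0)
        for a in reversed(coeffs):   # Horner's rule with exact arithmetic
            value = value * c + a
        return value
    return sorted(c for c in candidates if evaluate(c) == 0)

print(rational_roots([3, -2, 3, -2, 3, -2]))  # [Fraction(3, 2)]
```

Here the candidate set is {±1, ±3, ±1/2, ±3/2}, matching the list in the example.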

5.2. Adding roots; algebraically closed fields. A homomorphism of fields 

\[ i : k \to F \]

is necessarily injective (cf. Exercise III.3.10): indeed, its kernel is a proper ideal 
of k, and the only proper ideal of a field is (0). In this situation we say that F 
(or more properly the homomorphism i) is an extension of k. Abusing language a 
little, we then think of k as contained in F: k ⊆ F. Keep in mind that this is the 
case as soon as there is any ring homomorphism k → F. 

There is a standard procedure for constructing extensions in which a given 
polynomial acquires a root; we have encountered an instance of this process in 
Example III.4.8. In fact, the resulting field is almost universal with respect to this 
requirement. 

Proposition 5.7. Let k be a field, and let f(t) ∈ k[t] be a nonzero irreducible 
polynomial. Then 

\[ F := \frac{k[t]}{(f(t))} \]

is a field, endowed with a natural homomorphism i : k ↪ F (obtained as the 
composition k → k[t] → F) realizing it as an extension of k. Further, 

• f(x) ∈ k[x] ⊆ F[x] has a root in F, namely the coset of t; 
• if k ⊆ K is any extension in which f has a root, then there exists a homomorphism 
j : F → K such that the diagram 

\[
\begin{array}{ccc}
k & \hookrightarrow & K \\
 & {\scriptstyle i}\searrow & \uparrow{\scriptstyle j} \\
 & & F
\end{array}
\]

commutes. 



Proof. Since k is a field, k[t] is a PID; hence (f(t)) is a maximal ideal of k[t], by 
Proposition III.4.13. Therefore F is indeed a field. Denoting cosets in k[t]/(f(t)) = 
F by underlining, we have 

\[ f(\underline{t}) = \underline{f(t)} = \underline{0}, \]

as claimed. 

To verify the second part of the statement, suppose k ⊆ K is an extension and 
f(u) = 0, with u ∈ K. This means that the evaluation homomorphism 

\[ \varepsilon : k[t] \to K \]

defined by ε(g(t)) := g(u) vanishes at f(t); hence (f(t)) ⊆ ker(ε), and the universal 
property of quotients gives a unique homomorphism 

\[ j : F = \frac{k[t]}{(f(t))} \to K \]

satisfying the stated requirement. □ 



We have stopped short of claiming that the extension constructed in Proposition 5.7 
is universal with respect to the requirement of containing a root of f 
because the homomorphism j appearing in the statement is not unique: in fact, 
the proof shows that there are as many such homomorphisms as there are roots of 
f in the larger field K. We could say that F is versal, meaning ‘universal without 
uni(queness)’. We would have full universality if we included the information of the 
root of f; we will come back to this construction in Chapter VII, when we analyze 
field extensions in greater depth. 

Example 5.8. For k = R and f(x) = x^2 + 1, the field constructed in Proposition 5.7 
is (isomorphic to) C: this was checked carefully in Example III.4.8. 

Similarly, Q[t]/(t^2 - 2) produces a field containing Q and in which there is a 
‘square root of 2’. There are two embeddings of this field in R, because R contains 
two distinct square roots of 2: ±√2. ⌟ 

The fact that the irreducible polynomial f ∈ k[x] acquires a root in the extension 
F constructed in Proposition 5.7 implies that f has a linear (i.e., degree 1) 
factor over F; in particular, if deg(f) > 1, then f is no longer irreducible over F. 

Given any polynomial f ∈ k[x], it is easy to construct an extension of k in 
which f factors completely as a product of linear factors (Exercise 5.13). The case 
in which this already happens in k itself for every nonzero f is very important, and 
hence it is given a name. 

Definition 5.9. A field k is algebraically closed if all irreducible polynomials in 
k[x] have degree 1. ⌟ 

We have encountered this notion in passing, back in Example III.4.14 (and 
Exercise III.4.21). The following lemma is left to the reader: 

Lemma 5.10. A field k is algebraically closed if and only if every polynomial 
f ∈ k[x] factors completely as a product of linear factors, if and only if every 
nonconstant polynomial f ∈ k[x] has a root in k. 

In other words, a field k is algebraically closed if the only polynomial equations 
with coefficients in k and no solutions are equations ‘of degree 0’, such as 1 = 0. 
The result known as Nullstellensatz (also due to David Hilbert; we have mentioned 
this result in Chapter III) is a vast generalization of this observation and one of 
the pillars of (old-fashioned) algebraic geometry. We will come back to all of this 
in §VII.2. 

Is any of the finite fields we have encountered (such as Z/pZ, for p prime) 
algebraically closed? No. Adapting Euclid’s argument proving the infinitude of 
prime integers (Exercise 2.24) reveals that algebraically closed fields are infinite. 

Proposition 5.11. Let k be an algebraically closed field. Then k is infinite. 

Proof. By contradiction, assume that k is algebraically closed and finite; let the 
elements of k be c_1, ..., c_N. 

Then there are exactly N irreducible monic polynomials in k[x], namely (x - c_1), 
..., (x - c_N). Consider the polynomial 

\[ f(x) = (x - c_1) \cdots (x - c_N) + 1 : \]

for all c ∈ k we have 

\[ f(c) = (c - c_1) \cdots (c - c_N) + 1 = 1 \neq 0, \]

since c equals one of the c_i’s. Thus f(x) is a nonconstant polynomial with no roots, 
contradicting Lemma 5.10. □ 
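
The polynomial used in this proof can be evaluated explicitly over a small finite field. The sketch below is only an illustration for the prime fields Z/pZ (the function name is ours, and p is assumed to be prime); it tabulates the values of f(x) = (x - 0)(x - 1) ⋯ (x - (p - 1)) + 1.

```python
def euclid_polynomial_values(p):
    """Values of f(x) = (x - 0)(x - 1)...(x - (p - 1)) + 1 at every c in Z/pZ."""
    values = []
    for c in range(p):
        prod = 1
        for ci in range(p):
            prod = (prod * (c - ci)) % p   # one factor vanishes, namely ci == c
        values.append((prod + 1) % p)
    return values

for p in (2, 3, 5, 7):
    print(p, euclid_polynomial_values(p))  # every value is 1, so f has no roots in Z/pZ
```

Since f is nonconstant and has no roots in Z/pZ, none of these fields is algebraically closed, in agreement with Proposition 5.11.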

Since finite fields are not algebraically closed, the reader may suspect that 
algebraically closed fields necessarily have characteristic 0 (cf. Exercise 4.17). This 
is not so: there are algebraically closed fields of any characteristic. In fact, as we will 
see in due time, every field F may be embedded into an algebraically closed field; 
the smallest such extension is called the ‘algebraic closure’ of F and is denoted F̄. 
Thus, for example, the algebraic closure of Z/2Z is an algebraically closed field of 
characteristic 2. 

The algebraic closure Q̄ of Q is a countable subfield of C. The extension Q ⊆ Q̄, 
and especially its ‘Galois group’, cf. §VII.6, is considered one of the most important 
objects of study in mathematics (cf. the discussion following Corollary VII.7.6). 
We will construct the algebraic closure of a field rather explicitly, in §VII.2.1. 

5.3. Irreducibility in C[x], R[x], Q[x]. Every polynomial f ∈ C[x] factors 
completely over C. Indeed, 

Theorem 5.12. C is algebraically closed. 
Gauss is credited with providing the first proof²¹ of this fundamental theorem 
(which is indeed known as the fundamental theorem of algebra). 

‘Algebraic’ proofs of the fundamental theorem of algebra require more than we 
know at this point (we will encounter one in §VII.6, after we have seen a little Galois 
theory); curiously, a little complex analysis makes the statement nearly trivial. Here 
is a sketch of such an argument. Let f ∈ C[x] be a nonconstant polynomial; the 
task (cf. Lemma 5.10) consists of proving that f has a root in C. Whatever f(0) 
is, we can find an r ∈ R large enough that |f(z)| > |f(0)| for all z on the circle 
|z| = r (since f is nonconstant, lim_{z→∞} |f(z)| = +∞). The disk |z| ≤ r is compact, 
so the continuous function |f(z)| has a minimum on it; by the choice of r, it must 
be somewhere in the interior of the disk, say at z = a. The minimum modulus 
principle (cf. Exercise 5.16) then implies that f(a) = 0, q.e.d. 

By Theorem 5.12, irreducibility of polynomials in C[x] is as simple as it can 
be: a nonconstant polynomial f ∈ C[x] is irreducible if and only if it has degree 1. 
Every f ∈ C[x] of degree ≥ 2 is reducible. 

Over R, the situation is almost as simple: 

Proposition 5.13. Every polynomial f ∈ R[x] of degree ≥ 3 is reducible. 

The nonconstant irreducible polynomials in R[x] are precisely the polynomials 
of degree 1 and the quadratic polynomials 

\[ f = ax^2 + bx + c \]

with b^2 - 4ac < 0. 

Proof. Let f ∈ R[x] be a nonconstant polynomial; 

\[ f = a_0 + a_1 x + \cdots + a_n x^n \]

with all a_i ∈ R. By Theorem 5.12, f has a complex root z: 

\[ a_0 + a_1 z + \cdots + a_n z^n = 0. \]

Applying complex conjugation z ↦ z̄ and noting that ā_i = a_i since a_i ∈ R, 

\[ a_0 + a_1 \bar z + \cdots + a_n \bar z^n = \bar a_0 + \bar a_1 \bar z + \cdots + \bar a_n \bar z^n = \overline{a_0 + a_1 z + \cdots + a_n z^n} = 0 : \]

this says that z̄ is also a root of f. There are two possibilities: 

• either z = z̄, that is, z = r was real to begin with, and hence (x - r) is a factor 
of f in R[x]; or 

• z ≠ z̄, and then (x - z) and (x - z̄) are nonassociate irreducible factors of f 
in C[x]. In this case (since C[x] is a UFD!) 

\[ x^2 - (z + \bar z)x + z\bar z = (x - z)(x - \bar z) \]

divides f. 


21 Actually, Gauss's first proof apparently had a gap; the first rigorous proof is due to Argand. 
Later, Gauss fixed the gap in his proof and provided several other proofs. 
Since z + z̄ and z z̄ are both real numbers, this analysis shows that every 
nonconstant f ∈ R[x] has an irreducible factor of degree ≤ 2, proving the first 
statement. 

The second statement then follows immediately from Proposition 5.3 and the 
fact that the quadratic polynomials in R[x] with no real roots are precisely the 
polynomials ax^2 + bx + c with b^2 - 4ac < 0. □ 

The following remark is an immediate consequence of Proposition 5.13 and is 
otherwise evident from elementary calculus (Exercise 5.17): 

Corollary 5.14. Every polynomial f ∈ R[x] of odd degree has a real root. 

Summarizing, the issue of irreducibility of polynomials in C[x] or R[x] is very 
straightforward. By contrast, irreducibility in Q[x] is a complicated business: as we 
have seen (Proposition 4.16), it is just as subtle as irreducibility in Z[x]. There is 
no sharp characterization of irreducibility in Z[x]; but the following simple-minded 
remarks (and Eisenstein’s criterion, discussed in §5.4) suffice in many interesting 
cases. 

One simple-minded remark is that if φ : R → S is a homomorphism, and a 
is reducible in R, then φ(a) is likely to be reducible in S. This is just because if 
a = bc, then φ(a) = φ(b)φ(c). 

Of course this is not quite right: for example because φ(a) and/or φ(b) and/or 
φ(c) may be units in S. But with due care for special cases this observation may 
be used to great effect, especially in the contrapositive form: if φ(a) is irreducible 
in S, then a is (likely...) irreducible in R. 

Here is a simple statement formalizing these remarks, for R = Z[x]. The natural 
projection Z → Z/pZ induces a surjective homomorphism π : Z[x] → Z/pZ[x]: 
quite simply, π(f) is obtained from f by reading all coefficients modulo p; we will 
write ‘f mod p’ for the result of this operation. 

Proposition 5.15. Let f ∈ Z[x] be a primitive polynomial, and let p be a prime 
integer. Assume f mod p has the same degree as f and is irreducible in Z/pZ[x]. 
Then f is irreducible in Z[x]. 

Proof. Argue contrapositively: if f is primitive and reducible in Z[x] and deg f = 
n, then f = gh with deg g = d, deg h = e, d + e = n, and both d, e positive. But 
then the same can be said of f mod p, so f mod p is also reducible. □ 

The hypothesis on degrees is necessary, precisely to take care of the ‘special 
cases’ mentioned in the discussion preceding the statement. For example, (2x^3 + 
3x^2 + 3x + 1) equals (x^2 + x + 1) mod 2, and the latter is irreducible in Z/2Z[x]; 
yet (2x + 1) is a factor of the former. The point is of course that 2x + 1 is a unit 
mod 2. 
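
For small p and small degree, the hypotheses of Proposition 5.15 can be tested by brute force. The following is a rough sketch, not an efficient algorithm: all function names are ours, polynomials are coefficient lists from degree 0 upward, p is assumed prime, and irreducibility mod p is decided by trial division by every monic polynomial of degree at most half the degree of f. Primitivity of f is not checked.

```python
from itertools import product

def trim(f):
    # drop trailing zero coefficients (coefficients are listed from degree 0 up)
    while f and f[-1] == 0:
        f = f[:-1]
    return f

def reduce_mod(f, p):
    return trim([c % p for c in f])

def remainder(f, g, p):
    """Remainder of f divided by a monic polynomial g, with coefficients in Z/pZ."""
    f = reduce_mod(f, p)
    while len(f) >= len(g):
        lead, shift = f[-1], len(f) - len(g)
        for i, c in enumerate(g):
            f[shift + i] = (f[shift + i] - lead * c) % p
        f = trim(f)
    return f

def is_irreducible_mod(f, p):
    f = reduce_mod(f, p)
    n = len(f) - 1
    if n < 1:
        return False
    # a reducible polynomial of degree n has a monic factor of degree between 1 and n // 2
    for d in range(1, n // 2 + 1):
        for lower in product(range(p), repeat=d):
            if not remainder(f, list(lower) + [1], p):
                return False
    return True

def criterion_applies(f, p):
    """Degree is preserved mod p and f mod p is irreducible (primitivity not checked)."""
    return len(reduce_mod(f, p)) == len(trim(list(f))) and is_irreducible_mod(f, p)

print(criterion_applies([7, 5, 0, 2, 3], 2))  # 3x^4 + 2x^3 + 5x + 7 reduces to x^4 + x + 1
print(criterion_applies([1, 3, 3, 2], 2))     # 2x^3 + 3x^2 + 3x + 1 drops degree mod 2
```

The first polynomial is primitive and reduces to the irreducible x^4 + x + 1 of the same degree, so Proposition 5.15 applies to it; the second is the example from the text, where the reduction is irreducible but the degree hypothesis fails.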

Warning: As we will see in a fairly distant future (Example VII.5.3), there 
are irreducible polynomials in Z[x] which are reducible modulo all primes! So 
Proposition 5.15 cannot be turned into a characterization of irreducibility in Z[x]. 
It is however rather useful in producing examples. For instance, 
Corollary 5.16. There are irreducible polynomials in Z[x] and Q[x] of arbitrarily 
large degree. 

Proof. By Proposition 4.16, the statement for Z[x] implies the one for Q[x]. By 
Proposition 5.15, it suffices to verify that there are irreducible polynomials in 
Z/pZ[x] of arbitrarily large degree, for any prime integer p. This is a particular 
case of Exercise 5.11. □ 

5.4. Eisenstein’s criterion. We are giving this result in a separate subsection 
only on account of the fact that it is very famous; it is also very simple-minded, 
like the considerations leading up to Proposition 5.15, and further it is not really a 
‘criterion’, in the sense that it also does not give a characterization of irreducibility. 

The result is usually stated for Z, but it holds for every ring R. 

Proposition 5.17. Let R be a (commutative) ring, and let 𝔭 be a prime ideal of 
R. Let 

\[ f = a_0 + a_1 x + \cdots + a_n x^n \in R[x] \]

be a polynomial, and assume that 

• a_n ∉ 𝔭; 

• a_i ∈ 𝔭 for i = 0, ..., n - 1; 

• a_0 ∉ 𝔭^2. 

Then f is not the product of polynomials of degree < n in R[x]. 

Proof. Argue by contradiction. Assume f = gh in R[x], with both d = deg g and 
e = deg h less than n = deg f; write 

\[ g = b_0 + b_1 x + \cdots + b_d x^d, \qquad h = c_0 + c_1 x + \cdots + c_e x^e, \]

and note that necessarily d > 0 and e > 0. Consider f modulo 𝔭: thus 

\[ \underline{f} = \underline{g}\,\underline{h} \quad\text{in } (R/\mathfrak{p})[x], \]

where \underline{f} denotes f modulo 𝔭, etc. 

By hypothesis, \underline{f} = \underline{a_n} x^n modulo 𝔭, where \underline{a_n} ≠ 0 in R/𝔭. 
Since R/𝔭 is an integral domain, factors of \underline{f} must also be monomials: that is, 
necessarily 

\[ \underline{g} = \underline{b_d}\, x^d, \qquad \underline{h} = \underline{c_e}\, x^e. \]

Since d > 0, e > 0, this implies b_0 ∈ 𝔭, c_0 ∈ 𝔭. 

But then a_0 = b_0 c_0 ∈ 𝔭^2, contradicting the hypothesis. □ 

For example, x^4 + 2x^2 + 2 must be irreducible in Z[x] (and hence in Q[x]): apply 
Eisenstein’s criterion with R = Z, 𝔭 = (2). More exciting applications involve whole 
classes of polynomials: 

Example 5.18. For all n and all primes p, the polynomial x^n - p is irreducible 
in Z[x]. This follows immediately from Eisenstein’s criterion and gives an alternative 
proof of Corollary 5.16. ⌟ 
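
For R = Z and 𝔭 = (p), the three hypotheses of Proposition 5.17 are simple divisibility conditions, since 𝔭^2 = (p^2). A minimal sketch of the check follows; the function name is ours, p is assumed to be a prime integer (this is not verified), and coefficients are listed from degree 0 upward.

```python
def eisenstein_applies(coeffs, p):
    """Check the hypotheses of Eisenstein's criterion over Z for the prime p.

    coeffs[i] = a_i; the polynomial is assumed to have degree at least 1.
    """
    *lower, leading = coeffs
    return (leading % p != 0                      # a_n not in (p)
            and all(a % p == 0 for a in lower)    # a_0, ..., a_{n-1} in (p)
            and coeffs[0] % (p * p) != 0)         # a_0 not in (p)^2

print(eisenstein_applies([2, 0, 2, 0, 1], 2))        # x^4 + 2x^2 + 2 with p = 2
print(eisenstein_applies([-3, 0, 0, 0, 0, 1], 3))    # x^5 - 3 with p = 3
print(eisenstein_applies([4, 0, 1], 2))              # x^2 + 4: fails, 4 lies in (2)^2
```

A `True` result only certifies that f is not a product of polynomials of lower degree; irreducibility in Z[x] additionally uses that the polynomials in question are primitive. A `False` result says nothing, since the criterion is not a characterization.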
Example 5.19. This is probably the most famous application of Eisenstein’s 
criterion. Let p be a prime integer, and let 

\[ f(x) = 1 + x + x^2 + \cdots + x^{p-1} \in \mathbb{Z}[x]. \]

These polynomials are called cyclotomic; we will encounter them again in §VII.5.2. 

It may not look like it, but Eisenstein’s criterion may be used to prove that 
f(x) is irreducible. The trick (which the reader should endeavor to remember) is 
to apply the shift x ↦ x + 1. It is hopefully clear that f(x) is irreducible if and 
only if f(x + 1) is; thus we are reduced to showing that 

\[ f(x + 1) = 1 + (x + 1) + (x + 1)^2 + \cdots + (x + 1)^{p-1} \]

is irreducible. This is better than it looks: since f(x) = (x^p - 1)/(x - 1), 

\[ f(x + 1) = \frac{(x + 1)^p - 1}{(x + 1) - 1} = x^{p-1} + \binom{p}{1} x^{p-2} + \cdots + \binom{p}{p-1}. \]

Eisenstein’s criterion proves that this is irreducible, since $\binom{p}{p-1} = p$ and 

Claim 5.20. For p prime and k = 1, ..., p - 1, p divides $\binom{p}{k}$. 

There is nothing to this, because 

\[ \binom{p}{k} = \frac{p!}{k!\,(p - k)!} \]

and p divides the numerator and does not divide the denominator; the claim follows 
as p is prime. It may be worthwhile noting that this fact does not hold if p is not 
prime: for example, 4 does not divide $\binom{4}{2}$. ⌟ 
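
Claim 5.20 and the shape of f(x + 1) are easy to confirm numerically. The sketch below (the function name is ours) uses the expansion ((x + 1)^p - 1)/x = Σ_{k=1}^{p} C(p, k) x^{k-1} to list the coefficients of f(x + 1), and checks the divisibility of the binomial coefficients with `math.comb`.

```python
from math import comb

def shifted_cyclotomic(p):
    """Coefficients (degree 0 upward) of f(x + 1), where f(x) = 1 + x + ... + x^(p-1)."""
    # ((x + 1)^p - 1) / x = sum over k = 1..p of C(p, k) x^(k-1)
    return [comb(p, k) for k in range(1, p + 1)]

print(shifted_cyclotomic(5))                          # [5, 10, 10, 5, 1]
print(all(comb(7, k) % 7 == 0 for k in range(1, 7)))  # True: 7 divides C(7, k) for 0 < k < 7
print(comb(4, 2) % 4)                                 # 2: the claim fails for the non-prime 4
```

For p = 5 the constant term is 5, which is divisible by 5 but not by 25, and all other non-leading coefficients are multiples of 5, as Eisenstein’s criterion requires.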


Exercises 


5.1. Let f(x) ∈ C[x]. Prove that a is a root of f with multiplicity r if and only 
if f(a) = f'(a) = ⋯ = f^{(r-1)}(a) = 0 and f^{(r)}(a) ≠ 0, where f^{(k)}(a) denotes the 
value of the k-th derivative of f at a. Deduce that f(x) ∈ C[x] has multiple roots 
if and only if gcd(f(x), f'(x)) ≠ 1. [5.2] 

5.2. Let F be a subfield of C, and let f(x) be an irreducible polynomial in F[x]. 
Prove that f(x) has no multiple roots in C. (Use Exercises 2.22 and 5.1.) 

5.3. Let R be a ring, and let f(x) = a_{2n} x^{2n} + a_{2n-2} x^{2n-2} + ⋯ + a_2 x^2 + a_0 ∈ R[x] 
be a polynomial only involving even powers of x. Prove that if g(x) is a factor of f(x), 
so is g(-x). 

5.4. Show that x^4 + x^2 + 1 is reducible in Z[x]. Prove that it has no rational roots, 
without finding its (complex) roots. 

5.5. ▷ Prove Proposition 5.3. [§5.1]

5.6. ▷ Construct fields with 27 elements and with 121 elements. [§5.1]

5.7. Let R be an integral domain, and let $f(x) \in R[x]$ be a polynomial of degree d.
Prove that f(x) is determined by its value at any d + 1 distinct elements of R.

5.8. ¬ Let K be a field, and let $a_0, \dots, a_d$ be distinct elements of K. Given any
elements $b_0, \dots, b_d$ in K, construct explicitly a polynomial $f(x) \in K[x]$ of degree at most d
such that $f(a_0) = b_0, \dots, f(a_d) = b_d$, and show that this polynomial is unique.
(Hint: First solve the problem assuming that only one $b_i$ is not equal to zero.) This
process is called Lagrange interpolation. [5.9]

5.9. ¬ Pretend you can factor integers, and then use Lagrange interpolation (cf. Exercise 5.8)
to give a finite algorithm to factor polynomials²² with integer coefficients
over Q[x]. Use your algorithm to factor $(x-1)(x-2)(x-3)(x-4) + 1$. [5.10]

5.10. Prove that the polynomial $(x-1)(x-2)\cdots(x-n) - 1$ is irreducible in Q[x]
for all n > 1. (Hint: Think along the lines of Exercise 5.9.)

5.11. ▷ Let F be a finite field. Prove that there are irreducible polynomials in F[x]
of arbitrarily high degree. (Hint: Exercise 2.24. Note that Example 5.16 takes care
of Z/pZ, but what about all other finite fields?) [§5.3]

5.12. Prove that applying the construction in Proposition 5.7 to an irreducible
linear polynomial in k[x] produces a field isomorphic to k.

5.13. ▷ Let k be a field, and let $f \in k[x]$ be any polynomial. Prove that there is an
extension $k \subseteq F$ in which f factors completely as a product of linear terms. [§5.2,
§VI.7.3]

5.14. How many different embeddings of the field $\mathbb{Q}[t]/(t^3 - 2)$ are there in R?
How many in C?

5.15. Prove Lemma 5.10.

5.16. ▷ If you know about the ‘maximum modulus principle’ in complex analysis:
formulate and prove the ‘minimum modulus principle’ used in the sketch of the
proof of the fundamental theorem of algebra. [§5.3]

5.17. ▷ Let $f \in \mathbb{R}[x]$ be a polynomial of odd degree. Use the intermediate value
theorem to give an ‘algebra-free’ proof of the fact that f has real roots. [§5.3,
§VII.7.1]

5.18. Let $f \in \mathbb{Z}[x]$ be a cubic polynomial such that f(0) and f(1) are odd and
with odd leading coefficient. Prove that f is irreducible in Q[x].

5.19. Give a proof of the fact that $\sqrt{2}$ is not rational by using Eisenstein’s criterion.

5.20. Prove that $x^6 + 4x^3 + 1$ is irreducible by using Eisenstein’s criterion.

5.21. Prove that $1 + x + x^2 + \cdots + x^{n-1}$ is reducible over Z if n is not prime.

5.22. Let R be a UFD, and let $a \in R$ be an element that is not divisible by
the square of some irreducible element in its factorization. Prove that $x^n - a$ is
irreducible for every integer n > 1.

22 It is in fact much harder to factor integers than integer polynomials.

5.23. Decide whether $y^5 + x^2 y^3 + x^3 y^2 + x$ is reducible or irreducible in C[x, y].

5.24. Prove that $\mathbb{C}[x, y, z, w]/(xw - yz)$ is an integral domain, by using Eisenstein’s
criterion. (We used this ring as an example in §2.2, but we did not have the patience
to prove that it was a domain back then!)


6. Further remarks and examples 


6.1. Chinese remainder theorem. Suppose all you know about an integer is
its class modulo several numbers; can you reconstruct the integer? Also, if you
are given arbitrary classes in Z/nZ for several integers n, can you find an $N \in \mathbb{Z}$
satisfying all these congruences simultaneously?

The answer is no in both cases, for trivial reasons. If an integer N satisfies given
congruences modulo $n_1, \dots, n_k$, then adding any multiple of $n_1 \cdots n_k$ to N produces
an integer satisfying the same congruences (so N cannot be entirely reconstructed
from the given data); and there is no integer N such that $N \equiv 1 \bmod 2$ and
$N \equiv 2 \bmod 4$, so there are plenty of congruences which cannot be simultaneously satisfied.

The Chinese remainder theorem (CRT) sharpens these questions so that they
can in fact be answered affirmatively, and (in its modern form) it generalizes them
to a much broader setting. Its statement is most pleasant for PIDs; it is very
impressive²³ even in the limited context of Z. Protoversions of the theorem have
already appeared here and there in this book, e.g., Lemma IV.6.1.

We will first give the more general version, which is in some way simpler. Let 
R be any commutative ring. 

Theorem 6.1. Let $I_1, \dots, I_k$ be ideals of R such that $I_i + I_j = (1)$ for all $i \neq j$.
Then the natural homomorphism

$$\varphi : R \to \frac{R}{I_1} \times \cdots \times \frac{R}{I_k}$$

is surjective and induces an isomorphism

$$\tilde\varphi : \frac{R}{I_1 \cdots I_k} \xrightarrow{\ \sim\ } \frac{R}{I_1} \times \cdots \times \frac{R}{I_k}.$$

The ‘natural’ homomorphism $\varphi$ is determined by the canonical projections
$R \to R/I_j$ and the universal property of products; the homomorphism $\tilde\varphi$ is induced
by virtue of the universal property of quotients, since $I_1 \cdots I_k \subseteq I_j$ for all j, hence
$I_1 \cdots I_k \subseteq \ker \varphi$. In fact, clearly

$$\ker \varphi = I_1 \cap \cdots \cap I_k;$$

so the second part of the statement of Theorem 6.1 follows immediately from the
first part, the ‘first isomorphism theorem’, and the following (independently
interesting) observation:

23 Allegedly, a large number of ‘Putnam’ problems can be solved by clever applications of
the CRT.

Lemma 6.2. Let $I_1, \dots, I_k$ be ideals of R such that $I_i + I_j = (1)$ for all $i \neq j$.
Then $I_1 \cdots I_k = I_1 \cap \cdots \cap I_k$.

Proof. A simple induction reduces the general statement to the case k = 2. Therefore,
assume I and J are ideals of R, such that I + J = (1). The inclusion $IJ \subseteq I \cap J$
holds for all ideals I, J, so the task amounts to proving $I \cap J \subseteq IJ$ when I + J = (1).

If I + J = (1), then there exist elements $a \in I$, $b \in J$ such that a + b = 1. But
if $r \in I \cap J$, then

$$r = r \cdot 1 = r(a + b) = ra + rb \in IJ,$$

because $ra \in IJ$ as $r \in J$ and $a \in I$, while $rb \in IJ$ as $r \in I$ and $b \in J$. The
statement follows. □

Thus, we only need to prove the first part of Theorem 6.1: the surjectivity of $\varphi$
under the given hypotheses. The proof of this is a simple exercise but may be tricky
to set up. The following remark streamlines it considerably:

Lemma 6.3. Let $I_1, \dots, I_k$ be ideals of R such that $I_i + I_k = (1)$ for all
$i = 1, \dots, k-1$. Then $(I_1 \cdots I_{k-1}) + I_k = (1)$.

Proof. By hypothesis, for $i = 1, \dots, k-1$ there exists $a_i \in I_k$ such that $1 - a_i \in I_i$.
Then

$$(1 - a_1) \cdots (1 - a_{k-1}) \in I_1 \cdots I_{k-1},$$

and

$$1 - (1 - a_1) \cdots (1 - a_{k-1}) \in I_k,$$

because it is a combination of $a_1, \dots, a_{k-1} \in I_k$. □

Proof of Theorem 6.1. Argue by induction on k. For k = 1, there is nothing to
show. For k > 1, assume the statement is known for fewer ideals. Thus, we may
assume that the natural projection induces an isomorphism

$$\frac{R}{I_1 \cdots I_{k-1}} \cong \frac{R}{I_1} \times \cdots \times \frac{R}{I_{k-1}},$$

and all that we have left to prove is that the natural homomorphism

$$R \to \frac{R}{I_1 \cdots I_{k-1}} \times \frac{R}{I_k}$$

is surjective. By Lemma 6.3, $(I_1 \cdots I_{k-1}) + I_k = (1)$; thus we are reduced to the
case of two ideals.

Let then I, J be ideals of a commutative ring R, such that I + J = (1), and let
$r_I, r_J \in R$; we have to verify that $\exists r \in R$ such that $r \equiv r_I \bmod I$ and $r \equiv r_J \bmod J$.
Since I + J = (1), there are $a \in I$, $b \in J$ such that a + b = 1. Let $r = a r_J + b r_I$:
then

$$r = a r_J + (1 - a) r_I = r_I + a(r_J - r_I) \equiv r_I \bmod I$$

as $a \in I$, and

$$r = (1 - b) r_J + b r_I = r_J + b(r_I - r_J) \equiv r_J \bmod J$$
as $b \in J$, as needed, and completing the proof. □

In a PID, the CRT takes the following form:

Corollary 6.4. Let R be a PID, and let $a_1, \dots, a_k \in R$ be elements such that
$\gcd(a_i, a_j) = 1$ for all $i \neq j$. Let $a = a_1 \cdots a_k$. Then the function

$$\varphi : \frac{R}{(a)} \to \frac{R}{(a_1)} \times \cdots \times \frac{R}{(a_k)}$$

defined by $r + (a) \mapsto (r + (a_1), \dots, r + (a_k))$ is an isomorphism.

This is an immediate consequence of Theorem 6.1, since (in a PID!) gcd(a, b) = 1
if and only if (a, b) = (1) as ideals. This is not the case for arbitrary UFDs, and
indeed the natural map

$$\mathbb{Z}[x] \to \frac{\mathbb{Z}[x]}{(2)} \times \frac{\mathbb{Z}[x]}{(x)}$$

is not surjective (check this!), even though gcd(2, x) = 1. But the kernel of this
map is (2x); and this is what can be expected in general from the CRT over UFDs
(Exercise 6.6).

As Z is a PID, Corollary 6.4 holds for Z and gives the promised answer to a
revised version of the questions posed at the beginning of this subsection. In fact,
tracing the proof in this case gives an effective procedure to solve simultaneous
congruences over Z (or in fact over any Euclidean domain, as will be apparent from
the argument).

To see this more explicitly, let $n_1, \dots, n_k$ be pairwise relatively prime integers,
and let $n = n_1 \cdots n_k$; for each i, let $m_i = n/n_i$. Then $n_i$ and $m_i$ are relatively
prime²⁴, and we can use the Euclidean algorithm to explicitly find integers $a_i$, $b_i$
such that

$$a_i n_i + b_i m_i = 1.$$

The numbers $q_i = b_i m_i$ have the property that

$$q_i \equiv 1 \bmod n_i, \qquad q_i \equiv 0 \bmod n_j \quad \forall j \neq i$$

(why?). These integers may be used to solve any given system of congruences
modulo $n_1, \dots, n_k$: indeed, if $r_1, \dots, r_k \in \mathbb{Z}$ are given, then

$$N := r_1 q_1 + \cdots + r_k q_k$$

satisfies

$$N \equiv r_1 \cdot 0 + \cdots + r_i \cdot 1 + \cdots + r_k \cdot 0 \equiv r_i \bmod n_i$$

for all i.

The same procedure can be applied over any Euclidean domain R; see Exercise 6.7
for an example.
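The procedure just described translates almost word for word into code. The following Python sketch works over Z and assumes positive, pairwise relatively prime moduli; the function names `extended_gcd` and `crt` are hypothetical and chosen only for this illustration, while the local variables mirror the symbols $n_i$, $m_i$, $a_i$, $b_i$, $q_i$ used above.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with s*a + t*b == g, where g = gcd(a, b) for a, b > 0."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        quotient = old_r // r
        old_r, r = r, old_r - quotient * r
        old_s, s = s, old_s - quotient * s
        old_t, t = t, old_t - quotient * t
    return old_r, old_s, old_t


def crt(residues, moduli):
    """Solve N = r_i mod n_i for positive, pairwise relatively prime moduli n_i.

    Returns the unique solution N with 0 <= N < n_1 * ... * n_k.
    """
    n = 1
    for n_i in moduli:
        n *= n_i
    total = 0
    for r_i, n_i in zip(residues, moduli):
        m_i = n // n_i
        g, a_i, b_i = extended_gcd(n_i, m_i)   # a_i*n_i + b_i*m_i == g
        if g != 1:
            raise ValueError("moduli are not pairwise relatively prime")
        q_i = b_i * m_i                        # q_i = 1 mod n_i, q_i = 0 mod n_j for j != i
        total += r_i * q_i
    return total % n


if __name__ == "__main__":
    N = crt([2, 3, 2], [3, 5, 7])
    print(N, N % 3, N % 5, N % 7)
```

Reducing the final sum modulo n is a design choice made here to return a canonical representative; as observed at the beginning of this subsection, any integer differing from it by a multiple of n satisfies the same congruences.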


24 This is a particular case of Lemma 6.3 and is clear anyway from gcd considerations.

6.2. Gaussian integers. The rings Z and k[x] (where k is a field) may be the
only examples of Euclidean domains known to our reader. The next most famous
example is the ring of Gaussian integers; this is a very pretty ring, and it has
elementary but striking applications in number theory (one instance of which we
will see in §6.3).

Abstractly, we may define this ring as

$$\mathbb{Z}[i] := \frac{\mathbb{Z}[x]}{(x^2 + 1)},$$

and we are going to verify that Z[i] is a Euclidean domain.

The notation Z[i] is justified by the discussion in §5.2 (cf. especially Proposition 5.7):
Z[i] may be viewed as the ‘smallest’ ring containing Z and a root of
$x^2 + 1$, that is, a square root i of -1. The natural embedding

$$\frac{\mathbb{Z}[x]}{(x^2 + 1)} \hookrightarrow \frac{\mathbb{R}[x]}{(x^2 + 1)}$$

and the identification of the rightmost ring with C (Example III.4.8) realize Z[i] as
a subring of C; tracing this embedding shows that we could equivalently define

$$\mathbb{Z}[i] = \{a + bi \in \mathbb{C} \mid a, b \in \mathbb{Z}\}.$$

Thus, the reader should think of Z[i] as consisting of the complex numbers whose
real and imaginary parts are integers; this may be referred to as the ‘integer lattice’
in C:


[Figure: the integer lattice Z[i] in C, with the point -2 - i labeled.]


This picture is particularly compelling, because it allows us to ‘visualize’ principal
ideals in Z[i]: simple properties of complex multiplication show that the multiples
of a fixed $w \in \mathbb{Z}[i]$ form a regular lattice superimposed on Z[i]. For example, the
fattened dots in the picture

[Figure: the integer lattice in C, with the multiples of -2 - i drawn as fattened dots.]

represent the ideal (-2 - i) in Z[i]: an enlarged, tilted lattice superimposed on
the integer lattice in C. The reader should stop reading now and make sure to
understand why this works out so neatly.

A Gaussian integer is a complex number, and as such it has a norm

$$N(a + bi) := (a + bi)(a - bi) = a^2 + b^2.$$

Geometrically, this is the square of the distance from the origin to a + bi.

The norm of a Gaussian integer is a nonnegative integer; thus N is a function

$$\mathbb{Z}[i] \to \mathbb{Z}^{\geq 0}.$$

Lemma 6.5. The function N is a Euclidean valuation on Z[i]; further, N is
multiplicative in the sense that $\forall z, w \in \mathbb{Z}[i]$

$$N(zw) = N(z)N(w).$$

Proof. The multiplicativity is an immediate consequence of the elementary
properties of complex conjugation:

$$N(zw) = (zw)\overline{(zw)} = (z\bar z)(w\bar w) = N(z)N(w).$$

To see that N is a Euclidean valuation, we have to show how to perform a ‘division
with remainder’ in Z[i]. It is not hard, but it is a bit messy, to do this algebraically;
we will attempt to convince the reader that this is in fact visually evident (and
then the reader can have fun producing the needed algebraic computations).

Let $z, w \in \mathbb{Z}[i]$, and assume $w \neq 0$. The ideal (w) is a lattice superimposed
on Z[i]. The given z is either one of the vertices of this lattice (in which case z is a
multiple of w, so that the division z/w can be performed in Z[i], with remainder 0)
or it sits inside one of the ‘boxes’ of the lattice. In the latter case, pick any of the
vertices of that box, that is, a multiple qw of w, and let r = z - qw. The situation
may look as follows:

[Figure: the point z inside a box of the lattice (w), joined by a segment to the vertex qw.]

Then we have obtained

$$z = qw + r,$$

and the norm of r is the square of the length of the segment between qw and z.
Since this segment is contained in a box, and the square of the size of the box
is N(w), we have achieved

$$N(r) < N(w),$$

completing the proof. □
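For readers who prefer to see the division carried out, here is a minimal Python sketch. It makes one specific choice that the proof leaves open: among the vertices of the box it picks one nearest to z, by rounding the real and imaginary parts of z/w to the nearest integers, which guarantees $N(r) \leq N(w)/2$. Gaussian integers are represented as pairs (a, b) standing for a + bi; this representation and all function names are assumptions made for the illustration.

```python
def norm(z):
    """N(a + bi) = a^2 + b^2."""
    a, b = z
    return a * a + b * b


def multiply(z, w):
    """(a + bi)(c + di) = (ac - bd) + (ad + bc)i."""
    (a, b), (c, d) = z, w
    return (a * c - b * d, a * d + b * c)


def nearest_integer(numerator, denominator):
    """An integer within 1/2 of numerator/denominator, for denominator > 0."""
    return (2 * numerator + denominator) // (2 * denominator)


def divide_with_remainder(z, w):
    """Return (q, r) with z = q*w + r and N(r) < N(w), for w != 0."""
    (a, b), (c, d) = z, w
    n = norm(w)
    if n == 0:
        raise ZeroDivisionError("division by zero in Z[i]")
    # z/w = z * conj(w) / N(w) = ((ac + bd) + (bc - ad)i) / (c^2 + d^2)
    e = nearest_integer(a * c + b * d, n)
    f = nearest_integer(b * c - a * d, n)
    q = (e, f)
    qw = multiply(q, w)
    r = (a - qw[0], b - qw[1])
    return q, r


if __name__ == "__main__":
    z, w = (7, 3), (-2, -1)
    q, r = divide_with_remainder(z, w)
    print(q, r, norm(r), norm(w))
```

All arithmetic is done in integers, so no floating-point rounding is involved; Exercise 6.12 asks for the proof that this choice of q works.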

As we know, all sorts of amazing properties hold for the ring of Gaussian
integers as a consequence of Lemma 6.5: Z[i] is a PID and a UFD; irreducible
elements are prime in Z[i]; greatest common divisors exist and may be found by
applying the Euclidean algorithm, etc. Any one of these facts would seem fairly
challenging in itself, but they are all immediate consequences of the existence of a
Euclidean valuation and of the general considerations in §2.

The fact that the norm is multiplicative simplifies further the analysis of the
ring Z[i].

Lemma 6.6. The units of Z[i] are ±1, ±i.

Proof. If u is a unit in Z[i], then there exists $v \in \mathbb{Z}[i]$ such that uv = 1. But
then N(u)N(v) = N(uv) = N(1) = 1 by multiplicativity, so N(u) is a unit in Z.
This implies N(u) = 1, and the only elements in Z[i] with norm 1 are ±1, ±i. The
statement follows. □

Lemma 6.7. Let $q \in \mathbb{Z}[i]$ be a prime element. Then there is a prime integer $p \in \mathbb{Z}$
such that N(q) = p or $N(q) = p^2$.

Proof. Since q is not a unit, $N(q) \neq 1$ (by Lemma 6.6). Thus N(q) is a nontrivial
product of (integer) primes, and since q is prime in $\mathbb{Z}[i] \supseteq \mathbb{Z}$, q must divide one
of the prime integer factors of N(q); let p be this integer prime. But then q | p
in Z[i], and it follows (by multiplicativity of the norm) that $N(q) \mid N(p) = p^2$.
Since $N(q) \neq 1$, the only possibilities are N(q) = p and $N(q) = p^2$, as claimed. □

Both possibilities presented in Lemma 6.7 occur, and studying this dichotomy
further will be key to the result in §6.3. But we can already work out several cases
‘by hand’:

Example 6.8. The prime integer 3 is a prime element of Z[i]; this can be verified
by proving that 3 is irreducible in Z[i] (since Z[i] is a UFD). For this purpose, note
that since N(3) = 9, the norm of a factor of 3 would have to be a divisor of 9,
that is, 1, 3, or 9. Gaussian integers with norm 1 are units, and those with norm
9 are associates of 3 (Exercise 6.10); thus a nontrivial factor of 3 would necessarily
have norm equal to 3. But there are no such elements in Z[i]; hence 3 is indeed
irreducible.

The prime integer 5 is not a prime element of Z[i]: running through the same
argument as we just did for 3, we find that a factor of 5 should have norm 5, and
2 + i does. In fact, 5 = (2 + i)(2 - i) is a prime factorization of 5 in Z[i].

Visually, what happens is that the lattice generated by 3 in Z[i] cannot be
refined further into a tighter lattice, while the one generated by 5 does admit a
refinement:

[Figure: the lattices generated by 3 and by 5 in Z[i], each with a circle through the origin.]

(the circles represent complex numbers with norm 3, 5, respectively).

The reader is encouraged to work out several other examples and to try to figure 
out what property of an integer prime makes it split in Z[i] (like 5 and unlike 3). 
But this should be done before reading on! 

6.3. Fermat’s theorem on sums of squares. Once armed with our knowledge
about rings, playing with Z[i] a little further teaches us something new about Z:
this is a beautiful and famous example of the success achieved by judicious use of
generalization over brute force.

First, let us complete the circle of thought begun with Lemma 6.7. Say that
a prime integer p splits in Z[i] if it is not a prime element of Z[i]; we have seen in
Example 6.8 that 5 splits in Z[i], while 3 does not.

Lemma 6.9. A positive integer prime $p \in \mathbb{Z}$ splits in Z[i] if and only if it is the
sum of two squares in Z.

Proof. First assume that $p = a^2 + b^2$, with $a, b \in \mathbb{Z}$. Then

$$p = (a + bi)(a - bi)$$

in Z[i], and $N(a \pm bi) = a^2 + b^2 = p \neq 1$, so neither of the two factors is a unit
in Z[i] (Lemma 6.6). Thus p is not irreducible, hence not prime, in Z[i].

Conversely, assume that p is not irreducible in Z[i]: then it has an irreducible
factor $q \in \mathbb{Z}[i]$, which is not an associate of p. Since q | p, by multiplicativity of
the norm we have $N(q) \mid N(p) = p^2$, and hence N(q) = p since q and p are not
associates (thus $N(q) \neq p^2$) and q is not a unit (thus $N(q) \neq 1$). If q = a + bi, we
find

$$p = N(q) = a^2 + b^2,$$

verifying that p is the sum of two squares and completing the proof. □

Thus, 2 splits (as $2 = 1^2 + 1^2$), and so do

$$5 = 1^2 + 2^2, \quad 13 = 2^2 + 3^2, \quad 17 = 1^2 + 4^2, \quad \dots$$

while

$$3, \ 7, \ 11, \ 19, \ 23, \ \dots$$

remain prime if viewed as elements of Z[i].

The next puzzle is as follows: what else distinguishes the primes in the first
list from the primes in the second list? Again, the reader who does not know the
answer already should pause and try to come up with a conjecture and then prove
that conjecture! Do not read the next statement before trying this on your own²⁵!


25 The point we are trying to make is that these are not difficult facts, and a conscientious
reader really is in the position of discovering and proving them on his or her own. This is extremely
remarkable: the theorem we are heading towards was stated by Fermat, without proof, in 1640,
and had to wait about one hundred years before being proven rigorously (by Euler, using ‘infinite
descent’). Proofs using Gaussian integers are due to Dedekind and had to wait another hundred
years. The moral is, of course, that the modern algebraic apparatus acts as a tremendous amplifier
of our skills.

Lemma 6.10. A positive odd prime integer p splits in Z[i] if and only if it is
congruent to 1 modulo 4.

Proof. The question is whether p is prime as an element of Z[i], that is, whether
Z[i]/(p) is an integral domain. But we have isomorphisms

$$\frac{\mathbb{Z}[i]}{(p)} \cong \frac{\mathbb{Z}[x]/(x^2+1)}{(p)} \cong \frac{\mathbb{Z}[x]}{(p, x^2+1)} \cong \frac{\mathbb{Z}[x]/(p)}{(x^2+1)} \cong \frac{\mathbb{Z}/p\mathbb{Z}[x]}{(x^2+1)}$$

by virtue of the usual isomorphism theorems (including an appearance of Lemma 4.1)
and taking some liberties with the language (for example, (p) means three different
things, hopefully self-clarifying through the context). Therefore,

$$\begin{aligned}
p \text{ splits in } \mathbb{Z}[i] &\iff \mathbb{Z}[i]/(p) \text{ is not an integral domain} \\
&\iff \mathbb{Z}/p\mathbb{Z}[x]/(x^2+1) \text{ is not an integral domain} \\
&\iff x^2 + 1 \text{ is not irreducible in } \mathbb{Z}/p\mathbb{Z}[x] \\
&\iff x^2 + 1 \text{ has a root in } \mathbb{Z}/p\mathbb{Z} \\
&\iff \text{there is an integer } n \text{ such that } n^2 \equiv -1 \bmod p,
\end{aligned}$$

and we are reduced to verifying that this last condition is equivalent to
$p \equiv 1 \bmod 4$, provided that p is an odd prime.

For this purpose, recall (Theorem IV.6.10) that the multiplicative group G of
Z/pZ is cyclic; let $g \in G$ be a generator of G. Since p is assumed to be odd, p - 1
is an even number, say $2\ell$: thus g has order $|G| = 2\ell$. Also, denote the class of an
integer n mod (p) by $\underline{n}$. Since g is a generator of G, for every integer n there is an
integer m such that $\underline{n} = g^m$.

The class $\underline{-1}$ generates the unique subgroup of order 2 of G (unique by the
classification of subgroups of a cyclic group, Proposition II.6.11); since $g^\ell$ does the
same, we have

$$g^\ell = \underline{-1}.$$

Therefore, with $\underline{n} = g^m$, we see that $n^2 \equiv -1 \bmod p$ if and only if $g^{2m} = g^\ell$, that
is, if and only if

$$2m \equiv \ell \bmod 2\ell.$$

Summarizing, $\exists n \in \mathbb{Z}$ such that $n^2 \equiv -1 \bmod p$ if and only if $\exists m \in \mathbb{Z}$ such that
$2m \equiv \ell \bmod 2\ell$. Now this is clearly the case if and only if $\ell$ is even, that is, if and
only if $(p - 1) = 2\ell$ is a multiple of 4, that is, if and only if $p \equiv 1 \bmod 4$; and we
are done. □

Putting together Lemma 6.9 and Lemma 6.10 gives the following beautiful
number-theoretic statement:

Theorem 6.11 (Fermat). A positive odd prime $p \in \mathbb{Z}$ is a sum of two squares if
and only if $p \equiv 1 \bmod 4$.

Remark 6.12. Lagrange proved (in 1770) that every positive integer is the sum of
four squares. One way to prove this is not unlike the proof given for Theorem 6.11:
it boils down to analyzing the splitting of (integer) primes in a ring; the role of Z[i]
is taken here by a ring of ‘integral quaternions’.

The reader should pause and note that there is no mention of UFDs, Euclidean
domains, complex numbers, cyclic groups, etc., in the statement of Theorem 6.11,
although a large selection of these tools was used in its proof. This is of course the
dream of the generalizer: to set up an abstract machinery making interesting facts
(nearly) evident, facts that would be extremely mysterious or that would require
Fermat-grade cleverness to be understood without that machinery.
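Theorem 6.11 is also pleasant to test experimentally. The Python sketch below searches by brute force for a representation of a prime as a sum of two squares and compares the outcome with the class of the prime modulo 4; the function names are hypothetical, and the search is a naive illustration rather than an efficient algorithm.

```python
from math import isqrt


def is_prime(n):
    """Trial division; adequate for the small range explored here."""
    if n < 2:
        return False
    return all(n % d != 0 for d in range(2, isqrt(n) + 1))


def two_squares(p):
    """Return (a, b) with a*a + b*b == p and 0 <= a <= b, or None if there is none."""
    for a in range(isqrt(p // 2) + 1):
        b_squared = p - a * a
        b = isqrt(b_squared)
        if b * b == b_squared:
            return (a, b)
    return None


if __name__ == "__main__":
    for p in range(3, 60, 2):
        if is_prime(p):
            print(p, p % 4, two_squares(p))
```

By Lemma 6.9, a pair (a, b) returned for a prime p also exhibits the splitting p = (a + bi)(a - bi) in Z[i].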


Exercises 


6.1. Generalize the CRT for two ideals, as follows. Let I, J be ideals in a commutative
ring R; prove that there is an exact sequence of R-modules

$$0 \to I \cap J \to R \xrightarrow{\ \varphi\ } \frac{R}{I} \times \frac{R}{J} \to \frac{R}{I + J} \to 0$$

where $\varphi$ is the natural map. (Also, explain why this implies the first part of
Theorem 6.1, for k = 2.)

6.2. Let R be a commutative ring, and let $a \in R$ be an element such that $a^2 = a$.
Prove that $R \cong R/(a) \times R/(1 - a)$.

Show that the multiplication in R endows the ideal (a) with a ring structure,
with a as the identity²⁶. Prove that $(a) \cong R/(1 - a)$ as rings. Prove that
$R \cong (a) \times (1 - a)$ as rings.

6.3. Recall (Exercise 3.15) that a ring R is called Boolean if $a^2 = a$ for all $a \in R$.
Let R be a finite Boolean ring; prove that $R \cong \mathbb{Z}/2\mathbb{Z} \times \cdots \times \mathbb{Z}/2\mathbb{Z}$.

6.4. Let R be a finite commutative ring, and let p be the smallest prime dividing
|R|. Let $I_1, \dots, I_k$ be proper ideals such that $I_i + I_j = (1)$ for $i \neq j$. Prove that
$k \leq \log_p |R|$. (Hint: Prove $|R|^{k-1} \leq |I_1| \cdots |I_k| \leq (|R|/p)^k$.)

6.5. Show that the map $\mathbb{Z}[x] \to \mathbb{Z}[x]/(2) \times \mathbb{Z}[x]/(x)$ is not surjective.

6.6. ▷ Let R be a UFD.

• Let $a, b \in R$ such that gcd(a, b) = 1. Prove that $(a) \cap (b) = (ab)$.

• Under the hypotheses of Corollary 6.4 (but only assuming that R is a UFD)
prove that the function $\varphi$ is injective.

[§6.1]

6.7. ▷ Find a polynomial $f \in \mathbb{Q}[x]$ such that $f \equiv 1 \bmod (x^2 + 1)$ and $f \equiv x \bmod x^{100}$.
[§6.1]

26 This is an extremely unusual situation. Note that this ring (a) is not a subring of R if
$a \neq 1$ according to Definition III.2.5, since the identities in (a) and R differ.

6.8. ¬ Let $n \in \mathbb{Z}$ be a positive integer and $n = p_1^{a_1} \cdots p_r^{a_r}$ its prime factorization.
By the classification theorem for finite abelian groups (or, in fact, simpler
considerations; cf. Exercise II.4.9)

$$\frac{\mathbb{Z}}{(n)} \cong \frac{\mathbb{Z}}{(p_1^{a_1})} \times \cdots \times \frac{\mathbb{Z}}{(p_r^{a_r})}$$

as abelian groups.

• Use the CRT to prove that this is in fact a ring isomorphism.

• Prove that

$$\left(\frac{\mathbb{Z}}{(n)}\right)^* \cong \left(\frac{\mathbb{Z}}{(p_1^{a_1})}\right)^* \times \cdots \times \left(\frac{\mathbb{Z}}{(p_r^{a_r})}\right)^*$$

(recall that $(\mathbb{Z}/n\mathbb{Z})^*$ denotes the group of units of Z/nZ).

• Recall (Exercise II.6.14) that Euler’s $\phi$-function $\phi(n)$ denotes the number of
positive integers < n that are relatively prime to n. Prove that

$$\phi(n) = p_1^{a_1 - 1}(p_1 - 1) \cdots p_r^{a_r - 1}(p_r - 1).$$

[II.2.15, VII.5.19]

6.9. Let I be an ideal of Z[i]. Prove that Z[i]/I is finite.

6.10. ▷ Let $z, w \in \mathbb{Z}[i]$. Show that if z and w are associates, then N(z) = N(w).
Show that if $w \in (z)$ and N(z) = N(w), then z and w are associates. [§6.2]

6.11. Prove that the irreducible elements in Z[i] are, up to associates: 1 + i; the
integer primes congruent to 3 mod 4; and the elements a ± bi with $a^2 + b^2$ an integer
prime congruent to 1 mod 4.

6.12. ¬ Prove Lemma 6.5 without any ‘visual’ aid. (Hint: Let z = a + bi, w = c + di
be Gaussian integers, with $w \neq 0$. Then $z/w = \frac{ac + bd}{c^2 + d^2} + \frac{bc - ad}{c^2 + d^2}\, i$. Find integers e, f
such that $\left|e - \frac{ac + bd}{c^2 + d^2}\right| \leq \frac{1}{2}$ and $\left|f - \frac{bc - ad}{c^2 + d^2}\right| \leq \frac{1}{2}$, and set q = e + if. Prove that
$\left|\frac{z}{w} - q\right| < 1$. Why does this do the job?) [6.13]

6.13. ¬ Consider the set $\mathbb{Z}[\sqrt{2}] = \{a + b\sqrt{2} \mid a, b \in \mathbb{Z}\} \subseteq \mathbb{C}$.

• Prove that $\mathbb{Z}[\sqrt{2}]$ is a ring, isomorphic to $\mathbb{Z}[t]/(t^2 - 2)$.

• Prove that the function $N : \mathbb{Z}[\sqrt{2}] \to \mathbb{Z}$ defined by $N(a + b\sqrt{2}) = a^2 - 2b^2$ is
multiplicative: N(zw) = N(z)N(w). (Cf. Exercise III.4.10.)

• Prove that $\mathbb{Z}[\sqrt{2}]$ has infinitely many units.

• Prove that $\mathbb{Z}[\sqrt{2}]$ is a Euclidean domain, by using the absolute value of N as
valuation. (Hint: Follow the same steps as in Exercise 6.12.)

[6.14]

6.14. Working as in Exercise 6.13, prove that $\mathbb{Z}[\sqrt{-2}]$ is a Euclidean domain. (Use
the norm $N(a + b\sqrt{-2}) = a^2 + 2b^2$.)

If you are particularly adventurous, prove that $\mathbb{Z}[(1 + \sqrt{d})/2]$ is also a Euclidean
domain²⁷ for d = -3, -7, -11. (You can still use the norm defined by $N(a + b\sqrt{d}) =
a^2 - db^2$; note that this is still an integer on $\mathbb{Z}[(1 + \sqrt{d})/2]$, if $d \equiv 1 \bmod 4$.)

27 You are probably wondering why we switched from $\mathbb{Z}[\sqrt{d}]$ to $\mathbb{Z}[(1 + \sqrt{d})/2]$. These rings
are the ‘ring of integers’ in $\mathbb{Q}(\sqrt{d})$; the form they take depends on the class of d modulo 4. Their
study is a cornerstone of algebraic number theory.






The five values d = -1, -2, resp., -3, -7, -11, are the only ones for which
$\mathbb{Z}[\sqrt{d}]$, resp., $\mathbb{Z}[(1 + \sqrt{d})/2]$, is Euclidean. For the values d = -19, -43, -67, -163,
the ring $\mathbb{Z}[(1 + \sqrt{d})/2]$ is still a PID (cf. §2.4 and Exercise 2.18 for d = -19); the
fact that there are no other negative values for which the ring of integers in $\mathbb{Q}(\sqrt{d})$
is a PID was conjectured by Gauss and only proven by Alan Baker and Harold
Stark around 1966. Also, keep in mind that $\mathbb{Z}[\sqrt{-5}]$ is not even a UFD, as you
have proved all by yourself in Exercise 1.17.

6.15. Give an elementary proof (using modular arithmetic) of the fact that if an
integer n is congruent to 3 modulo 4, then it is not the sum of two squares.

6.16. Prove that if m and n are two integers both of which can be written as sums
of two squares, then mn can also be written as the sum of two squares.

6.17. Let n be a positive integer.

• Prove that n is a sum of two squares if and only if it is the norm of a Gaussian
integer a + bi.

• By factoring $a^2 + b^2$ in Z and a + bi in Z[i], prove that n is a sum of two squares
if and only if each integer prime factor p of n such that $p \equiv 3 \bmod 4$ appears
with an even power in n.

6.18. ¬ One ingredient in the proof of Lagrange’s theorem on four squares is the
following result, which can be proven by completely elementary means. Let p > 0
be an odd prime integer. Then there exists an integer n, 0 < n < p, such that np
may be written as $1 + a^2 + b^2$ for two integers a, b. Prove this result, as follows:

• Prove that the numbers $a^2$, $0 \leq a \leq (p - 1)/2$, represent (p + 1)/2 distinct
congruence classes mod p.

• Prove the same for numbers of the form $-1 - b^2$, $0 \leq b \leq (p - 1)/2$.

• Now conclude, using the pigeon-hole principle.

[6.21]

6.19. ¬ Let $I \subseteq \mathbb{H}$ be the set of quaternions (cf. Exercise III.1.12) of the form
$\frac{a}{2}(1 + i + j + k) + bi + cj + dk$ with $a, b, c, d \in \mathbb{Z}$.

• Prove that I is a (noncommutative) subring of the ring of quaternions.

• Prove that the norm N(w) (Exercise III.2.5) of an integral quaternion $w \in I$ is
an integer and $N(w_1 w_2) = N(w_1)N(w_2)$.

• Prove that I has exactly 24 units: ±1, ±i, ±j, ±k, and $\frac{1}{2}(\pm 1 \pm i \pm j \pm k)$.

• Prove that every $w \in I$ is an associate of an element $a + bi + cj + dk \in I$ with
$a, b, c, d \in \mathbb{Z}$.

The ring I is called the ring of integral quaternions. [6.20, 6.21]

6.20. ¬ Let I be as in Exercise 6.19. Prove that I shares most good properties of
a Euclidean domain, notwithstanding the fact that it is noncommutative.

• Let $z, w \in I$, with $w \neq 0$. Prove that $\exists q, r \in I$ such that z = qw + r, with
N(r) < N(w). (This is a little tricky; don’t feel too bad if you have to cheat
and look it up somewhere.)

• Prove that every left-ideal in I is of the form Iw for some $w \in I$.

• Prove that every $z, w \in I$, not both zero, have a ‘greatest common right-divisor’
d in I, of the form $\alpha z + \beta w$ for $\alpha, \beta \in I$.

[6.21]

6.21. Prove Lagrange’s theorem on four squares. Use notation as in Exercises 6.19
and 6.20.

• Let $z \in I$ and $n \in \mathbb{Z}$. Prove that the greatest common right-divisor of z and n
in I is 1 if and only if (N(z), n) = 1 in Z. (If $\alpha z + \beta n = 1$, then $N(\alpha)N(z) =
N(1 - \beta n) = (1 - \beta n)(1 - \bar\beta n)$, where $\bar\beta$ is obtained by changing the signs of the
coefficients of i, j, k. Expand, and deduce that (N(z), n) | 1.)

• For an odd prime integer p, use Exercise 6.18 to obtain an integral quaternion
z = 1 + ai + bj such that p | N(z). Prove that z and p have a common right-divisor
that is not a unit and not an associate of p.

• Say that $w \in I$ is irreducible if $w = \alpha\beta$ implies that either $\alpha$ or $\beta$ is a unit. Prove
that integer primes are not irreducible in I. Deduce that every positive prime
integer is the norm of some integral quaternion.

• Prove that every positive integer is the norm of some integral quaternion.

• Finally, use the last point of Exercise 6.19 to deduce that every positive integer
may be written as the sum of four perfect squares.

Linear algebra


In several branches of science, ‘algebra’ means ‘linear algebra’: the study of vector
spaces and linear maps, that is, of the category of modules over a ring R in the very
special case in which R is a field (and often one restricts attention to very special
fields, such as ℝ or ℂ).

This will be one of the main themes in this chapter. However, we will stress
that much of what can be done over a field can in fact be done over less special rings.
In fact, we will argue that working out the general theory over integral domains
produces invaluable tools when we are working over fields: the paramount example
being canonical forms for matrices with entries in a field, which will be obtained as
a corollary of the classification of finitely generated modules over a PID.

Throughout the main body of this chapter (but not necessarily in the exercises)
R will denote an integral domain; most of the theory can be extended to arbitrary
commutative¹ rings without major difficulty.

1. Free modules revisited 

1.1. i?-Mod. For generalities on modules, see §111.5. A module over R is an abelian 
group M, endowed with an action of R. The action of r € R on me M is denoted 
rm: there is a notational bias towards fe/t-modules (but the distinction between 
left- and right-modules will be immaterial here as R is commutative). The defining 
axioms of a module tell us that for all r i , r 2 , r e R and to, mi , mi e M, 

• (n + r 2 )m = nm + r 2 m, 

• lro = m and (r 1 r 2 )m = ri(r 2 m), 

• r(mi + m 2 ) = rmi + rm 2 . 

l The hypothesis of commutativity is very convenient, as it allows us to identify the notion of 
left - and Wyht-modules, and the fact that integral domains have fields of fractions simplifies many 
arguments. The reader should keep in mind that many results iu this chapter can be extended to 
more general rings. 

Modules over a ring $R$ form the category $R$-Mod, which we encountered in 
Chapter III. This category reflects subtle and important properties of $R$, and we 
are going to attempt to uncover some of these features. As a first approximation, 
we look at the ‘full subcategory’ (cf. Exercise I.3.8) of $R$-Mod whose objects are 
free modules and in which morphisms are ordinary morphisms in $R$-Mod, that is, 
$R$-linear group homomorphisms. 

The goal of the first few sections of this chapter is to give a very explicit 
description of this subcategory: in the case of finitely generated free modules, this 
can be done by means of matrices with entries in R. Later in the chapter, we 
will see that matrices may also be used to describe important classes of nonfree 
modules. 

1.2. Linear independence and bases. The reader is invited to review the definition 
of free $R$-modules given in §III.6.3: $F^R(S)$ denotes an $R$-module containing 
a given set $S$ and universal with respect to the existence of a set-map from $S$. We 
proved (Claim III.6.3) that the module $R^{\oplus S}$ with ‘one component for each element 
of $S$’ gives an explicit realization of $F^R(S)$. 

The main point of this subsection will be that, for reasonable rings $R$, the 
set $S$ can be recovered$^2$ ‘abstractly’ from the free module $F^R(S)$. For example, it 
will follow easily that $R^m \cong R^n$ if and only if $m = n$; this is the first indication 
that the category of (finitely generated) free modules does indeed admit a simple 
description. 

Our main tool will be the famous concepts of linearly independent subsets and 
bases. It is easy to be imprecise in defining these notions. In order to avoid obvious 
traps, we will give the definitions for indexed sets (cf. §I.2.2), that is, for functions 
$i : I \to M$ from a (nonempty) indexing set $I$ to a given module $M$. The reader 
should think of $i$ as a selection of elements of $M$, allowing for the possibility that 
the elements $m_\alpha \in M$ corresponding to $\alpha \in I$ may not all be distinct. 

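Recall that for all sets $I$ there is a canonical injection $j : I \to F^R(I)$ and any 
function $i : I \to M$ determines a unique $R$-module homomorphism $\varphi : F^R(I) \to M$ 
making the diagram 

$$\begin{array}{ccc} F^R(I) & \xrightarrow{\ \varphi\ } & M \\ {\scriptstyle j}\uparrow & \nearrow{\scriptstyle i} & \\ I & & \end{array}$$

commute (that is, $\varphi \circ j = i$): this is precisely the universal property satisfied by $F^R(I)$. 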

Definition 1.1. We say that the indexed set $i : I \to M$ is linearly independent if 
$\varphi$ is injective; $i$ is linearly dependent otherwise. We say that $i$ generates $M$ if $\varphi$ is 
surjective. ⌟ 

$^2$This is not necessarily the case if $R$ is not commutative. 

Put in a slightly messier, but perhaps more common, way, an indexed set 
$S = \{m_\alpha\}_{\alpha \in I}$ of elements of $M$ is linearly independent if the only vanishing linear 
combination 

$$\sum_{\alpha \in I} r_\alpha m_\alpha = 0$$

is obtained by choosing $r_\alpha = 0$ $\forall \alpha \in I$; $S$ is linearly dependent otherwise. The 
indexed set generates $M$ if every element of $M$ may be written as $\sum_{\alpha \in I} r_\alpha m_\alpha$ for 
some choice of $r_\alpha$. (As a notational warning/reminder, keep in mind that only finite 
sums are defined in a module; therefore, in this context the notation $\sum$ stands for 
a finite sum. When writing $\sum_{\alpha \in I} m_\alpha$ in an ordinary module, one is implicitly 
assuming that $m_\alpha = 0$ for all but finitely many $\alpha \in I$.) 
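
To see the definition at work in a module that is not a vector space, here is a small worked example; the choice of $\mathbb{Z}/6\mathbb{Z}$, viewed as a module over $\mathbb{Z}$, is ours and is meant only as an illustration. 

```latex
6 \cdot [1]_6 = [0]_6 \quad\text{with } 6 \neq 0 \text{ in } \mathbb{Z},
\qquad\text{so } \{[1]_6\} \text{ is linearly dependent over } \mathbb{Z}.
```

The same relation $6 \cdot m = 0$ holds for every element $m$ of this module, so it has no nonempty linearly independent subsets at all, even though $[1]_6$ generates it. 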

Using indexed sets in Definition 1.1 takes care of obvious particular cases such 
as $m$ and $m$ being linearly dependent since $1 \cdot m + (-1) \cdot m = 0$: if $i : I \to M$ is not 
itself injective, then $\varphi$ is surely not injective (by the commutativity of the diagram 
and the injectivity of $j$), so that $i$ is linearly dependent in this case. Because of 
this fact, the datum of a linearly independent $i : I \to M$ amounts to a (special) 
choice of distinct elements of $M$; the temptation to identify the elements of $I$ with 
the corresponding elements of $M$ is historically irresistible and essentially harmless. 
Thus, it is common to speak about linearly dependent/independent subsets of $M$. 
We will conform to this common practice; the reader should parse the statements 
carefully and correct obvious imprecisions that may arise. 

A simple application of Zorn’s lemma shows that every module has maximal 
linearly independent subsets. In fact, it gives the following conveniently stronger 
statement: 

Lemma 1.2. Let $M$ be an $R$-module, and let $S \subseteq M$ be a linearly independent 
subset. Then there exists a maximal linearly independent subset of $M$ containing 
$S$. 

Proof. Consider the family $\mathscr{F}$ of linearly independent subsets of $M$ containing $S$, 
ordered by inclusion. Since $S$ is linearly independent, $\mathscr{F} \neq \emptyset$. By Zorn’s lemma, it 
suffices to verify that every chain in $\mathscr{F}$ has an upper bound. Indeed, the union of 
a chain of linearly independent subsets containing $S$ is also linearly independent, 
because any relation of linear dependence only involves finitely many elements and 
these elements would all belong to one subset in the chain. □ 

Remark 1.3. This statement is in fact known to be equivalent to the axiom of 
choice; therefore, the use of Zorn’s lemma in one form or another cannot be 
bypassed. ⌟ 

Note that the singleton $\{2\} \subseteq \mathbb{Z}$ is a ‘maximal linearly independent subset’ of 
$\mathbb{Z}$, but it does not generate $\mathbb{Z}$. In general this is an additional requirement, leading 
to the definition of a basis. 
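
The two facts behind this example can be checked mechanically. The sketch below is only an illustration; the helper names `dependence_relation` and `in_span_of_two` are hypothetical and not part of the text. It exhibits, for each integer $n$, a nontrivial relation between 2 and $n$, and tests membership in the submodule generated by 2. 

```python
# Illustration only: the singleton {2} inside the Z-module Z.

def dependence_relation(n):
    """Return integers (a, b), not both zero, with a*2 + b*n == 0.

    Such a relation exists for every integer n, so the indexed set (2, n)
    is linearly dependent over Z and {2} cannot be enlarged.
    """
    return (n, -2)  # n*2 + (-2)*n = 0, and the coefficient -2 is nonzero


def in_span_of_two(n):
    """An integer is a Z-multiple of 2 exactly when it is even."""
    return n % 2 == 0


for n in range(-5, 6):
    a, b = dependence_relation(n)
    assert (a, b) != (0, 0) and a * 2 + b * n == 0

print(in_span_of_two(1))  # False: 1 is not in the submodule generated by 2
```

So $\{2\}$ is maximal among linearly independent subsets, yet the submodule it generates misses 1. 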

Definition 1.4. An indexed set $B \to M$ is a basis if it generates $M$ and is linearly 
independent. ⌟ 

Again, one often talks of bases as ‘subsets’ of the module $M$; since the images 
of all $b \in B$ are necessarily distinct elements, this is rather harmless. When $B$ is 
finite (or at any rate countable), the extra information carried by the indexed set 
can be encoded by ordering the elements of $B$; to emphasize this, one talks about 
ordered bases. 

Bases are necessarily maximal linearly independent subsets and minimal generating 
subsets; this holds over every ring. What will make modules over a field, i.e., 
vector spaces, so special is that the converse will also hold. 

In any case, only very special modules admit bases: 

Lemma 1.5. An $R$-module $M$ is free if and only if it admits a basis. In fact, 
$B \subseteq M$ is a basis if and only if the natural homomorphism $R^{\oplus B} \to M$ is an 
isomorphism. 

Proof. This is immediate from Definition 1.1: if $B \subseteq M$ is linearly independent 
and generates $M$, then the corresponding homomorphism $R^{\oplus B} \to M$ is injective 
and surjective. Conversely, if $\varphi : R^{\oplus B} \to M$ is an isomorphism, then $B$ is identified 
with a subset of $M$ which generates it (because $\varphi$ is surjective) and is linearly 
independent (because $\varphi$ is injective). □ 

By Lemma 1.5, the choice of a basis $B$ of a free module $M$ amounts to the 
choice of an isomorphism $R^{\oplus B} \cong M$; this will be an important observation in due 
time (e.g., in §2.2). 

Once a basis $B$ has been chosen for a free module $M$, then every element $m \in M$ 
can be written uniquely as a linear combination 

$$m = \sum_{b \in B} r_b\, b$$

with $r_b \in R$. As always, remember that all but finitely many of the coefficients $r_b$ 
are 0 in any such expression. 

1.3. Vector spaces. Lemma 1.5 is all that is needed to prove the fundamental 
observation that modules over a field are necessarily free. Recall that modules over 
a field $k$ are called $k$-vector spaces (Example III.5.5). Elements of a vector space 
are called (surprise, surprise) vectors, while elements of the field are called$^3$ scalars. 

By Lemma 1.5, proving that vector spaces are free modules amounts to proving 
that they admit bases; Lemma 1.2 reduces the matter to the following: 

Lemma 1.6. Let $R = k$ be a field, and let $V$ be a $k$-vector space. Let $B$ be a 
maximal linearly independent subset of $V$; then $B$ is a basis of $V$. 

Again, this should be contrasted with the situation over rings: {2} is a maximal 
linearly independent subset of Z, but it is not a basis. 

Proof. Let $v \in V$, $v \notin B$. Then $B \cup \{v\}$ is not linearly independent, by the 
maximality of $B$; therefore, there exist $c_0, \dots, c_t \in k$ and (distinct) $b_1, \dots, b_t \in B$ 
such that 

$$c_0 v + c_1 b_1 + \cdots + c_t b_t = 0,$$

with not all $c_0, \dots, c_t$ equal to 0. Now, $c_0 \neq 0$: otherwise we would get a linear 
dependence relation among elements of $B$. Since $k$ is a field, $c_0$ is a unit; but then 

$$v = (-c_0^{-1} c_1)\, b_1 + \cdots + (-c_0^{-1} c_t)\, b_t,$$

proving that $v$ is in the span of $B$. It follows that $B$ generates $V$, as needed. □ 

$^3$This terminology is also often used for free modules over any ring. 

Summarizing, 

Proposition 1.7. Let $R = k$ be a field, and let $V$ be a $k$-vector space. Let $S$ 
be a linearly independent set of vectors of $V$. Then there exists a basis $B$ of $V$ 
containing $S$. 

In particular, $V$ is free as a $k$-module. 

Proof. Put Lemma 1.2, Lemma 1.5, and Lemma 1.6 together. □ 
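
In finite dimension the appeal to Zorn’s lemma can be replaced by a finite greedy search. The following is a minimal sketch under our own assumptions: the field is $\mathbb{Q}$, the ambient space is $\mathbb{Q}^n$, and the function names `rank`, `is_independent`, and `extend_to_basis` are hypothetical. It enlarges a linearly independent list by standard basis vectors until it becomes a basis. 

```python
from fractions import Fraction


def rank(vectors):
    """Rank over Q of a list of vectors, computed by Gaussian elimination."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def is_independent(vectors):
    """A finite list of vectors is linearly independent iff its rank equals its length."""
    return rank(vectors) == len(vectors)


def extend_to_basis(S, n):
    """Greedily enlarge the independent list S to a basis of Q^n using e_1, ..., e_n."""
    assert is_independent(S)
    basis = list(S)
    for j in range(n):
        e_j = [1 if i == j else 0 for i in range(n)]
        if is_independent(basis + [e_j]):
            basis.append(e_j)
    return basis


B = extend_to_basis([[1, 1, 0]], 3)
print(B)  # [[1, 1, 0], [1, 0, 0], [0, 0, 1]]
```

The resulting list is a maximal linearly independent subset of $\mathbb{Q}^3$ containing $(1,1,0)$, hence a basis by Lemma 1.6. 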

We could also contemplate this situation from the ‘mirror’ point of view of 
generating sets: 

Lemma 1.8. Let $R = k$ be a field, and let $V$ be a $k$-vector space. Let $B$ be a 
minimal generating set for $V$; then $B$ is a basis of $V$. 

Every set generating V contains a basis of V. 

Proof. Exercise 1.6. □ 

Lemma 1.8 also fails on more general rings (Exercise 1.5). To reiterate, over 
fields (but not over general rings) a subset $B$ of a vector space is a basis $\iff$ it is 
a maximal linearly independent subset $\iff$ it is a minimal generating set. 

1.4. Recovering $B$ from $F^R(B)$. We are ready for the ‘reconstruction’ of a set 
$B$ (up to a bijection!) from the corresponding free module $F^R(B)$. This is the 
result justifying the notion of dimension of a vector space, or, more generally, the 
rank of a free module. Again, we prove a somewhat stronger statement. 

Proposition 1.9. Let $R$ be an integral domain, and let $M$ be a free $R$-module. 
Let $B$ be a maximal linearly independent subset of $M$, and let $S$ be a linearly 
independent subset. Then$^4$ $|S| \le |B|$. 

In particular, any two maximal linearly independent subsets of a free module 
over an integral domain have the same cardinality. 

Proof. By taking fields of fractions, the general case over an integral domain is 
easily reduced to the case of vector spaces over a field; see Exercise 1.7. We may 
then assume that $R = k$ is a field and $M = V$ is a $k$-vector space. 

We have to prove that there is an injective map $j : S \hookrightarrow B$, and this can be 
done by an inductive process, replacing elements of $B$ by elements of $S$ ‘one-by-one’. 
For this, let $\le$ be a well-ordering on $S$, let $v \in S$, and assume we have defined 
$j$ for all $w \in S$ with $w < v$. Let $B'$ be the set obtained from $B$ by replacing all 
$j(w)$ by $w$, for $w < v$, and assume (inductively) that $B'$ is still a maximal linearly 
independent subset of $V$. Then we claim that $j(v) \in B$ may be defined so that 

• $j(v) \neq j(w)$ for all $w < v$; 

• the set $B''$ obtained from $B'$ by replacing $j(v)$ by $v$ is still a maximal linearly 
independent subset. 

(Transfinite) induction (Claim V.3.2) then shows that $j$ is defined and injective on 
$S$, as needed. 

$^4$Here, $|A|$ denotes the cardinality of the set $A$, a notion with which the reader is hopefully 
familiar. The reader will not lose much by only considering the case in which $B$, $S$ are finite sets; 
but the fact is true for ‘infinite-dimensional spaces’ as well, as the argument shows. 

To verify our claim, since $B'$ is a maximal linearly independent set, $B' \cup \{v\}$ 
is linearly dependent (as an indexed set$^5$), so that there exists a linear dependence 
relation 

$$(*)\qquad c_0 v + c_1 b_1 + \cdots + c_t b_t = 0$$

with not all $c_i$ equal to zero and the $b_i$ distinct in $B'$. Necessarily $c_0 \neq 0$ (because 
$B'$ is linearly independent); also, necessarily not all the $b_i$ with $c_i \neq 0$ are elements 
of $S$ (because $S$ is linearly independent). Without loss of generality we may then 
assume that $c_1 \neq 0$ and $b_1 \in B' \smallsetminus S$. This guarantees that $b_1 \neq j(w)$ for all $w < v$; 
we set $j(v) = b_1$. 

All that is left now is the verification that the set $B''$ obtained by replacing $b_1$ 
by $v$ in $B'$ is a maximal linearly independent subset. But by using (*) to write 

$$v = -c_0^{-1} c_1 b_1 - \cdots - c_0^{-1} c_t b_t,$$

this is an easy consequence of the fact that $B'$ is a maximal linearly independent 
subset. Further details are left to the reader. □ 

Example 1.10. An uncountable subset of $\mathbb{C}[x]$ is necessarily linearly dependent. 
Indeed, $\mathbb{C}[x]$ has a countable basis over $\mathbb{C}$: for example, $\{1, x, x^2, x^3, \dots\}$. ⌟ 

Corollary 1.11. Let $R$ be an integral domain, and let $A$, $B$ be sets. Then 
$F^R(A) \cong F^R(B) \iff$ there is a bijection $A \cong B$. 

Proof. Exercise 1.8. □ 

Remark 1.12. We have learned in Lemma 1.2 that we can ‘complete’ every lin- 
early independent subset S to a maximal one. The argument used in the proof of 
Proposition 1.9 shows that we can in fact do this by borrowing elements of a given 
maximal linearly independent subset. ⌟ 

Remark 1.13. As a particular case of Corollary 1.11, we see that if $R$ is an integral 
domain, then $R^m \cong R^n$ if and only if $m = n$. This says that integral domains satisfy 
the ‘IBN (Invariant Basis Number) property’. 

Strange as it may seem, this obvious-looking fact does not hold over arbitrary 
rings: for example, the ring of endomorphisms of an infinite-dimensional vector 
space does not satisfy the IBN property. On the other hand, integral domains are a 
bit of an overshoot: all commutative rings satisfy the IBN property (Exercise 1.11). 

$^5$This allows for the possibility that $v \in B'$ ‘already’. In this case, the reader can check that 
the process we are about to describe gives $j(v) = v$. 

One way to think about this is that the category of finitely generated free 
modules over (say) an integral domain is ‘classified’ by $\mathbb{Z}^{\ge 0}$: up to isomorphisms, 
there is exactly one finitely generated free module for any nonnegative integer. The 
task of describing this category then amounts to describing the homomorphisms 
between objects corresponding to two given nonnegative integers; this will be done 
in §2.1. ⌟ 

As a byproduct of the result of Proposition 1.9, we can now give the following 
important definition. 

Definition 1.14. Let $R$ be an integral domain. The rank of a free $R$-module $M$, 
denoted $\mathrm{rk}_R M$, is the cardinality of a maximal linearly independent subset of $M$. 
The rank of a vector space is called the dimension, denoted $\dim_k V$. ⌟ 

This definition will in fact be adopted for more general finitely generated mod- 
ules when the time comes, in §5.3. 

Finite-dimensional vector spaces over a fixed field form a category. Since vector 
spaces are free modules (Proposition 1.7), Corollary 1.11 implies that two finite- 
dimensional vector spaces are isomorphic if and only if they have the same dimen- 
sion. 

The subscripts $R$, $k$ are often omitted, if the context permits. But note that, for 
example, viewing the complex numbers as a real vector space, we have $\dim_{\mathbb{R}} \mathbb{C} = 2$, 
while $\dim_{\mathbb{C}} \mathbb{C} = 1$. So some care is warranted. 
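
As a quick check of the two dimensions just quoted (a routine verification, spelled out only for convenience): 

```latex
\mathbb{C} = \{\, a \cdot 1 + b \cdot i \;:\; a, b \in \mathbb{R} \,\}
\ \text{with $a$, $b$ uniquely determined}
\ \Longrightarrow\ \{1, i\} \text{ is a basis over } \mathbb{R},\quad \dim_{\mathbb{R}} \mathbb{C} = 2;
\qquad
z = z \cdot 1 \ \ (z \in \mathbb{C})
\ \Longrightarrow\ \{1\} \text{ is a basis over } \mathbb{C},\quad \dim_{\mathbb{C}} \mathbb{C} = 1.
```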

Proposition 1.9 tells us that every linearly independent subset $S$ of a free 
$R$-module $M$ must have cardinality at most $\mathrm{rk}_R M$. Similarly, every generating set 
must have cardinality at least the rank. Indeed, 

Proposition 1.15. Let $R$ be an integral domain, and let $M$ be a free $R$-module; 
assume that $M$ is generated by $S$: $M = \langle S \rangle$. Then $S$ contains a maximal linearly 
independent subset of $M$. 


Proof. By Exercise 1.7 we may assume that R is a field and M = V is a vector 
space. Use Zorn’s lemma to obtain a linearly independent subset $B \subseteq S$ which is 
maximal among subsets of S. Arguing as in the proof of Lemma 1.6 shows that 
S is in the span of B, and it follows that B generates V. Thus B is a basis, and 
hence a maximal linearly independent subset of V, as needed. □ 

Remark 1.16. We have used again the trick of switching from an integral domain 
to its field of fractions. The second part of the argument would not work over 
an arbitrary integral domain, since maximal linearly independent subsets over an 
integral domain are not generating sets in general. 

Another standard method to reduce questions about modules over arbitrary 
commutative rings to vector spaces is to mod out by a maximal ideal; cf. Exercise 1.9 
and following. ⌟ 

Exercises 


1.1. ¬ Prove that $\mathbb{R}$ and $\mathbb{C}$ are isomorphic as $\mathbb{Q}$-vector spaces. (In particular, $(\mathbb{R}, +)$ 
and $(\mathbb{C}, +)$ are isomorphic as groups.) [II.4.4] 

1.2. ¬ Prove that the sets listed in Exercise III.1.4 are all $\mathbb{R}$-vector spaces, and 
compute their dimensions. [1.3] 

1.3. Prove that $\mathfrak{su}_2(\mathbb{C}) \cong \mathfrak{so}_3(\mathbb{R})$ as $\mathbb{R}$-vector spaces. (This is immediate, and not 
particularly interesting, from the dimension computation of Exercise 1.2. However, 
these two spaces may be viewed as the tangent spaces to $\mathrm{SU}_2(\mathbb{C})$, resp., $\mathrm{SO}_3(\mathbb{R})$, 
at $I$; the surjective homomorphism $\mathrm{SU}_2(\mathbb{C}) \to \mathrm{SO}_3(\mathbb{R})$ you constructed in 
Exercise II.8.9 induces a more ‘meaningful’ isomorphism $\mathfrak{su}_2(\mathbb{C}) \to \mathfrak{so}_3(\mathbb{R})$. Can you 
find this isomorphism?) 

1.4. Let $V$ be a vector space over a field $k$. A Lie bracket on $V$ is an operation 
$[\cdot,\cdot] : V \times V \to V$ such that 

• $(\forall u, v, w \in V)$, $(\forall a, b \in k)$, 

$$[au + bv, w] = a[u, w] + b[v, w], \qquad [w, au + bv] = a[w, u] + b[w, v],$$

• $(\forall v \in V)$, $[v, v] = 0$, 

• and $(\forall u, v, w \in V)$, $[[u, v], w] + [[v, w], u] + [[w, u], v] = 0$. 

(This axiom is called the Jacobi identity.) A vector space endowed with a Lie 
bracket is called a Lie algebra. Define a category of Lie algebras over a given field. 
Prove the following: 

• In a Lie algebra $V$, $[u, v] = -[v, u]$ for all $u, v \in V$. 

• If $V$ is a $k$-algebra (Definition III.5.7), then $[v, w] := vw - wv$ defines a Lie 
bracket on $V$, so that $V$ is a Lie algebra in a natural way. 

• This makes $\mathfrak{gl}_n(\mathbb{R})$, $\mathfrak{gl}_n(\mathbb{C})$ into Lie algebras. The sets listed in Exercise III.1.4 
are all Lie algebras, with respect to a Lie bracket induced from $\mathfrak{gl}$. 

• $\mathfrak{su}_2(\mathbb{C})$ and $\mathfrak{so}_3(\mathbb{R})$ are isomorphic as Lie algebras over $\mathbb{R}$. 

1.5. ▷ Let $R$ be an integral domain. Prove or disprove the following: 

• Every linearly independent subset of a free $R$-module may be completed to a 
basis. 

• Every generating subset of a free $R$-module contains a basis. 

[§1.3] 

1.6. ▷ Prove Lemma 1.8. [§1.3] 

1.7. ▷ Let $R$ be an integral domain, and let $M = R^{\oplus A}$ be a free $R$-module. Let $K$ 
be the field of fractions of $R$, and view $M$ as a subset of $V = K^{\oplus A}$ in the evident 
way. Prove that a subset $S \subseteq M$ is linearly independent in $M$ (over $R$) if and 
only if it is linearly independent in $V$ (over $K$). Conclude that the rank of $M$ (as 
an $R$-module) equals the dimension of $V$ (as a $K$-vector space). Prove that if $S$ 
generates $M$ over $R$, then it generates $V$ over $K$. Is the converse true? [§1.4] 

1.8. ▷ Deduce Corollary 1.11 from Proposition 1.9. [§1.4] 

1.9. ▷ Let $R$ be a commutative ring, and let $M$ be an $R$-module. Let $\mathfrak{m}$ be a 
maximal ideal in $R$, such that $\mathfrak{m} M = 0$ (that is, $rm = 0$ for all $r \in \mathfrak{m}$, $m \in M$). 
Define in a natural way a vector space structure over $R/\mathfrak{m}$ on $M$. [§1.4] 

1.10. ¬ Let $R$ be a commutative ring, and let $F = R^{\oplus B}$ be a free module over $R$. 
Let $\mathfrak{m}$ be a maximal ideal of $R$, and let $k = R/\mathfrak{m}$ be the quotient field. Prove that 
$F/\mathfrak{m}F \cong k^{\oplus B}$ as $k$-vector spaces. [1.11] 

1.11. ▷ Prove that commutative rings satisfy the IBN property. (Use 
Proposition V.3.5 and Exercise 1.10.) [§1.4] 

1.12. Let $V$ be a vector space over a field $k$, and let $R = \mathrm{End}_{k\text{-Vect}}(V)$ be its ring of 
endomorphisms (cf. Exercise III.5.9). (Note that $R$ is not commutative in general.) 

• Prove that $\mathrm{End}_{k\text{-Vect}}(V \oplus V) \cong R^4$ as an $R$-module. 

• Prove that $R$ does not satisfy the IBN property if $V = k^{\oplus \mathbb{N}}$. 

(Note that $V \cong V \oplus V$ if $V = k^{\oplus \mathbb{N}}$.) 

1.13. ¬ Let $A$ be an abelian group such that $\mathrm{End}_{\mathrm{Ab}}(A)$ is a field of characteristic 0. 
Prove that $A \cong \mathbb{Q}$. (Hint: Prove that $A$ carries a $\mathbb{Q}$-vector space structure; what 
must its dimension be?) [IX.2.13] 

1.14. ¬ Let $V$ be a finite-dimensional vector space, and let $\varphi : V \to V$ be 
a homomorphism of vector spaces. Prove that there is an integer $n$ such that 
$\ker \varphi^{n+1} = \ker \varphi^n$ and $\operatorname{im} \varphi^{n+1} = \operatorname{im} \varphi^n$. 

Show that both claims may fail if $V$ has infinite dimension. [1.15] 

1.15. Consider the question of Exercise 1.14 for free $R$-modules $F$ of finite rank, 
where $R$ is an integral domain that is not a field. Let $\varphi : F \to F$ be an $R$-module 
homomorphism. 

What property of $R$ immediately guarantees that $\ker \varphi^{n+1} = \ker \varphi^n$ for $n \gg 0$? 
Show that there is an $R$-module homomorphism $\varphi : F \to F$ such that $\operatorname{im} \varphi^{n+1} \subsetneq 
\operatorname{im} \varphi^n$ for all $n > 0$. 

1.16. ¬ Let $M$ be a module over a ring $R$. A finite composition series for $M$ (if it 
exists) is a decreasing sequence of submodules 

$$M = M_0 \supsetneq M_1 \supsetneq \cdots \supsetneq M_m = (0)$$

in which all quotients $M_i/M_{i+1}$ are simple $R$-modules (cf. Exercise III.5.4). The 
length of a series is the number of strict inclusions. The composition factors are 
the quotients $M_i/M_{i+1}$. 

Prove a Jordan-Hölder theorem for modules: any two finite composition series 
of a module have the same length and the same (multiset of) composition factors. 
(Adapt the proof of Theorem IV.3.2.) 

We say that $M$ has length $m$ if $M$ admits a finite composition series of length $m$. 
This notion is well-defined as a consequence of the result you just proved. [1.17, 
1.18, 3.20, 7.15] 

1.17. Prove that a $k$-vector space $V$ has finite length as a module over $k$ (cf. 
Exercise 1.16) if and only if it is finite-dimensional and that in this case its length 
equals its dimension. 

1.18. Let $M$ be an $R$-module of finite length $m$ (cf. Exercise 1.16). 

• Prove that every submodule $N$ of $M$ has finite length $n \le m$. (Adapt the proof 
of Proposition IV.3.4.) 

• Prove that the ‘descending chain condition’ (d.c.c.) for submodules holds in $M$. 
(Use induction on the length.) 

• Prove that if $R$ is an integral domain that is not a field and $F$ is a free $R$-module, 
then $F$ has finite length if and only if it is the 0-module. 

1.19. Let $k$ be a field, and let $f(x) \in k[x]$ be any polynomial. Prove that there 
exists a multiple of $f(x)$ in which all exponents of nonzero monomials are prime 
integers. (Example: for $f(x) = 1 + x^5 + x^6$, 

$$(1 + x^5 + x^6)(2x^2 - x^3 + x^5 - x^8 + x^9 - x^{10} + x^{11}) = 2x^2 - x^3 + x^5 + 2x^7 + 2x^{11} - x^{13} + x^{17}.)$$

(Hint: $k[x]/(f(x))$ is a finite-dimensional $k$-vector space.) 
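
The example in Exercise 1.19 can be confirmed by direct multiplication. The sketch below does not solve the exercise; it only checks the displayed identity, storing polynomials as hypothetical `{exponent: coefficient}` dictionaries with integer coefficients. The exponents are read off the example as stated above. 

```python
def poly_mul(f, g):
    """Multiply polynomials stored as {exponent: coefficient} dictionaries."""
    product = {}
    for i, a in f.items():
        for j, b in g.items():
            product[i + j] = product.get(i + j, 0) + a * b
    # Drop the monomials whose coefficients cancelled.
    return {e: c for e, c in product.items() if c != 0}


def is_prime(n):
    """Trial division; adequate for the small exponents appearing here."""
    return n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))


f = {0: 1, 5: 1, 6: 1}
g = {2: 2, 3: -1, 5: 1, 8: -1, 9: 1, 10: -1, 11: 1}
h = poly_mul(f, g)

print(sorted(h.items()))
# [(2, 2), (3, -1), (5, 1), (7, 2), (11, 2), (13, -1), (17, 1)]
print(all(is_prime(e) for e in h))  # True
```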

1.20. ¬ Let $A$, $B$ be sets. Prove that the free groups $F(A)$, $F(B)$ (§II.5) are 
isomorphic if and only if there is a bijection $A \cong B$. (For the interesting direction: 
remember that $F(A) \cong F(B) \implies F^{ab}(A) \cong F^{ab}(B)$, by Exercise II.7.12.) This 
extends the result of Exercise II.7.13 to possibly infinite sets $A$, $B$. [II.5.10] 


2. Homomorphisms of free modules, I 

2.1. Matrices. As pointed out in §1, Corollary 1.11 amounts to a classification 
of free modules over an integral domain: if $F$ is a free module, then there is a 
set $A$ (determined up to a bijection) such that $F \cong R^{\oplus A}$. The choice of such an 
isomorphism is precisely the same thing as the choice of a basis of $F$ (Lemma 1.5). 

This is simultaneously good and bad news. The good news is that if $F_1$, $F_2$ are 
free, then it must be possible to ‘understand’$^6$ 

$$\mathrm{Hom}_R(F_1, F_2)$$

entirely in terms of the corresponding sets $A_1$, $A_2$ such that $F_1 \cong R^{\oplus A_1}$, $F_2 \cong R^{\oplus A_2}$. 
That is, this set of morphisms in $R$-Mod may be identified with 

$$\mathrm{Hom}_R(R^{\oplus A_1}, R^{\oplus A_2}).$$

The bad news is that this identification $\mathrm{Hom}_R(F_1, F_2) \cong \mathrm{Hom}_R(R^{\oplus A_1}, R^{\oplus A_2})$ 
is not ‘canonical’, because it depends on the chosen isomorphisms $F_1 \cong R^{\oplus A_1}$, 
$F_2 \cong R^{\oplus A_2}$, that is, on the choice of bases. We will therefore have to do some work 
to deal with this ambiguity (in §2.2). 

$^6$For notational simplicity we will denote $\mathrm{Hom}_{R\text{-Mod}}(M, N)$ by $\mathrm{Hom}_R(M, N)$; this is very 
common, and no confusion is likely. 

The point of this subsection is to deal with the good news, that is, describe 
$\mathrm{Hom}_R(R^{\oplus A_1}, R^{\oplus A_2})$. This can be done in a particularly convenient way when the 
free modules are finitely generated, the case we are going to analyze more carefully. 
Also recall (from §III.5.2) that one of the good features of the category $R$-Mod 
is that the set of morphisms $\mathrm{Hom}_R(M, N)$ between two $R$-modules is itself an 
$R$-module, in a natural way. The task is then to describe as explicitly as possible$^7$ 

$$\mathrm{Hom}_R(R^n, R^m)$$

as an $R$-module, for every choice of $m, n \in \mathbb{Z}^{\ge 0}$. 

This will be done by means of matrices with entries in $R$. We trust that the 
reader is familiar with the general notion of an $m \times n$ matrix; we have occasionally 
used matrices in examples given in previous chapters, and they have shown up 
in several exercises. An $m \times n$ matrix with entries in $R$ is simply a choice of $mn$ 
elements of $R$. It is common to arrange these elements as an array consisting of $m$ 
rows and $n$ columns: 

$$(r_{ij})_{\substack{i=1,\dots,m \\ j=1,\dots,n}} = \begin{pmatrix} r_{11} & r_{12} & \cdots & r_{1n} \\ r_{21} & r_{22} & \cdots & r_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ r_{m1} & r_{m2} & \cdots & r_{mn} \end{pmatrix}$$

with $r_{ij} \in R$. For any given $m$, $n$ the set $M_{m,n}(R)$ of $m \times n$ matrices with entries 
in $R$ is an abelian group under entrywise addition$^8$ 

$$(a_{ij}) + (b_{ij}) := (a_{ij} + b_{ij})$$

(cf. Example II.1.5); this is in fact an $R$-module, under the action 

$$r(a_{ij}) := (r a_{ij})$$

for $r \in R$. From this point of view, the set of $m \times n$ matrices is simply a copy of 
the $R$-module $R^{mn}$. 

However, there are other interesting operations on these sets. If $A = (a_{ik})$ is 
an $m \times p$ matrix and $B = (b_{kj})$ is a $p \times n$ matrix, then one may define the product 
of $A$ and $B$ as 

$$A \cdot B = (a_{ik}) \cdot (b_{kj}) := \Bigl(\sum_{k=1}^{p} a_{ik} b_{kj}\Bigr);$$

this operation is (clearly) distributive with respect to addition and compatible with 
the $R$-module structure. It is just a tiny bit messier to check that it is associative, 
in the sense that if $A$, $B$, $C$ are matrices of size, respectively, $m \times p$, $p \times q$, and 
$q \times n$, then 

$$(AB)C = A(BC).$$


$^7$We are now writing $R^n$ for what was denoted $R^{\oplus n}$ in previous chapters. This is a slight 
abuse of language, but it makes for easier typesetting. 

$^8$Note that we will be dropping the extra subscripts giving the range of the indices and will 
write $(r_{ij})$ rather than $(r_{ij})_{i=1,\dots,m;\ j=1,\dots,n}$: these subscripts are an eyesore, and the context usually 
makes the information redundant. 

The reader should have no trouble reconstructing the proof of the associativity 
of the matrix product (Exercise 2.2). 
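
For readers who like to experiment before proving, the product formula translates directly into code. The following is a minimal sketch with integer entries; the representation of a matrix as a list of rows and the name `mat_mul` are our own choices. It checks associativity on one example, which of course does not replace the proof. 

```python
def mat_mul(A, B):
    """Product of an m x p matrix A and a p x n matrix B, given as lists of rows."""
    p = len(B)
    assert all(len(row) == p for row in A), "inner dimensions must agree"
    n = len(B[0])
    # Entry (i, j) of the product is the sum over k of a_ik * b_kj.
    return [[sum(A[i][k] * B[k][j] for k in range(p)) for j in range(n)]
            for i in range(len(A))]


A = [[1, 2], [0, 1], [3, 0]]   # 3 x 2
B = [[1, 0, 2], [4, 1, 0]]     # 2 x 3
C = [[1], [0], [5]]            # 3 x 1

print(mat_mul(A, B))                                            # [[9, 2, 2], [4, 1, 0], [3, 0, 6]]
print(mat_mul(mat_mul(A, B), C) == mat_mul(A, mat_mul(B, C)))   # True
```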

In particular, we have a binary operation on the set $M_n(R)$ of square $n \times n$ 
matrices, and this operation is associative, distributive w.r.t. $+$, and admits the 
identity element 

$$I_n = \begin{pmatrix} 1 & 0 & \cdots & 0 \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 1 \end{pmatrix}$$

(the identity matrix, denoted $I_n$). That is, $M_n(R)$ is a ring (Example III.1.6) and 
in fact an $R$-algebra (Exercise 2.3). Except in very special cases (such as $n = 1$) 
this ring is not commutative. 

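Matrices of type $n \times 1$ are called column $n$-vectors; matrices of type $1 \times m$ 
are called row $m$-vectors, for evident reasons. An element of a free $R$-module $R^n$ 
is nothing but the choice of $n$ elements of $R$, and we can arrange these elements 
into row or column vectors if we please; the standard choice is to represent them 
as column vectors: 

$$\mathbf{v} = \begin{pmatrix} v_1 \\ \vdots \\ v_n \end{pmatrix}.$$

We will denote by $\mathbf{e}_i$ the elements of the ‘standard basis’ of $R^n$: 

$$\mathbf{e}_1 = \begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix},\quad \mathbf{e}_2 = \begin{pmatrix} 0 \\ 1 \\ \vdots \\ 0 \end{pmatrix},\quad \dots,\quad \mathbf{e}_n = \begin{pmatrix} 0 \\ 0 \\ \vdots \\ 1 \end{pmatrix},$$

so that 

$$\mathbf{v} = \begin{pmatrix} v_1 \\ \vdots \\ v_n \end{pmatrix} = \sum_{j=1}^{n} v_j \mathbf{e}_j.$$

The elements $v_j \in R$ are the ‘components’ of $\mathbf{v}$. 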

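Interpreting elements of $R^n$ as column vectors, we can act on $R^n$ with an $m \times n$ 
matrix, by left-multiplication: if $A = (a_{ij})$ is an $m \times n$ matrix and $\mathbf{v} \in R^n$ is a 
column vector, the product is a column vector in $R^m$: 

$$A \cdot \mathbf{v} = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix} \cdot \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_n \end{pmatrix} = \begin{pmatrix} a_{11} v_1 + a_{12} v_2 + \cdots + a_{1n} v_n \\ a_{21} v_1 + a_{22} v_2 + \cdots + a_{2n} v_n \\ \vdots \\ a_{m1} v_1 + a_{m2} v_2 + \cdots + a_{mn} v_n \end{pmatrix}.$$
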

Lemma 2.1. For all $m \times n$ matrices $A$ with entries in $R$: 

• The function $\varphi : R^n \to R^m$ defined by $\varphi(\mathbf{v}) = A \cdot \mathbf{v}$ is a homomorphism of 
$R$-modules. 

• Every $R$-module homomorphism $R^n \to R^m$ is determined in this way by a 
unique $m \times n$ matrix. 

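Proof. The first point follows immediately from the elementary properties of 
matrix multiplication recalled above: $\forall r, s \in R$, $\forall \mathbf{v}, \mathbf{w} \in R^n$ 

$$\varphi(r\mathbf{v} + s\mathbf{w}) = A \cdot (r\mathbf{v} + s\mathbf{w}) = rA \cdot \mathbf{v} + sA \cdot \mathbf{w} = r\varphi(\mathbf{v}) + s\varphi(\mathbf{w})$$

as needed. 

For the second point, let $\varphi : R^n \to R^m$ be a homomorphism of $R$-modules; let 
$a_{ij}$ be the $i$-th component of $\varphi(\mathbf{e}_j)$, so that 

$$\varphi(\mathbf{e}_j) = \begin{pmatrix} a_{1j} \\ a_{2j} \\ \vdots \\ a_{mj} \end{pmatrix}.$$

Then $A = (a_{ij})$ is an $m \times n$ matrix, and $\forall \mathbf{v} \in R^n$ with components $v_j$, 

$$A \cdot \mathbf{v} = \begin{pmatrix} a_{11} v_1 + a_{12} v_2 + \cdots + a_{1n} v_n \\ a_{21} v_1 + a_{22} v_2 + \cdots + a_{2n} v_n \\ \vdots \\ a_{m1} v_1 + a_{m2} v_2 + \cdots + a_{mn} v_n \end{pmatrix} = \sum_{j=1}^{n} \begin{pmatrix} a_{1j} v_j \\ a_{2j} v_j \\ \vdots \\ a_{mj} v_j \end{pmatrix} = \sum_{j=1}^{n} v_j \varphi(\mathbf{e}_j) = \varphi\Bigl(\sum_{j=1}^{n} v_j \mathbf{e}_j\Bigr) = \varphi(\mathbf{v}),$$

as needed. The homomorphism induced by a nonzero matrix is manifestly nontrivial; 
this implies that the matrix $A$ associated to a homomorphism $\varphi$ is uniquely 
determined by $\varphi$. □ 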

The reader should attempt to remember the recipe associating a matrix $A$ to 
a homomorphism $\varphi$ as in the proof of Lemma 2.1: the $j$-th column of $A$ is simply 
the column vector $\varphi(\mathbf{e}_j)$. The collection of these vectors determines $\varphi$, since a 
homomorphism is determined by its action on a set of generators. 
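
The recipe is easy to mechanize. The following is a minimal sketch, assuming $R = \mathbb{Z}$ and $n \ge 1$; the names `matrix_of`, `apply`, and the sample homomorphism `phi` are hypothetical. It builds the matrix column by column from the images of the standard basis vectors and checks that left-multiplication recovers the map on a sample vector. 

```python
def matrix_of(phi, n):
    """Matrix (list of rows) of a homomorphism phi: R^n -> R^m; column j is phi(e_j)."""
    columns = []
    for j in range(n):
        e_j = [1 if i == j else 0 for i in range(n)]
        columns.append(phi(e_j))
    m = len(columns[0])
    return [[columns[j][i] for j in range(n)] for i in range(m)]


def apply(A, v):
    """Left-multiplication of the column vector v by the matrix A."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def phi(v):
    """A sample homomorphism Z^3 -> Z^2, chosen only for illustration."""
    x, y, z = v
    return [x + 2 * y, 3 * z - y]


A = matrix_of(phi, 3)
print(A)                                        # [[1, 2, 0], [0, -1, 3]]
print(apply(A, [4, 5, 6]) == phi([4, 5, 6]))    # True
```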

Lemma 2.1 yields the promised explicit description of $\mathrm{Hom}_R(R^n, R^m)$: 

Corollary 2.2. The correspondence introduced in Lemma 2.1 gives an isomorphism 
of $R$-modules 

$$M_{m,n}(R) \cong \mathrm{Hom}_R(R^n, R^m).$$

Proof. The reader will check that the correspondence is a bijective homomorphism 
of $R$-modules; this is enough, by Exercise III.5.12. □ 

This is very good news, and it gets even better. Don’t forget that $R$-Mod is a 
category; that is, we can compose morphisms. Therefore, there is a function$^9$ 

$$\mathrm{Hom}_R(R^p, R^m) \times \mathrm{Hom}_R(R^n, R^p) \to \mathrm{Hom}_R(R^n, R^m),$$

mapping $(\varphi, \psi)$ to $\varphi \circ \psi$; on the other hand, the matrix product gives a function 

$$M_{m,p}(R) \times M_{p,n}(R) \to M_{m,n}(R),$$

mapping $(A, B)$ to $A \cdot B$. That is, we have a diagram 

$$\begin{array}{ccc}
M_{m,p}(R) \times M_{p,n}(R) & \longrightarrow & M_{m,n}(R) \\
\downarrow & & \downarrow \\
\mathrm{Hom}_R(R^p, R^m) \times \mathrm{Hom}_R(R^n, R^p) & \longrightarrow & \mathrm{Hom}_R(R^n, R^m)
\end{array}$$

where the vertical maps are (induced by) the isomorphisms obtained in 
Corollary 2.2. 

$^9$Here we are reversing the order of the Hom sets on the left w.r.t. the convention used in 
Definition I.3.1. 

Lemma 2.3. This diagram commutes. That is, the matrix corresponding to a 
composition $\varphi \circ \psi$ is the product of the matrices corresponding to $\varphi$ and $\psi$. 

Proof. This follows immediately from the associativity of matrix multiplication: 
for $\mathbf{v} \in R^n$ and $A \in M_{m,p}(R)$, $B \in M_{p,n}(R)$, 

$$A \cdot (B \cdot \mathbf{v}) = (A \cdot B) \cdot \mathbf{v};$$

that is, the successive action of $B$ and $A$ is equivalent to the action of $A \cdot B$. □ 
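
Lemma 2.3 can be observed on a concrete pair of maps. In the sketch below everything is an assumption made for illustration: the ring is $\mathbb{Z}$, and `psi`, `phi` are sample homomorphisms $\mathbb{Z}^3 \to \mathbb{Z}^2 \to \mathbb{Z}^2$. The matrix of the composition is computed from the images of the standard basis and compared with the product of the two matrices. 

```python
def mat_mul(A, B):
    """Matrix product, with matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]


def matrix_of(phi, n):
    """Matrix of phi: R^n -> R^m, whose j-th column is phi(e_j)."""
    columns = [phi([1 if i == j else 0 for i in range(n)]) for j in range(n)]
    return [list(row) for row in zip(*columns)]


def psi(v):            # sample homomorphism Z^3 -> Z^2
    x, y, z = v
    return [x + z, y]


def phi(w):            # sample homomorphism Z^2 -> Z^2
    a, b = w
    return [a + b, 2 * b]


def phi_after_psi(v):  # the composition, a homomorphism Z^3 -> Z^2
    return phi(psi(v))


A = matrix_of(phi, 2)
B = matrix_of(psi, 3)
print(mat_mul(A, B))                                   # [[1, 1, 1], [0, 2, 0]]
print(matrix_of(phi_after_psi, 3) == mat_mul(A, B))    # True
```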

Here is a summary of the situation. Given an integral domain $R$, we can 
construct a ‘toy category’ as follows: we let $\mathbb{Z}^{\ge 0}$ be the set of objects, and we 
define the set of morphisms $\mathrm{Hom}(n, m)$ to be the set of matrices $M_{m,n}(R)$, with 
composition given by the matrix product (cf. Exercise I.3.6). Then we have verified 
that this toy category is ‘essentially the same as’ the category of finitely generated 
free $R$-modules and $R$-module homomorphisms$^{10}$. 

For example, by virtue of Proposition 1.7, if $R = k$ is a field, then $R$-Mod = 
$k$-Vect consists exclusively of free $k$-modules. The toy category we just described 
is a faithful snapshot of the category of finite-dimensional $k$-vector spaces. 
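
The toy category is small enough to write down as a data structure. The following is only a sketch under our own assumptions ($R = \mathbb{Z}$; the class name `Morphism` and the functions `identity` and `compose` are hypothetical): a morphism $n \to m$ records its source, its target, and an $m \times n$ array of entries, and composition is the matrix product, defined only when source and target match. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Morphism:
    """A morphism n -> m in the toy category: an m x n matrix over Z."""
    source: int   # n, the number of columns
    target: int   # m, the number of rows
    rows: tuple   # m tuples, each of length n


def identity(n):
    """The identity morphism n -> n, i.e., the identity matrix I_n."""
    return Morphism(n, n, tuple(tuple(int(i == j) for j in range(n)) for i in range(n)))


def compose(f, g):
    """Return f o g, defined only when the target of g is the source of f."""
    if g.target != f.source:
        raise ValueError("morphisms are not composable")
    rows = tuple(
        tuple(sum(f.rows[i][k] * g.rows[k][j] for k in range(f.source))
              for j in range(g.source))
        for i in range(f.target)
    )
    return Morphism(g.source, f.target, rows)


g = Morphism(3, 2, ((1, 0, 1), (0, 1, 0)))   # a morphism 3 -> 2
f = Morphism(2, 2, ((1, 1), (0, 2)))         # a morphism 2 -> 2

print(compose(f, g).rows)                                             # ((1, 1, 1), (0, 2, 0))
print(compose(f, identity(2)) == f and compose(identity(2), g) == g)  # True
```

Recording the source and target explicitly is a design choice: it keeps the matrices with no rows or no columns (morphisms to or from the object 0) unambiguous. 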

2.2. Change of basis. It is time to deal with the bad news. 

Free modules are only defined up to isomorphism, as is any structure satisfying 
a universal property. Writing a free module as a direct sum $R^{\oplus A}$ amounts to 
making the choice of one specific realization. As we pointed out at the end of §1.2, 
by Lemma 1.5 this choice is equivalent to the choice of a basis of the module. 

The price to pay for representing homomorphisms of free modules by matrices 
is that this correspondence relies on the choice of a basis: we say that it is not 
canonical. It is essential to be able to keep track of this choice. For finitely generated 
free modules, this also boils down to the action of a matrix, as we proceed to see. 

Let F be a free module, and choose two bases A, B for F; assume that F is 
finitely generated, so that A, B are finite sets, and further |A| = |B| (= rkF) by 


10 The reader should not take this statement too seriously: we have identified together all 
free modules isomorphic to a given one, which is a rather drastic operation. We will take a more 
formal look at ibis situation in Example VIII.1.8. 
Proposition 1.9. The two bases correspond to two isomorphisms

\[ \varphi : R^{\oplus A} \to F, \qquad \psi : R^{\oplus B} \to F. \]

Then

\[ \nu_A^B := \psi^{-1}\circ\varphi : R^{\oplus A} \to R^{\oplus B} \]

is an isomorphism, which corresponds to a matrix as seen in Lemma 2.1; we may
call this matrix $N_A^B$. Explicitly, the j-th column of $N_A^B$ is the image of the j-th
element of A, viewed as a column vector in $R^{\oplus B}$. That is, letting [11]

\[ A = (a_1, \dots, a_r), \qquad B = (b_1, \dots, b_r), \]

we have $N_A^B = (n_{ij})_{i=1,\dots,r;\ j=1,\dots,r}$, with

\[ \nu_A^B(a_j) = \sum_{i=1}^{r} n_{ij}\, b_i. \]

This equality is written in $R^{\oplus B}$; in practice one views the basis vectors $a_j$, $b_i$ as
vectors of F and would therefore simply write

\[ a_j = \sum_{i=1}^{r} n_{ij}\, b_i, \]

that is, the corresponding equality in F.

Definition 2.4. The matrix $N_A^B$ is called the matrix of the change of basis.


Since it represents an isomorphism, the matrix of the change of basis is neces- 
sarily an invertible matrix. 

To our knowledge there is no established convention on the notation for this
matrix, and in any case we would find it futile to try to remember any such
convention. Any choice of notation will lead to a pleasant ‘calculus’: for example, the
definition given above implies immediately $N_B^A = (N_A^B)^{-1}$ and $N_B^C \cdot N_A^B = N_A^C$. In
our experience, any such manipulation is immediately clarified by writing
isomorphisms out explicitly, so no convention is necessary.

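For example, let’s work out the action of a change of basis on the matrix
representation of a homomorphism α : F → G of two free modules. The diagram
taking care of the needed manipulations is

\[
\begin{array}{ccccccc}
R^{\oplus A} & \xrightarrow{\ \varphi\ } & F & \xrightarrow{\ \alpha\ } & G & \xleftarrow{\ \rho\ } & R^{\oplus C} \\
{\scriptstyle \nu_A^B}\big\downarrow & & \big\| & & \big\| & & \big\downarrow{\scriptstyle \mu_C^D} \\
R^{\oplus B} & \xrightarrow{\ \psi\ } & F & \xrightarrow{\ \alpha\ } & G & \xleftarrow{\ \sigma\ } & R^{\oplus D}
\end{array}
\]

Let $N_A^B$ be the matrix of $\nu_A^B = \psi^{-1}\circ\varphi$, as above, and let $M_C^D$ be the matrix for
$\mu_C^D = \sigma^{-1}\circ\rho$.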

[11] Ordering the elements of a basis is convenient, for example because it allows us to talk
about the ‘j-th element’ of the basis. Hence we have the ‘tuple’ notation.
Choose the basis A for F and C for G. Then the matrix representing α will be
the matrix P corresponding to

\[ \rho^{-1}\circ\alpha\circ\varphi : R^{\oplus A} \to R^{\oplus C}. \]

If on the other hand we choose the basis B for F and D for G, then we represent
α by the matrix Q corresponding to

\[ \sigma^{-1}\circ\alpha\circ\psi : R^{\oplus B} \to R^{\oplus D}. \]

Now,

\[ \sigma^{-1}\circ\alpha\circ\psi = (\mu_C^D\circ\rho^{-1})\circ\alpha\circ(\varphi\circ(\nu_A^B)^{-1}) = \mu_C^D\circ(\rho^{-1}\circ\alpha\circ\varphi)\circ(\nu_A^B)^{-1}, \]

and by Lemma 2.3 this shows

\[ Q = M_C^D \cdot P \cdot (N_A^B)^{-1}. \]

This is hardly surprising, of course: starting from a vector expressed as a combination
of b’s, $N_B^A = (N_A^B)^{-1}$ converts it into a combination of a’s; P acts on it by giving its image
under α as a combination of c’s; and $M_C^D$ converts it into d’s. That accomplishes
Q’s job, as it should.
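The relation between P and Q can be checked on a small example. The sketch below is only an illustration under stated assumptions: F = G = Q^2, each basis is stored as the matrix whose columns are the basis vectors in standard coordinates (so `phi`, `psi`, `rho`, `sigma` play the roles of the four isomorphisms), and the specific numbers and helper names (`matmul`, `inv2`) are made up for the occasion.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def inv2(X):
    """Inverse of an invertible 2 x 2 matrix with rational entries."""
    (a, b), (c, d) = X
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical data: columns are basis vectors in standard coordinates.
phi = [[1, 1], [0, 1]]    # basis A of F
psi = [[2, 1], [1, 1]]    # basis B of F
rho = [[1, 0], [1, 1]]    # basis C of G
sigma = [[1, 2], [0, 1]]  # basis D of G
T = [[3, 1], [4, 2]]      # alpha in standard coordinates

N = matmul(inv2(psi), phi)             # matrix of psi^{-1} o phi
M = matmul(inv2(sigma), rho)           # matrix of sigma^{-1} o rho
P = matmul(matmul(inv2(rho), T), phi)  # alpha w.r.t. bases A, C
Q = matmul(matmul(inv2(sigma), T), psi)  # alpha w.r.t. bases B, D

# The change-of-basis relation Q = M * P * N^{-1}.
assert Q == matmul(matmul(M, P), inv2(N))
```

Exact rational arithmetic is used so that the equality is tested exactly rather than up to rounding.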

Summarizing this discussion, 

Proposition 2.5. Let α : F → G be a homomorphism of finitely generated free
modules, and let P be a matrix representing it with respect to any choice of bases
for F and G. Then the matrices representing α with respect to any other choice of
bases are all and only the matrices of the form

\[ M \cdot P \cdot N, \]

where M and N are invertible matrices.

Definition 2.6. Two matrices $P, Q \in \mathcal{M}_{m,n}(R)$ are equivalent if they represent
the same homomorphism of free modules $R^n \to R^m$ up to a choice of basis.

This is manifestly an equivalence relation. Properly speaking, the ‘abstract’
homomorphism α : F → G is not represented by one matrix as much as by the whole
equivalence class with respect to this relation. Proposition 2.5 gives a computational
interpretation of equivalence of matrices: P and Q are equivalent if and only if there
are invertible M and N such that Q = MPN.

2.3. Elementary operations and Gaussian elimination. The idea is now to
capitalize on Proposition 2.5, as follows: given a homomorphism α : F → G between
two free modules, find ‘special’ bases in F and G so that the matrix of α takes a
particularly convenient form. That is, look for a particularly convenient matrix in
each equivalence class with respect to the relation introduced in Definition 2.6.

For this, we can start with random bases in F and G, representing α by a
matrix P and then (by Proposition 2.5) multiply P on the right and left by invertible
matrices, in order to bring P into whatever form is best suited for our needs.

There is an even more concrete way to deal with equivalence computationally. 
Consider the following three ‘elementary (row/column) operations’ that can be 
performed on a matrix P: 






• switch two rows (or two columns) of P; 

• add to one row (resp., column) a multiple of another row (resp., column); 

• multiply all entries in one row (or column) of P by a unit of R. 

Proposition 2.7. Two matrices $P, Q \in \mathcal{M}_{m,n}(R)$ are equivalent if Q may be
obtained from P by a sequence of elementary operations.

Proof. To see that elementary operations produce equivalent matrices, it suffices
(by Proposition 2.5) to express them as multiplications on the left or right [12] by
invertible matrices. Indeed, these operations may be performed by suitably
multiplying by the matrices obtained from the identity matrix by performing the same
operation. For example, multiplying on the left by

\[ \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \end{pmatrix} \]

interchanges the second and fourth row of a 4 × n matrix; multiplying on the right
by

\[ \begin{pmatrix} 1 & 0 & c \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \]

adds to the third column of an m × 3 matrix the c-multiple of the first column. The
diligent reader will formalize this discussion and prove that all these matrices are
indeed invertible (Exercise 2.5). □
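The two examples in the proof can be replayed mechanically. The following is a minimal sketch, assuming integer entries and 0-based indices; the helper names (`identity`, `swap_matrix`, `add_column_matrix`) are invented for this illustration.

```python
def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def swap_matrix(n, i, j):
    """Identity matrix with rows i and j switched."""
    E = identity(n)
    E[i], E[j] = E[j], E[i]
    return E

def add_column_matrix(n, src, dst, c):
    """Identity matrix with c times column src added to column dst (src != dst)."""
    E = identity(n)
    E[src][dst] = c
    return E

P = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]  # a 4 x 3 matrix

# Left multiplication acts on rows: switch the second and fourth rows.
swapped = matmul(swap_matrix(4, 1, 3), P)

# Right multiplication acts on columns: add 5 times column 1 to column 3.
shifted = matmul(P, add_column_matrix(3, 0, 2, 5))
```

Here `swapped` has the second and fourth rows of `P` interchanged, and `shifted` differs from `P` only in its third column.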

The matrices corresponding to the elementary operations are called elementary
matrices. Linear algebra over arbitrary rings would be a great deal simpler if
Proposition 2.7 were an ‘if and only if’ statement. This boils down to the question
of whether every invertible matrix may be written as a product of elementary
matrices, that is, whether elementary matrices generate the ‘general linear group’.

Definition 2.8. The n-th general linear group over the ring R, denoted $\mathrm{GL}_n(R)$,
is the group of units in $\mathcal{M}_n(R)$, that is, the group of invertible n × n matrices with
entries in R.

Brief mentions of this notion have occurred already; cf. Example II.1.5 and 
several exercises in previous chapters. 

The elementary matrices are elements of $\mathrm{GL}_n(R)$; in fact, the inverse of an
elementary matrix is (of course) again an elementary matrix. The following
observation is surely known to the reader, in one form or another; in view of the foregoing
considerations, it says that the relation introduced in Definition 2.6 is under good
control over fields.

[12] Multiplying on the left acts on the rows of the matrix; multiplying on the right acts on its
columns.
Proposition 2.9. Let R = k be a field, and let n > 0 be an integer. Then $\mathrm{GL}_n(k)$
is generated by elementary matrices.

Thus, two matrices are equivalent over a field if and only if they are linked by
a sequence of elementary operations.


Proof. Let $A = (a_{ij})$ be an n × n invertible matrix. In particular, some entry in
the first column of A is nonzero; by performing a row switch if necessary, we may
assume that $a_{11}$ is nonzero. Multiplying the first row by $a_{11}^{-1}$, we may assume that
$a_{11} = 1$:

\[ \begin{pmatrix} 1 & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} \end{pmatrix}. \]

Adding to the second row the $(-a_{21})$-multiple of the first row clears the (2, 1) entry.
After performing the analogous operation on all rows, we may assume that the only
nonzero entry in the first column is the (1, 1) entry:

\[ \begin{pmatrix} 1 & a_{12} & \cdots & a_{1n} \\ 0 & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & a_{n2} & \cdots & a_{nn} \end{pmatrix}. \]

Similarly, adding to the second column the $(-a_{12})$-multiple of the first column
clears the (1, 2) entry. Performing this operation on all columns reduces the matrix
to the form

\[ \begin{pmatrix} 1 & 0 & \cdots & 0 \\ 0 & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & a_{n2} & \cdots & a_{nn} \end{pmatrix} = \left(\begin{array}{c|c} 1 & 0 \\ \hline 0 & A' \end{array}\right), \]

where A' denotes a (clearly invertible) (n − 1) × (n − 1) matrix [13].

Repeating the process on A' and on subsequent smaller matrices reduces A to
the identity matrix $I_n$. In other words, $I_n$ may be obtained from A by a sequence
of elementary operations:

\[ I_n = M \cdot A \cdot N, \]

where M and N are products of elementary matrices. But then

\[ A = M^{-1} \cdot N^{-1} \]

is itself a product of elementary matrices, yielding the statement.


□ 


Note that, with notation as in the preceding proof, $A^{-1} = N \cdot M$; thus, the
process explained in the proof may be used to compute the inverse of a matrix (also
cf. Exercise 3.5).

[13] The vertical and horizontal lines alert the reader to the fact that the sectors of the matrix
are themselves matrices; this notation is called a block matrix.

When applied only to the rows of a matrix, the simplification of a matrix by
means of elementary operations is called Gaussian elimination. This corresponds
to multiplying the given matrix on the left by a product of elementary matrices,
and it suffices in order to reduce any square invertible matrix to the identity
(Exercise 2.15).
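Since row operations alone suffice for an invertible matrix over a field, the inverse can be computed by row-reducing the augmented matrix [A | I]. The sketch below is one possible way to organize this, assuming entries in Q (handled exactly with `fractions.Fraction`); the function name `inverse` is chosen here for illustration.

```python
from fractions import Fraction

def inverse(A):
    """Invert a square matrix over Q by row operations on [A | I]."""
    n = len(A)
    W = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(A)]
    for col in range(n):
        # Row switch: bring a nonzero entry to the diagonal position.
        pivot = next((r for r in range(col, n) if W[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is not invertible")
        W[col], W[pivot] = W[pivot], W[col]
        # Multiply the row by a unit so that the pivot becomes 1.
        p = W[col][col]
        W[col] = [x / p for x in W[col]]
        # Add multiples of the pivot row to clear the rest of the column.
        for r in range(n):
            if r != col and W[r][col] != 0:
                f = W[r][col]
                W[r] = [x - f * y for x, y in zip(W[r], W[col])]
    # The left block is now the identity; the right block is the inverse.
    return [row[n:] for row in W]
```

For instance, `inverse([[2, 1], [5, 3]])` returns the matrix with rows (3, −1) and (−5, 2), and a singular input raises `ValueError`.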

We will sloppily call ‘Gaussian elimination’ the more drastic process including 
column operations as well as row operations; this is in line with our focus on 
equivalence rather than ‘row-equivalence’. Applied to any rectangular matrix, this 
process yields the following: 

Proposition 2.10. Over a field, every m × n matrix is equivalent to a matrix of
the form

\[ \left(\begin{array}{c|c} I_r & 0 \\ \hline 0 & 0 \end{array}\right) \]

(where r ≤ min(m, n) and ‘0’ stands for null matrices of appropriate sizes).

Different matrices of the type displayed in Proposition 2.10 are inequivalent
(for example by rank considerations; cf. §3.3). Thus, Proposition 2.10 describes all
equivalence classes of matrices over a field and shows that for any given m, n there
are in fact only finitely many such classes (over a field!).

2.4. Gaussian elimination over Euclidean domains. With due care, Gaussian
elimination may be performed over every Euclidean domain. A 2 × 2 example
should suffice to illustrate the general case: let

\[ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \mathcal{M}_2(R) \]

for a Euclidean domain R, with Euclidean valuation N. After switching rows and/or
columns if necessary, we may assume that N(a) is the minimum of the valuations
of all entries in the matrix. Division with remainder gives

\[ b = aq + r \]

with r = 0 or N(r) < N(a). Adding to the second column the (−q)-multiple of the
first produces the matrix

\[ \begin{pmatrix} a & r \\ c & d - qc \end{pmatrix}. \]

If r ≠ 0, so that N(r) < N(a), we begin again and shuffle rows and columns so that
the (1, 1) entry has minimum valuation. This process may be repeated, but after a
finite number of steps the (1, 2) entry will have to vanish, because valuations are
nonnegative integers and at each iteration the valuation of the (1, 1) entry decreases.

Trivial variations of the same procedure will clear the (2, 1) entry as well,
producing a matrix

\[ \begin{pmatrix} e & 0 \\ 0 & f \end{pmatrix}. \]

Now (this is the cleverest part) we claim that we may assume that e divides f in R,
with no remainder. Indeed, otherwise we can add the second row to the first,

\[ \begin{pmatrix} e & f \\ 0 & f \end{pmatrix}, \]

and start all over with this new matrix. Again, the effect of all the operations will
be to decrease the valuation of the (1, 1) entry, so after a finite number of steps we
must reach the condition e | f.

The reader will have some fun describing this process in the general case. The
end result is the following useful remark:

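Proposition 2.11. Let R be a Euclidean domain, and let $P \in \mathcal{M}_{m,n}(R)$. Then P
is equivalent to a matrix of the form [14]

\[ \left(\begin{array}{ccc|c} d_1 & \cdots & 0 & \\ \vdots & \ddots & \vdots & 0 \\ 0 & \cdots & d_r & \\ \hline & 0 & & 0 \end{array}\right) \]

with $d_1 \mid \cdots \mid d_r$.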


This is called the Smith normal form of the matrix.
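For R = Z, with valuation the absolute value, the procedure sketched in the 2 × 2 case extends to the following routine. This is only one possible arrangement of the steps, written as an illustration: it returns the diagonal form but does not record the invertible matrices M and N, and the name `smith_normal_form` is an assumption of this sketch.

```python
def smith_normal_form(P):
    """Diagonal matrix equivalent to the integer matrix P, with d1 | d2 | ..."""
    A = [row[:] for row in P]
    m, n = len(A), len(A[0])
    t = 0
    while t < min(m, n):
        # Nonzero entry of minimum valuation in the block A[t:, t:].
        entries = [(abs(A[i][j]), i, j)
                   for i in range(t, m) for j in range(t, n) if A[i][j] != 0]
        if not entries:
            break
        _, i, j = min(entries)
        A[t], A[i] = A[i], A[t]                 # row switch
        for row in A:                           # column switch
            row[t], row[j] = row[j], row[t]
        pivot = A[t][t]
        # Division with remainder on column t (row operations) ...
        for i in range(t + 1, m):
            q = A[i][t] // pivot
            A[i] = [x - q * y for x, y in zip(A[i], A[t])]
        # ... and on row t (column operations).
        for j in range(t + 1, n):
            q = A[t][j] // pivot
            for row in A:
                row[j] -= q * row[t]
        if any(A[i][t] for i in range(t + 1, m)) or any(A[t][j] for j in range(t + 1, n)):
            continue  # a nonzero remainder has smaller valuation: begin again
        # The pivot should divide every remaining entry.
        bad = next((i for i in range(t + 1, m)
                    for j in range(t + 1, n) if A[i][j] % pivot), None)
        if bad is not None:
            A[t] = [x + y for x, y in zip(A[t], A[bad])]  # add that row to row t
            continue
        A[t][t] = abs(pivot)  # multiply the row by the unit -1 if necessary
        t += 1
    return A
```

For example, the diagonal matrix with entries 2 and 3 is equivalent to the diagonal matrix with entries 1 and 6, which is the matrix analogue of Z/2 ⊕ Z/3 ≅ Z/6.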

Proposition 2.11 suffices to prove a weak form of one of our main goals (a
classification theorem for finitely generated modules over PIDs) over Euclidean domains;
this will hopefully be clear in a little while, but the reader can gain the needed
insight right away by contemplating Proposition 2.11 vis-à-vis the classification of
finite abelian groups (Exercise 2.19).

Remark 2.12. As a consequence of the preceding considerations and arguing as
in the proof of Proposition 2.9, we see that $\mathrm{GL}_n(R)$ is generated by elementary
matrices if R is a Euclidean domain. The reader may think that some cleverness in
handling Gaussian elimination may extend this to more general rings, but this will
not go too far: there are examples of PIDs R for which $\mathrm{GL}_n(R)$ is not generated
by elementary matrices.

On the other hand, some cleverness does manage to produce a Smith normal
form for any matrix with entries in a PID: the presence of a good gcd suffices to
adapt the procedure sketched above (but one may need more than elementary row
and column operations). We will take a direct approach to this question in §5.


Exercises 


2.1. Prove that the subset of $\mathcal{M}_2(R)$ consisting of matrices of the form

\[ \begin{pmatrix} 1 & r \\ 0 & 1 \end{pmatrix} \]

is a group under matrix multiplication and is isomorphic to (R, +).

2.2. ▷ Prove that matrix multiplication is associative. [§2.1]


[14] Here r ≤ min(m, n), the bottom-right 0 stands for a null (m − r) × (n − r) matrix, etc.
2.3. ▷ Prove that both $\mathcal{M}_n(R)$ and $\mathrm{Hom}_R(R^n, R^n)$ are R-algebras in a natural way
and the bijection $\mathrm{Hom}_R(R^n, R^n) \cong \mathcal{M}_n(R)$ of Corollary 2.2 is an isomorphism of
R-algebras. (Cf. Exercise III.5.9.) In particular, if the matrix M corresponds to
the homomorphism $\varphi : R^n \to R^n$, then M is invertible in $\mathcal{M}_n(R)$ if and only if φ
is an isomorphism. [§2.1, §3.2, §6.1]

2.4. Prove Corollary 2.2.

2.5. ▷ Give a formal argument proving Proposition 2.7. [§2.3]

2.6. ¬ A matrix with entries in a field is in row echelon form if

• its nonzero rows are all above the zero rows and

• the leftmost nonzero entry of each row is 1, and it is strictly to the right of the
leftmost nonzero entry of the row above it.

The matrix is further in reduced row echelon form if

• the leftmost nonzero entry of each row is the only nonzero entry in its column.

The leftmost nonzero entries in a matrix in row echelon form are called pivots.

Prove that any matrix with entries in a field can be brought into reduced
echelon form by a sequence of elementary operations on rows. (This is what is
more properly called Gaussian elimination.) [2.7, 2.9]

2.7. ¬ Let M be a matrix with entries in a field and in reduced row echelon form
(Exercise 2.6). Prove that if a row vector r is a linear combination $\sum_i a_i \mathbf{r}_i$ of the
nonzero rows $\mathbf{r}_i$ of M, then $a_i$ equals the component of r at the position corresponding
to the pivot on the i-th row of M. Deduce that the nonzero rows of M are linearly
independent. [2.9]


2.8. ¬ Two matrices M, N are row-equivalent if M = PN for an invertible matrix
P. Prove that this is indeed an equivalence relation, and that two matrices with
entries in a field are row-equivalent if and only if one may be obtained from the
other by a sequence of elementary operations on rows. [2.9, 2.12]


2.9. ¬ Let k be a field, and consider row-equivalence (Exercise 2.8) on the set of
m × n matrices $\mathcal{M}_{m,n}(k)$. Prove that each equivalence class contains exactly one matrix
in reduced row echelon form (Exercise 2.6). (Hint: To prove uniqueness, argue by
contradiction. Let M, N be different row-equivalent reduced row echelon matrices;
assume that they have the minimum number of columns with this property. If the
leftmost column at which M and N differ is the k-th column, use the minimality
to prove that M, N may be assumed to be of the form

\[ \left(\begin{array}{c|c} I_{k-1} & * \\ \hline 0 & * \end{array}\right) \quad\text{or}\quad \left(\begin{array}{c|c} I_{k-1} & * \end{array}\right). \]

Use Exercise 2.7 to obtain a contradiction.)

The unique matrix in reduced row echelon form that is row-equivalent to a
given matrix M is called the reduced echelon form of M. [2.11]
2.10. ▷ The row space of a matrix M is the span of its rows; the column space of
M is the span of its columns. Prove that row-equivalent matrices have the same row
space and isomorphic column spaces. [2.12, §3.3]

2.11. Let k be a field and $M \in \mathcal{M}_{m,n}(k)$. Prove that the dimension of the space
spanned by the rows of M equals the number of nonzero rows in the reduced echelon
form of M (cf. Exercise 2.9).

2.12. ¬ Let k be a field, and consider row-equivalence on $\mathcal{M}_{m,n}(k)$ (Exercise 2.8).
By Exercise 2.10, row-equivalent matrices have the same row space. Prove that,
conversely, there is exactly one row-equivalence class in $\mathcal{M}_{m,n}(k)$ for each subspace
of $k^n$ of dimension ≤ m. [2.13, 2.14]

2.13. ¬ The set of subspaces of given dimension in a fixed vector space is called
a Grassmannian. In Exercise 2.12 you have constructed a bijection between the
Grassmannian of r-dimensional subspaces of $k^n$ and the set of reduced row echelon
matrices with n columns and r nonzero rows.

For r = 1, the Grassmannian is called the projective space. For a vector space V,
the corresponding projective space $\mathbb{P}V$ is the set of ‘lines’ (1-dimensional subspaces)
in V. For $V = k^n$, $\mathbb{P}V$ may be denoted $\mathbb{P}_k^{n-1}$, and the field k may be omitted if it is
clear from the context. Show that $\mathbb{P}_k^{n-1}$ may be written as a union
$k^{n-1} \cup k^{n-2} \cup \cdots \cup k^1 \cup k^0$, and describe each of these subsets ‘geometrically’.

Thus, $\mathbb{P}^{n-1}$ is the union of n ‘cells’ [15], the largest one having dimension n − 1
(accounting for the choice of notation). Similarly, all Grassmannians may be written
as unions of cells. These are called Schubert cells.

Prove that the Grassmannian of (n − 1)-dimensional subspaces of $k^n$ admits a
cell decomposition entirely analogous to that of $\mathbb{P}_k^{n-1}$. (This phenomenon will be
explained in Exercise VIII.5.17.) [VII.2.20, VIII.4.7, VIII.5.17]

2.14. ▷ Show that the Grassmannian $\mathrm{Gr}_k(2, 4)$ of 2-dimensional subspaces of $k^4$ is
the union of 6 Schubert cells: $k^4 \cup k^3 \cup k^2 \cup k^2 \cup k^1 \cup k^0$. (Use Exercise 2.12; list
all the possible reduced echelon forms.) [VIII.4.8]

2.15. ▷ Prove that a square matrix with entries in a field is invertible if and only
if it is equivalent to the identity, if and only if it is row-equivalent to the identity,
if and only if its reduced echelon form is the identity. [§2.3, 3.5]

2.16. Prove Proposition 2.10. 

2.17. Prove Proposition 2.11. 

2.18. Suppose $\alpha : \mathbb{Z}^3 \to \mathbb{Z}^2$ is represented by the matrix

\[ \begin{pmatrix} -6 & 12 & 18 \\ 15 & 36 & 54 \end{pmatrix} \]

with respect to the standard bases. Find bases of $\mathbb{Z}^3$, $\mathbb{Z}^2$ with respect to which α
is given by a matrix of the form obtained in Proposition 2.11.


[15] Here, a ‘cell’ is simply a subset endowed with a natural bijection with $k^\ell$ for some $\ell$.
2.19. ▷ Prove Corollary IV.6.5 again as a corollary of Proposition 2.11. In fact,
prove the more general fact that every finitely generated abelian group is a direct
sum of cyclic groups. [§II.6.3, §IV.6.1, §IV.6.3, §2.4, §4.1, §4.3]


3. Homomorphisms of free modules, II 

The work in §2 accomplishes the goal of describing $\mathrm{Hom}_R(F, G)$, where F and G
are free R-modules of finite rank; as we have seen, this can be done most explicitly
if, for example, R is a field or a Euclidean domain. We are not quite done, though:
even over fields, it is important to ‘understand the answer’, that is, examine the
classification of homomorphisms into finitely many classes, highlighted at the end
of §2.3. Also, the frequent appearance of invertible matrices makes it necessary
to develop criteria to tell whether a given matrix is or is not in the general linear
group. We begin by pointing out a straightforward application of the classification
of homomorphisms.

3.1. Solving systems of linear equations. A moment’s thought reveals that
Gaussian elimination, through the auspices of statements such as Proposition 2.9,
gives us a tool to solve systems of linear equations over a field or a Euclidean
domain [16]. This is another point which does not seem to warrant a careful treatment
or memory gymnastics: writing things out carefully should take care of producing
tools as need be. To describe a typical situation, suppose

\[ \begin{cases} a_{11}x_1 + \cdots + a_{1n}x_n = b_1 \\ \qquad\qquad \cdots \\ a_{m1}x_1 + \cdots + a_{mn}x_n = b_m \end{cases} \]

is a system of m equations in n unknowns, with $a_{ij}$, $b_i$ in a Euclidean domain R.
We want to find all solutions $x_1, \dots, x_n$ in R. Writing $A = (a_{ij})$ for the matrix of
coefficients and x, b for the column vectors with entries $x_1, \dots, x_n$ and $b_1, \dots, b_m$,
this amounts to ‘solving for x’ in the matrix equation

\[ A \cdot x = b. \]


Of course, if m = n and the matrix A is square and invertible, then the solutions
are obtained simply as

\[ x = A^{-1} \cdot b. \]

Here $A^{-1}$ may be obtained through Gaussian elimination (cf. Proposition 2.9) or
through determinants (§3.2).


[16] Of course one can deal with systems over arbitrary integral domains R by reducing to the
field case, by embedding R in its field of fractions.
Even without requiring anything special of the ‘matrix of coefficients’ A, row
and column operations will take it to the standard form presented in
Proposition 2.11; that is, it will yield invertible matrices M, N such that

\[ M \cdot A \cdot N = \left(\begin{array}{ccc|c} d_1 & \cdots & 0 & \\ \vdots & \ddots & \vdots & 0 \\ 0 & \cdots & d_r & \\ \hline & 0 & & 0 \end{array}\right) \]

with notation as in Proposition 2.11. Gaussian elimination is a constructive
procedure: watching yourself as you switch/combine rows or columns will produce M
and N explicitly. Now, letting $y = (y_j)$ and c = Mb, the system

\[ (MAN) \cdot y = c \]

solves itself:

\[ \begin{cases} d_1 y_1 = c_1 \\ \qquad \cdots \\ d_r y_r = c_r \\ 0 = c_{r+1} \\ \qquad \cdots \\ 0 = c_m \end{cases} \]

has solutions if and only if $d_j \mid c_j$ for all j and $c_j = 0$ for j > r; moreover, in this
case $y_j = d_j^{-1} c_j$ for j = 1, …, r, and $y_j$ is arbitrary for j > r. This yields y, and
the reader will check that x = Ny gives all solutions to the original system.

Such arguments can be packaged into convenient explicit procedures to solve 
systems of linear equations. Again, it seems futile to list (or try to remember) any 
such procedure; it seems more important to know what is behind such techniques, 
so as to be able to come up with one when needed. 
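As a small worked instance of the procedure (the matrix and the right-hand sides are chosen here purely for illustration), take R = Z and a 2 × 2 system. One column operation and two row operations bring A to the form of Proposition 2.11, and the divisibility conditions then decide solvability:

```latex
\[
A = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}, \qquad
M = \begin{pmatrix} 1 & 0 \\ 3 & -1 \end{pmatrix}, \qquad
N = \begin{pmatrix} 1 & -2 \\ 0 & 1 \end{pmatrix}, \qquad
M A N = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}.
\]
% Case b = (1, 1)^t:
\[
c = Mb = \begin{pmatrix} 1 \\ 2 \end{pmatrix}, \quad
d_1 = 1 \mid 1,\ d_2 = 2 \mid 2, \quad
y = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \quad
x = Ny = \begin{pmatrix} -1 \\ 1 \end{pmatrix}, \quad
A x = \begin{pmatrix} 1 \\ 1 \end{pmatrix}.
\]
% Case b = (1, 2)^t:
\[
c = Mb = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \quad
d_2 = 2 \nmid c_2 = 1, \quad \text{so there is no solution in } \mathbb{Z}.
\]
```

In the second case the system does have a solution over Q, in line with the footnote on passing to the field of fractions; it simply has none with integer entries.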

In any case, the reader should be able to justify such recipes rigorously. The
most famous one is possibly Cramer’s rule (Proposition 3.6), which relies on
determinants and is, incidentally, essentially useless in practice for any decent-size
problem.

3.2. The determinant. Let α : F → G be a homomorphism of free R-modules
of the same rank, and let A be the matrix representing α with respect to a choice
of bases for F and G. As a consequence of Lemma 2.3 (cf. Exercise 2.3), α is an
isomorphism if and only if A is a unit in $\mathcal{M}_n(R)$, that is, if and only if it is invertible
as a matrix with entries in R. This may be detected by computing the determinant
of A.

Definition 3.1. Let $A = (a_{ij}) \in \mathcal{M}_n(R)$ be a square matrix. Then the determinant
of A is the element

\[ \det(A) = \sum_{\sigma \in S_n} (-1)^{\sigma} \prod_{i=1}^{n} a_{i\sigma(i)} \in R. \]

Here $S_n$ denotes the symmetric group on {1, …, n}, and we write [17] σ(i) for
the action of $\sigma \in S_n$ on $i \in \{1, \dots, n\}$; $(-1)^{\sigma}$ is the sign of a permutation
(Definition IV.4.10, Lemma IV.4.12).
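Definition 3.1 can be transcribed almost literally. The sketch below is an illustration only: it uses 0-based indices, works for entries in any commutative ring that Python can multiply and add (integers in the example), and computes the sign by counting inversions, which is one of several equivalent descriptions of the sign of a permutation.

```python
from itertools import permutations

def sign(perm):
    """Sign of a permutation of 0..n-1, via the parity of its inversions."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm)) if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def det_leibniz(A):
    """Sum over all permutations of sign * product of a[i][sigma(i)]."""
    n = len(A)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for i in range(n):
            term *= A[i][perm[i]]
        total += term
    return total
```

The sum has n! terms, so this is a faithful rendering of the definition rather than a practical algorithm.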

The reader is surely familiar with determinants, at least over fields. They 
satisfy a number of remarkable properties; here is a selection: 

— The determinant of a matrix A equals the determinant of its transpose $A^t =
(a^t_{ij})$, defined by setting

\[ a^t_{ij} := a_{ji} \]

for all i and j (that is, the rows of $A^t$ are the columns of A). Indeed,

\[ \det A^t = \sum_{\sigma \in S_n} (-1)^{\sigma} \prod_{i=1}^{n} a^t_{i\sigma(i)} = \sum_{\sigma \in S_n} (-1)^{\sigma} \prod_{i=1}^{n} a_{\sigma(i)i} = \sum_{\sigma \in S_n} (-1)^{\sigma} \prod_{i=1}^{n} a_{i\sigma^{-1}(i)} \]

by the commutativity of the product in R; and $\sigma^{-1}$ ranges over all permutations
of {1, …, n} as σ does the same, and with the same sign, so the right-most term
equals det A.

— If two rows or columns of a square matrix A agree, then det A = 0. Indeed, it
is enough to check this for matching columns; the case for rows follows by applying
the previous observation. If columns j and j' of A are equal, the contribution to
det A due to a $\sigma \in S_n$ is equal and opposite in sign to the contribution due to the
product of σ and the transposition (j j'), so det A = 0.

— Suppose $A = (a_{ij})$ and $B = (b_{ij})$ agree on all but at most one row: $a_{ij} = b_{ij}$
if i ≠ k, for all j and some fixed k. Let $c_{ij} := a_{ij} = b_{ij}$ for i ≠ k, $c_{kj} := a_{kj} + b_{kj}$,
and let $C := (c_{ij})$. Then

\[ \det(C) = \det(A) + \det(B). \]

This follows immediately from Definition 3.1 and distributivity. Applying this 
observation to the transpose matrices gives an analogous statement for matrices 
differing at most along a column. 

We will record more officially the effect of elementary operations on determi- 
nants: 

Lemma 3.2. Let A be a square matrix with entries in an integral domain R.

• Let A' be obtained from A by switching two rows or two columns. Then
det(A') = −det(A).

• Let A' be obtained from A by adding to a row (column) a multiple of another
row (column). Then det(A') = det(A).

• Let A' be obtained from A by multiplying a row (column) by an element [18]
c ∈ R. Then det(A') = c det(A).

In other words, the effect of an elementary operation on det A is the same as
multiplying det A by the determinant of the corresponding elementary matrix.


[17] Consistency with previous encounters with the symmetric group (e.g., §IV.4) would demand
that we write iσ; but then we would end up with things like ‘$a_{ii\sigma}$’, which are very hard to parse.
[18] For an elementary operation, c should be a unit; this restriction is not necessary here.
Proof. These are all essentially immediate from Definition 3.1. For example,
switching two columns amounts to correcting each σ in the definition by a fixed
transposition, changing the sign of all contributions to the Σ in the definition. The
third point is immediate from distributivity. Combining the third operation and
the two remarks preceding the statement yields the second point. Details are left
to the reader (Exercise 3.2). □


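This observation simplifies the theory of determinants drastically. If R = k is
a field and $A \in \mathcal{M}_n(k)$, Gaussian elimination (Proposition 2.10) shows

\[ A = E_1 \cdots E_a \cdot \left(\begin{array}{c|c} I_r & 0 \\ \hline 0 & 0 \end{array}\right) \cdot E'_1 \cdots E'_b, \]

where r ≤ n and $E_i$, $E'_j$ are elementary matrices. Then Lemma 3.2 gives that

\[ \det(A) = \prod_i \det(E_i) \prod_j \det(E'_j) \cdot \det\left(\begin{array}{c|c} I_r & 0 \\ \hline 0 & 0 \end{array}\right). \]

(In particular, det A ≠ 0 only if r = n.) Useful facts about the determinant follow
from this remark. For example,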

Proposition 3.3. Let R be a commutative ring.

• A square matrix $A \in \mathcal{M}_n(R)$ is invertible if and only if det A is a unit in R.

• The determinant is a homomorphism [19] $\mathrm{GL}_n(R) \to (R^*, \cdot)$. More generally,
for $A, B \in \mathcal{M}_n(R)$,

\[ \det(A \cdot B) = \det(A)\det(B). \]


Proof for R = a field. If R = k is a field, we can use the considerations
immediately preceding the statement. The first point is reduced to the case of a block
matrix

\[ \left(\begin{array}{c|c} I_r & 0 \\ \hline 0 & 0 \end{array}\right), \]

for which it is immediate. In fact, this shows that det A = 0 if and only if the linear
map $k^n \to k^n$ corresponding to A is not an isomorphism. In particular, for all A, B
we have det(AB) = 0 if and only if AB is not an isomorphism, if and only if A or B
is not an isomorphism, if and only if det(A) = 0 or det(B) = 0. So we only need
to check the homomorphism property for invertible matrices. These are products
of elementary matrices ‘on the nose’, and the homomorphism property then follows
from Lemma 3.2. □


[19] Recall that $(R^*, \cdot)$ denotes the group of units of R.

Before giving the (easy) extension to the case of arbitrary commutative rings,
it is helpful to note the following explicit formulas. A ‘submatrix’ obtained from a
given matrix A by removing a number of rows and columns is called a minor of A;
more properly, this term refers to the determinants of square submatrices obtained
in these ways. If $A \in \mathcal{M}_n(R)$, the cofactors of A are the (n − 1) × (n − 1) minors
of A, corrected by a sign. More precisely, for $A = (a_{ij})$ we will let

\[ A^{(ij)} := (-1)^{i+j} \det \begin{pmatrix}
a_{11} & \cdots & a_{1\,j-1} & a_{1\,j+1} & \cdots & a_{1n} \\
\vdots & & \vdots & \vdots & & \vdots \\
a_{i-1\,1} & \cdots & a_{i-1\,j-1} & a_{i-1\,j+1} & \cdots & a_{i-1\,n} \\
a_{i+1\,1} & \cdots & a_{i+1\,j-1} & a_{i+1\,j+1} & \cdots & a_{i+1\,n} \\
\vdots & & \vdots & \vdots & & \vdots \\
a_{n1} & \cdots & a_{n\,j-1} & a_{n\,j+1} & \cdots & a_{nn}
\end{pmatrix}. \]

Lemma 3.4. With notation as above,

• for all i = 1, …, n, $\det(A) = \sum_{j=1}^{n} a_{ij} A^{(ij)}$;

• for all j = 1, …, n, $\det(A) = \sum_{i=1}^{n} a_{ij} A^{(ij)}$.




Proof. This is a simple (if slightly messy) induction on n, which we leave to the 
diligent reader. □ 


Of course Lemma 3.4 is simply stating the famous strategy of computing a
determinant by expanding it with respect to your favorite row or column. This works
wonderfully for very small matrices and is totally useless for large ones, since the
number of computations needed to apply it grows as the factorial of the size of the
matrix. From a computational point of view, it makes much better sense to apply
Gaussian elimination and use the considerations preceding Proposition 3.3.
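The elimination approach can be sketched as follows, assuming entries in Q (handled exactly with `fractions.Fraction`); the function name `det_by_elimination` is chosen for this illustration. Only two kinds of row operations are used, and Lemma 3.2 says how each affects the determinant: a switch changes the sign, and adding a multiple of one row to another changes nothing.

```python
from fractions import Fraction

def det_by_elimination(A):
    """Determinant over Q via row operations, tracking their effect."""
    M = [[Fraction(x) for x in row] for row in A]
    n = len(M)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)          # no pivot in this column: not invertible
        if pivot != col:
            M[col], M[pivot] = M[pivot], M[col]
            det = -det                  # a row switch changes the sign
        det *= M[col][col]              # diagonal entry of the triangular form
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            # Adding a multiple of a row to another leaves det unchanged.
            M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return det
```

The number of arithmetic operations grows like the cube of the size of the matrix, in contrast with the factorial growth of the expansion in Lemma 3.4.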

But Lemma 3.4 has the following important implication: 


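Corollary 3.5. Let R be a commutative ring and $A \in \mathcal{M}_n(R)$. Then

\[ A \cdot \begin{pmatrix} A^{(11)} & \cdots & A^{(n1)} \\ \vdots & \ddots & \vdots \\ A^{(1n)} & \cdots & A^{(nn)} \end{pmatrix}
= \begin{pmatrix} A^{(11)} & \cdots & A^{(n1)} \\ \vdots & \ddots & \vdots \\ A^{(1n)} & \cdots & A^{(nn)} \end{pmatrix} \cdot A
= \det(A) \cdot I_n. \]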


Note the switch in the role of i and j in the matrix of cofactors. This matrix 
is called the adjoint matrix of A. 


Proof. Along the diagonal of the right-hand side, this is a restatement of Lemma 3.4.
Off the diagonal, one is evaluating (for example)

\[ \sum_{j=1}^{n} a_{i'j} A^{(ij)} \]

for i' ≠ i. By Lemma 3.4 this is the same as the determinant of the matrix obtained
by replacing the i-th row with the i'-th row; the resulting matrix has two equal rows,
so its determinant is 0, as needed. □
In particular, Corollary 3.5 proves that we can invert a matrix if we can invert
its determinant:

\[ A^{-1} = \det(A)^{-1} \begin{pmatrix} A^{(11)} & \cdots & A^{(n1)} \\ \vdots & \ddots & \vdots \\ A^{(1n)} & \cdots & A^{(nn)} \end{pmatrix}; \]

this holds over any commutative ring, as soon as det A is a unit. In practice the
computation of cofactors is ‘expensive’, so in any given concrete case Gaussian
elimination is likely a better alternative (at least over fields); cf. Exercise 3.5.
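The identity of Corollary 3.5 involves no division, so it can be checked over Z. The following sketch (helper names invented here) computes determinants by expansion along the first row, as in Lemma 3.4, and assembles the matrix of cofactors with the switch of indices noted above.

```python
def minor(A, i, j):
    """Submatrix of A obtained by removing row i and column j."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(A) if k != i]

def det(A):
    """Expansion along the first row; the empty matrix has determinant 1."""
    if not A:
        return 1
    return sum((-1) ** j * A[0][j] * det(minor(A, 0, j)) for j in range(len(A)))

def cofactor(A, i, j):
    return (-1) ** (i + j) * det(minor(A, i, j))

def adjoint(A):
    """Matrix whose (i, j) entry is the cofactor A^(ji)."""
    n = len(A)
    return [[cofactor(A, j, i) for j in range(n)] for i in range(n)]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

A = [[2, 0, 1], [1, 3, 2], [1, 1, 2]]
left = matmul(A, adjoint(A))    # equals det(A) times the identity
right = matmul(adjoint(A), A)   # same
```

With this choice of `A` the determinant is 6, which is not a unit in Z; accordingly `A` has no inverse with integer entries, although both products equal 6 times the identity.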

But this formula for the inverse has good theoretical significance. For example, 
we are now in a position to complete the proof of Proposition 3.3. 

Proof of Proposition 3.3 for commutative rings. The first point follows from
the second and from what we have just seen. Indeed, we have checked that
$A \in \mathcal{M}_n(R)$ admits an inverse $A^{-1} \in \mathcal{M}_n(R)$ if det(A) is a unit in R; conversely,
if A admits an inverse $A^{-1} \in \mathcal{M}_n(R)$, then

\[ \det(A)\det(A^{-1}) = \det(AA^{-1}) = \det(I_n) = 1 \]

by the second statement, so that $\det(A^{-1})$ is the inverse of det(A) in R.

Thus, we just need to verify the second point, that is, the homomorphism
property of determinants. In order to verify this over every (commutative) ring,
it suffices to verify the ‘universal’ identity obtained by writing out the claimed
equality, for matrices with indeterminate entries. For example, for n = 2 the
statement is

\[ \det\begin{pmatrix} x_1 & x_2 \\ x_3 & x_4 \end{pmatrix} \det\begin{pmatrix} y_1 & y_2 \\ y_3 & y_4 \end{pmatrix} = \det\begin{pmatrix} x_1y_1 + x_2y_3 & x_1y_2 + x_2y_4 \\ x_3y_1 + x_4y_3 & x_3y_2 + x_4y_4 \end{pmatrix}, \]

which translates into the identity

\[ (x_1x_4 - x_2x_3)(y_1y_4 - y_2y_3) = (x_1y_1 + x_2y_3)(x_3y_2 + x_4y_4) - (x_1y_2 + x_2y_4)(x_3y_1 + x_4y_3); \]

since this identity holds in $\mathbb{Z}[x_1, \dots, y_4]$, it must hold in any commutative ring, for
any choice of $x_1, \dots, y_4$: indeed, $\mathbb{Z}$ is initial in Ring.

Now, we have verified that the homomorphism property holds over fields; in
particular it holds over the field of fractions of $\mathbb{Z}[x_{11}, \dots, x_{nn}, y_{11}, \dots, y_{nn}]$. It
follows that it does hold in $\mathbb{Z}[x_{11}, \dots, x_{nn}, y_{11}, \dots, y_{nn}]$, and we are done. □

The ‘universal identity’ argument extending the result from fields to arbitrary 
commutative rings is a useful device, and the reader is invited to contemplate it 
carefully. 
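For n = 2 the universal identity can also be confirmed by brute force, which may help in contemplating the argument. Expanding both sides in $\mathbb{Z}[x_1, \dots, x_4, y_1, \dots, y_4]$ (a routine computation, spelled out here only as an illustration):

```latex
\begin{align*}
(x_1x_4 - x_2x_3)(y_1y_4 - y_2y_3)
  &= x_1x_4y_1y_4 - x_1x_4y_2y_3 - x_2x_3y_1y_4 + x_2x_3y_2y_3, \\
(x_1y_1 + x_2y_3)(x_3y_2 + x_4y_4)
  &= x_1x_3y_1y_2 + x_1x_4y_1y_4 + x_2x_3y_2y_3 + x_2x_4y_3y_4, \\
(x_1y_2 + x_2y_4)(x_3y_1 + x_4y_3)
  &= x_1x_3y_1y_2 + x_1x_4y_2y_3 + x_2x_3y_1y_4 + x_2x_4y_3y_4.
\end{align*}
% Subtracting the third line from the second cancels x_1x_3y_1y_2 and
% x_2x_4y_3y_4 and leaves exactly the four monomials of the first line.
```

The point of the argument in the proof is that no such expansion is needed for general n: the identity is inherited from the field of fractions of the polynomial ring.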

As an application of determinants (and especially cofactors) we can now go
back to the special case of a system of n equations in n unknowns

\[ A \cdot x = b, \]

cf. §3.1, in the case in which det A is a unit.

Proposition 3.6 (Cramer’s rule). Assume det(A) is a unit, and let $A^{(j)}$ be the
matrix obtained by replacing the j-th column of A by the column vector b. Then

\[ x_j = \det(A)^{-1} \det(A^{(j)}). \]
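Proof. Using Lemma 3.4, expand $\det(A^{(j)})$ with respect to the j-th column:

\[ \det(A^{(j)}) = \sum_{i=1}^{n} A^{(ij)} b_i. \]

Therefore

\[ \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = A^{-1} b
= \det(A)^{-1} \begin{pmatrix} A^{(11)} & \cdots & A^{(n1)} \\ \vdots & \ddots & \vdots \\ A^{(1n)} & \cdots & A^{(nn)} \end{pmatrix} \begin{pmatrix} b_1 \\ \vdots \\ b_n \end{pmatrix}
= \det(A)^{-1} \begin{pmatrix} \sum_{i=1}^{n} A^{(i1)} b_i \\ \vdots \\ \sum_{i=1}^{n} A^{(in)} b_i \end{pmatrix}
= \begin{pmatrix} \det(A)^{-1}\det(A^{(1)}) \\ \vdots \\ \det(A)^{-1}\det(A^{(n)}) \end{pmatrix}, \]

which gives the statement.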


□ 
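
The formula in Cramer's rule can be tried out directly. The sketch below works over $\mathbb Q$ (so that 'unit' just means 'nonzero'), computes determinants by cofactor expansion, and builds each $A^{(j)}$ by replacing the $j$-th column of $A$ with $b$. The helper names `det` and `cramer` and the sample system are illustrative assumptions.

```python
from fractions import Fraction

def det(m):
    """Determinant by cofactor expansion along the first column."""
    n = len(m)
    if n == 0:
        return Fraction(1)
    total = Fraction(0)
    for i in range(n):
        # Minor obtained by deleting row i and the first column.
        minor = [row[1:] for k, row in enumerate(m) if k != i]
        total += (-1) ** i * m[i][0] * det(minor)
    return total

def cramer(a, b):
    """Solve a x = b over Q for a square matrix a with nonzero determinant."""
    d = det(a)
    if d == 0:
        raise ValueError("det(A) is not a unit")
    solution = []
    for j in range(len(a)):
        # A^(j): replace the j-th column of A by the column vector b.
        a_j = [row[:j] + [b[i]] + row[j + 1:] for i, row in enumerate(a)]
        solution.append(det(a_j) / d)
    return solution

# Sample system: 2x + y = 5, x + 3y = 10.
A = [[Fraction(2), Fraction(1)], [Fraction(1), Fraction(3)]]
b = [Fraction(5), Fraction(10)]
x = cramer(A, b)
```

Cofactor expansion is exponentially slow in $n$; it is used here only because it mirrors the proof, not because it is a sensible way to solve large systems.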


3.3. Rank and nullity. According to Proposition 2.10, each equivalence class of 
matrices over a field has a representative of the type

$$\begin{pmatrix} I_r & 0 \\ 0 & 0 \end{pmatrix}.$$

The reason why two different $m \times n$ matrices of this type are surely inequivalent 
is that if $\alpha : V \to W$ (with $V$ and $W$ free modules, vector spaces in this case) is 
represented by a matrix of this form, then $r$ is the dimension of the image of $\alpha$. 
Therefore, matrices with different $r$ cannot represent the same $\alpha$.

This integer r is called the rank of the matrix and deserves some attention. We 
will discuss it for matrices with entries in a field, leaving generalizations to more 
general rings to the reader (for now at least). 

The column (row) space of a matrix P over a field k is the span of the columns 
(rows) of P. The column (row) rank of P is the dimension of the column (row) 
space of P. 

Proposition 3.7. The row rank of a matrix over a field $k$ equals its column rank.

Proof. Equivalent matrices have the same ranks. Indeed, let $P \in \mathcal M_{m,n}(k)$; the 
row space of $P$ consists of all row vectors

$$(a_1 \ \cdots \ a_n) = (v_1 \ \cdots \ v_m) \cdot P$$

obtained as each $v_i$ ranges in $k$. If $Q = MPN$, with $M$ and $N$ invertible, let 
$(w_1 \ \cdots \ w_m) = (v_1 \ \cdots \ v_m) \cdot M^{-1}$; then

$$(w_1 \ \cdots \ w_m) \cdot Q = (v_1 \ \cdots \ v_m) \cdot M^{-1}(MPN) = (a_1 \ \cdots \ a_n) \cdot N.$$

This shows that multiplication on the right by $N$ maps the row space of $P$ (isomorphically, 
since $N$ is invertible) to the row space of $Q$; thus the two spaces have the 
same dimension, as claimed. Minimal variations on the same argument show that 
the column ranks of $P$ and $Q$ agree. (Cf. Exercise 2.10.)

Applying this observation and Proposition 2.10 reduces the question to matrices

$$\begin{pmatrix} I_r & 0 \\ 0 & 0 \end{pmatrix},$$

for which row rank $= r =$ column rank, proving the statement. □

The result upgrades easily to matrices over an arbitrary integral domain $R$ 
(applying the usual trick of embedding $R$ in its field of fractions).

In view of Proposition 3.7, we can simply talk about the rank of a matrix: 

Definition 3.8. Let $M \in \mathcal M_{m,n}(k)$ be a matrix over a field $k$. The rank of $M$ is 
the dimension of its column (or, equivalently, row) space. ⌟

One way to rephrase Proposition 2.10 is that matrices over a field are classified 
up to equivalence by their rank. 
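
Proposition 3.7 is easy to observe numerically. The sketch below computes the row rank of a matrix over $\mathbb Q$ by Gaussian elimination and obtains the column rank as the row rank of the transpose; the function names and the sample matrix are illustrative assumptions.

```python
from fractions import Fraction

def rank(matrix):
    """Row rank over Q: number of pivots found by Gaussian elimination on rows."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for c in range(ncols):
        # Look for a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

def transpose(matrix):
    """Transpose of a rectangular matrix given as a list of rows."""
    return [list(col) for col in zip(*matrix)]

P = [[1, 2, 3, 4], [2, 4, 6, 8], [1, 0, 1, 0]]
row_rank = rank(P)                  # dimension of the row space of P
column_rank = rank(transpose(P))    # dimension of the column space of P
```

Row operations do not change the row space, which is why counting pivots computes the row rank.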

The foregoing considerations translate nicely in more abstract terms for a linear 
map $\alpha : V \to W$ between finite-dimensional vector spaces over a field $k$. Using the 
convenient language introduced in §III.7.1, note that each $\alpha$ determines an exact 
sequence of vector spaces

$$0 \to \ker\alpha \to V \to \operatorname{im}\alpha \to 0.$$

Definition 3.9. The rank of $\alpha$, denoted $\operatorname{rk}\alpha$, is the dimension of $\operatorname{im}\alpha$. The nullity 
of $\alpha$ is $\dim(\ker\alpha)$. ⌟

Claim 3.10. Let $\alpha : V \to W$ be a linear map of finite-dimensional vector spaces. 
Then

$$(\text{rank of } \alpha) + (\text{nullity of } \alpha) = \dim V.$$

Proof. Let $n = \dim V$ and $m = \dim W$. By Proposition 2.10 we can represent $\alpha$ 
by an $m \times n$ matrix of the form

$$\begin{pmatrix} I_r & 0 \\ 0 & 0 \end{pmatrix}.$$

From this representation it is immediate that $\operatorname{rk}\alpha = r$ and the nullity of $\alpha$ is $n - r$, 
with the stated consequence. □

Summarizing, $\operatorname{rk}\alpha$ equals the (column) rank of any matrix $P$ representing $\alpha$; 
similarly, the nullity of $\alpha$ equals ‘$\dim V$ minus the (row) rank’ of $P$. Claim 3.10 is 
the abstract version of the equality of row rank and column rank.
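
Claim 3.10 can be checked on an explicit matrix by producing a basis of the kernel. The sketch below reduces a matrix over $\mathbb Q$ to reduced row echelon form and reads off one kernel vector per non-pivot column; the names `rref` and `kernel_basis` and the sample matrix are illustrative assumptions.

```python
from fractions import Fraction

def rref(matrix):
    """Reduced row echelon form over Q; returns (rows, pivot_columns)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    pivots = []
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        lead = rows[r][c]
        rows[r] = [x / lead for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    return rows, pivots

def kernel_basis(matrix):
    """Basis of {x : matrix * x = 0}, one vector for each non-pivot column."""
    rows, pivots = rref(matrix)
    ncols = len(matrix[0])
    basis = []
    for free in (c for c in range(ncols) if c not in pivots):
        v = [Fraction(0)] * ncols
        v[free] = Fraction(1)
        for r, p in enumerate(pivots):
            v[p] = -rows[r][free]
        basis.append(v)
    return basis

P = [[1, 2, 3, 4], [2, 4, 6, 8], [1, 0, 1, 0]]   # a linear map Q^4 -> Q^3
_, pivot_columns = rref(P)
rk = len(pivot_columns)
kernel = kernel_basis(P)
nullity = len(kernel)
```

Here `rk + nullity` equals the number of columns, that is, the dimension of the source.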

3.4. Euler characteristic and the Grothendieck group. Against our best 
efforts, we cannot resist extending these simple observations to more general com- 
plexes. Claim 3.10 may be reformulated as follows: 

Proposition 3.11. Let

$$0 \to U \to V \to W \to 0$$

be a short exact sequence of finite-dimensional vector spaces. Then

$$\dim(V) = \dim(U) + \dim(W).$$

Equivalently, this amounts to the relation $\dim(V/U) = \dim(V) - \dim(U)$.

Consider then a complex of finite-dimensional vector spaces and linear maps:

$$V_\bullet : \quad 0 \to V_N \xrightarrow{\alpha_N} V_{N-1} \xrightarrow{\alpha_{N-1}} \cdots \xrightarrow{\alpha_2} V_1 \xrightarrow{\alpha_1} V_0 \to 0$$

(cf. §III.7.1). Thus, $\alpha_{i-1} \circ \alpha_i = 0$ for all $i$. This condition is equivalent to the 
requirement that $\operatorname{im}(\alpha_{i+1}) \subseteq \ker(\alpha_i)$; recall that the homology of this complex is 
defined as the collection of spaces

$$H_i(V_\bullet) = \frac{\ker(\alpha_i)}{\operatorname{im}(\alpha_{i+1})}.$$

The complex is exact if $\operatorname{im}(\alpha_{i+1}) = \ker(\alpha_i)$ for all $i$, that is, if $H_i(V_\bullet) = 0$ for all $i$.

Definition 3.12. The Euler characteristic of $V_\bullet$ is the integer

$$\chi(V_\bullet) := \sum_{i=0}^{N} (-1)^i \dim(V_i).$$


The original motivation for the introduction of this number is topological: with 
suitable positions, this Euler characteristic equals the Euler characteristic obtained 
by triangulating a manifold and then computing the number of vertices of the 
triangulation, minus the number of edges, plus the number of faces, etc. 

The following simple result is then a straightforward (and very useful) gener- 
alization of Proposition 3.11: 

Proposition 3.13. With notation as above,

$$\chi(V_\bullet) = \sum_{i=0}^{N} (-1)^i \dim(H_i(V_\bullet)).$$

In particular, if $V_\bullet$ is exact, then $\chi(V_\bullet) = 0$.


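Proof. There is nothing to show for $N = 0$, and the result follows directly from 
Proposition 3.11 if $N = 1$ (Exercise 3.15). Arguing by induction, given a complex

$$V_\bullet : \quad 0 \to V_N \xrightarrow{\alpha_N} V_{N-1} \xrightarrow{\alpha_{N-1}} \cdots \xrightarrow{\alpha_2} V_1 \xrightarrow{\alpha_1} V_0 \to 0,$$

we may assume that the result is known for ‘shorter’ complexes. Consider then the 
truncation

$$V'_\bullet : \quad 0 \to V_{N-1} \xrightarrow{\alpha_{N-1}} \cdots \xrightarrow{\alpha_2} V_1 \xrightarrow{\alpha_1} V_0 \to 0.$$

Then

$$\chi(V_\bullet) = \chi(V'_\bullet) + (-1)^N \dim(V_N),$$

and

$$H_i(V_\bullet) = H_i(V'_\bullet) \quad \text{for } 0 \le i \le N - 2,$$

while

$$H_{N-1}(V'_\bullet) = \ker(\alpha_{N-1}), \quad H_{N-1}(V_\bullet) = \frac{\ker(\alpha_{N-1})}{\operatorname{im}(\alpha_N)}, \quad H_N(V_\bullet) = \ker(\alpha_N).$$

By Proposition 3.11 (cf. Claim 3.10),

$$\dim(V_N) = \dim(\operatorname{im}(\alpha_N)) + \dim(\ker(\alpha_N))$$

and

$$\dim(H_{N-1}(V_\bullet)) = \dim(\ker(\alpha_{N-1})) - \dim(\operatorname{im}(\alpha_N));$$

therefore

$$\dim(H_{N-1}(V'_\bullet)) - \dim(V_N) = \dim(H_{N-1}(V_\bullet)) - \dim(H_N(V_\bullet)).$$

Putting all of this together with the induction hypothesis,

$$\chi(V'_\bullet) = \sum_{i=0}^{N-1} (-1)^i \dim(H_i(V'_\bullet)),$$

gives

$$\begin{aligned}
\chi(V_\bullet) &= \chi(V'_\bullet) + (-1)^N \dim(V_N) \\
&= \sum_{i=0}^{N-1} (-1)^i \dim(H_i(V'_\bullet)) + (-1)^N \dim(V_N) \\
&= \sum_{i=0}^{N-2} (-1)^i \dim(H_i(V'_\bullet)) + (-1)^{N-1}\bigl(\dim(H_{N-1}(V'_\bullet)) - \dim(V_N)\bigr) \\
&= \sum_{i=0}^{N-2} (-1)^i \dim(H_i(V_\bullet)) + (-1)^{N-1}\bigl(\dim(H_{N-1}(V_\bullet)) - \dim(H_N(V_\bullet))\bigr) \\
&= \sum_{i=0}^{N} (-1)^i \dim(H_i(V_\bullet))
\end{aligned}$$

as needed. □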

In terms of the topological motivation recalled above, Proposition 3.13 tells us 
that the Euler characteristic of a manifold may be computed as the alternating sum 
of the ranks of its homology, that is, of its Betti numbers.
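
To see Proposition 3.13 at work on a concrete complex, the sketch below computes both alternating sums for the simplicial chain complexes of a hollow and a filled triangle over $\mathbb Q$. The boundary matrices, the orientation conventions, and all function names are illustrative assumptions; the dimension of $H_i$ is obtained as $\dim V_i - \operatorname{rk}\alpha_i - \operatorname{rk}\alpha_{i+1}$, which is Claim 3.10 applied to $\alpha_i$.

```python
from fractions import Fraction

def rank(matrix):
    """Rank over Q by Gaussian elimination on rows."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

def euler_characteristics(dims, maps):
    """dims[i] = dim V_i; maps[i] is the matrix of alpha_{i+1} : V_{i+1} -> V_i.

    Returns (chi from dimensions, chi from homology, list of dim H_i).
    """
    top = len(dims) - 1
    # ranks[i] = rk(alpha_i); alpha_0 and alpha_{top+1} are zero maps.
    ranks = [0] + [rank(m) for m in maps] + [0]
    homology = [dims[i] - ranks[i] - ranks[i + 1] for i in range(top + 1)]
    chi_dims = sum((-1) ** i * d for i, d in enumerate(dims))
    chi_homology = sum((-1) ** i * h for i, h in enumerate(homology))
    return chi_dims, chi_homology, homology

# Vertices v0, v1, v2; edges e01, e02, e12; each edge maps to (end - start).
d1 = [[-1, -1, 0],
      [1, 0, -1],
      [0, 1, 1]]
# The single face maps to e01 - e02 + e12, so that d1 composed with d2 is zero.
d2 = [[1], [-1], [1]]

circle = euler_characteristics([3, 3], [d1])        # hollow triangle
disk = euler_characteristics([3, 3, 1], [d1, d2])   # filled triangle
```

For the hollow triangle both sums vanish, with $\dim H_0 = \dim H_1 = 1$; for the filled triangle both sums equal $1$.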

Having come this far, we cannot refrain from mentioning the next, equally 
simple-minded, generalization. The reader has surely noticed that the only tool 
used in the proof of Proposition 3.13 was the ‘additivity’ property of dimension, 
established in Proposition 3.11: if

$$0 \to U \to V \to W \to 0$$

is exact, then

$$\dim(V) = \dim(U) + \dim(W).$$

Proposition 3.13 is a formal consequence of this one property of dim.

With this in mind, we can reinterpret what we have just done in the following 
curious way. Consider the category $k\text{-}\mathsf{Vect}^f$ of finite-dimensional $k$-vector spaces. 
Each object $V$ of $k\text{-}\mathsf{Vect}^f$ determines an isomorphism class $[V]$. Let $F(k\text{-}\mathsf{Vect}^f)$ be 
the free abelian group on the set of these isomorphism classes; further, let $E$ be the 
subgroup generated by the elements

$$[V] - [U] - [W]$$

for all short exact sequences

$$0 \to U \to V \to W \to 0$$

in $k\text{-}\mathsf{Vect}^f$. The quotient group

$$K(k\text{-}\mathsf{Vect}^f) := \frac{F(k\text{-}\mathsf{Vect}^f)}{E}$$

is called the Grothendieck group of the category $k\text{-}\mathsf{Vect}^f$. The element determined 
by $V$ in the Grothendieck group is still denoted $[V]$.

More generally, a Grothendieck group may be defined for any category admitting 
a notion of exact sequence. 

Every complex $V_\bullet$ determines an element in $K(k\text{-}\mathsf{Vect}^f)$, namely

$$\chi_K(V_\bullet) := \sum_i (-1)^i [V_i] \in K(k\text{-}\mathsf{Vect}^f).$$

Claim 3.14. With notation as above, we have the following:

• $\chi_K$ ‘is an Euler characteristic’, in the sense that it satisfies the formula given 
in Proposition 3.13:

$$\chi_K(V_\bullet) = \sum_i (-1)^i [H_i(V_\bullet)].$$

• $\chi_K$ is a ‘universal Euler characteristic’, in the following sense. Let $G$ be an 
abelian group, and let $\delta$ be a function associating an element of $G$ to each 
finite-dimensional vector space, such that $\delta(V) = \delta(V')$ if $V \cong V'$ and $\delta(V/U) = 
\delta(V) - \delta(U)$. For $V_\bullet$ a complex, define

$$\chi_G(V_\bullet) = \sum_i (-1)^i \delta(V_i).$$

Then $\delta$ induces a (unique) group homomorphism

$$K(k\text{-}\mathsf{Vect}^f) \to G$$

mapping $\chi_K(V_\bullet)$ to $\chi_G(V_\bullet)$.

• In particular, $\delta = \dim$ induces a group homomorphism

$$K(k\text{-}\mathsf{Vect}^f) \to \mathbb Z$$

such that $\chi_K(V_\bullet) \mapsto \chi(V_\bullet)$.

• This is in fact an isomorphism.

The best way to convince the reader that this impressive claim is completely 
trivial is to leave its proof to the reader (Exercise 3.16). The first point is proved 
by adapting the proof of Proposition 3.13; the second point is a harmless mixture 
of universal properties; the third point follows from the second; and the last point 
follows from the fact that $\dim(k) = 1$ and the IBN property.

The last point is, in fact, rather anticlimactic: if the impressively abstract 
Grothendieck group turns out to just be a copy of the integers, why bother defining 
it? The answer is, of course, that this only stresses how special the category 
$k\text{-}\mathsf{Vect}^f$ is. The definition of the Grothendieck group can be given in any context in 
which complexes and a notion of exactness are available (for example, in the category 
of finitely generated²⁰ modules over any ring). The formal arguments proving 
Claim 3.14 will go through in any such context and provide us with a useful notion 
of ‘universal Euler characteristic’.

We will come back to complexes and homology in Chapter IX. 


Exercises 


3.1. Use Gaussian elimination to find all integer solutions of the system of equations

$$\begin{cases} 7x - 36y + 12z = 1, \\ -8x + 42y - 14z = 2. \end{cases}$$

3.2. ▷ Provide details for the proof of Lemma 3.2. [§3.2]

3.3. Redo Exercise II.8.8.

3.4. Formalize the discussion of ‘universal identities’: by what cocktail of universal 
properties is it true that if an identity holds in $\mathbb Z[x_1, \dots, x_r]$, then it holds over 
every commutative ring $R$, for every choice of $x_i \in R$? (Is the commutativity of $R$ 
necessary?)

3.5. ▷ Let $A$ be an $n \times n$ square invertible matrix with entries in a field, and 
consider the $n \times (2n)$ matrix $B = (A \mid I_n)$ obtained by placing the identity matrix 
to the side of $A$. Perform elementary row operations on $B$ so as to reduce $A$ to $I_n$ 
(cf. Exercise 2.15). Prove that this transforms $B$ into $(I_n \mid A^{-1})$.

(This is a much more efficient way to compute the inverse of a matrix than by 
using determinants as in §3.2.) [§2.3, §3.2]

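3.6. ¬ Let $R$ be a commutative ring and $M = \langle m_1, \dots, m_r \rangle$ a finitely generated 
$R$-module. Let $A \in \mathcal M_r(R)$ be a matrix such that

$$A \cdot \begin{pmatrix} m_1 \\ \vdots \\ m_r \end{pmatrix} = \begin{pmatrix} 0 \\ \vdots \\ 0 \end{pmatrix}.$$

Prove that $\det(A)\, m = 0$ for all $m \in M$. (Hint: Multiply by the adjoint.) [3.7]

3.7. ¬ Let $R$ be a commutative ring, $M$ a finitely generated $R$-module, and let $J$ 
be an ideal of $R$. Assume $JM = M$. Prove that there exists an element $b \in J$ such 
that $(1 + b)M = 0$. (Let $m_1, \dots, m_r$ be generators for $M$. Find an $r \times r$ matrix 
$B$ with entries in $J$ such that

$$\begin{pmatrix} m_1 \\ \vdots \\ m_r \end{pmatrix} = B \cdot \begin{pmatrix} m_1 \\ \vdots \\ m_r \end{pmatrix}.$$

Then use Exercise 3.6.) [3.8, VIII.1.18]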




20 Note that some finiteness condition is likely to be necessary. We cannot define a 
‘Grothendieck group of $k\text{-}\mathsf{Vect}$’ because the isomorphism classes of objects in $k\text{-}\mathsf{Vect}$ do not form 
a set: there is one for each cardinal number, and cardinal numbers do not form a set.







3.8. ¬ Let $R$ be a commutative ring, $M$ be a finitely generated $R$-module, and let 
$J$ be an ideal of $R$ contained in the Jacobson radical of $R$ (Exercise V.3.14). Prove 
that $M = 0 \iff JM = M$. (Use Exercise 3.7. This is Nakayama’s lemma, a 
result with important applications in commutative algebra and algebraic geometry. 
A particular case was given as Exercise III.5.16.) [III.5.16, 3.9, 5.5]

3.9. ¬ Let $R$ be a commutative local ring, that is, a ring with a single maximal 
ideal $\mathfrak m$, and let $M$, $N$ be finitely generated $R$-modules. Prove that if $M = \mathfrak m M + N$, 
then $M = N$. (Apply Nakayama’s lemma, that is, Exercise 3.8, to $M/N$. Note 
that the Jacobson radical of $R$ is $\mathfrak m$.) [3.10]

3.10. ¬ Let $R$ be a commutative local ring, and let $M$ be a finitely generated 
$R$-module. Note that $M/\mathfrak m M$ is a finite-dimensional vector space over the field $R/\mathfrak m$; 
let $m_1, \dots, m_r \in M$ be elements whose cosets mod $\mathfrak m M$ form a basis of $M/\mathfrak m M$. 
Prove that $m_1, \dots, m_r$ generate $M$.

(Show that $\langle m_1, \dots, m_r \rangle + \mathfrak m M = M$; then apply Nakayama’s lemma in the 
form of Exercise 3.9.) [5.5, VIII.2.24]

3.11. Explain how to use Gaussian elimination to find bases for the row space and 
the column space of a matrix over a field.

3.12. ¬ Let $R$ be an integral domain, and let $M \in \mathcal M_{m,n}(R)$, with $m < n$. Prove 
that the columns of $M$ are linearly dependent over $R$. [5.6]

3.13. Let $k$ be a field. Prove that a matrix $M \in \mathcal M_{m,n}(k)$ has rank $\le r$ if and only 
if there exist matrices $P \in \mathcal M_{m,r}(k)$, $Q \in \mathcal M_{r,n}(k)$ such that $M = PQ$. (Thus the 
rank of $M$ is the smallest such integer.)

3.14. Generalize Proposition 3.11 to the case of finitely generated free modules 
over any integral domain. (Embed the integral domain in its field of fractions.)

3.15. ▷ Prove Proposition 3.13 for the case $N = 1$. [§3.4]

3.16. ▷ Prove Claim 3.14. [§3.4]

3.17. Extend the definition of Grothendieck group of vector spaces given in §3.4 to 
the category of vector spaces of countable (possibly infinite) dimension, and prove 
that it is the trivial group.

3.18. Let $\mathsf{Ab}^{fg}$ be the category of finitely generated abelian groups. Define a 
Grothendieck group of this category in the style of the construction of $K(k\text{-}\mathsf{Vect}^f)$, 
and prove that $K(\mathsf{Ab}^{fg}) \cong \mathbb Z$.

3.19. ¬ Let $\mathsf{Ab}^f$ be the category of finite abelian groups. Prove that assigning to every 
finite abelian group its order extends to a homomorphism from the Grothendieck 
group $K(\mathsf{Ab}^f)$ to the multiplicative group $(\mathbb Q^*, \cdot)$. [3.20]

3.20. Let $R\text{-}\mathsf{Mod}^f$ be the category of modules of finite length (cf. Exercise 1.16) 
over a ring $R$. Let $G$ be an abelian group, and let $\delta$ be a function assigning an 
element of $G$ to every simple $R$-module. Prove that $\delta$ extends to a homomorphism 
from the Grothendieck group of $R\text{-}\mathsf{Mod}^f$ to $G$.

Explain why Exercise 3.19 is a particular case of this observation.

(For another example, letting $\delta(M) = 1 \in \mathbb Z$ for every simple module $M$ shows 
that length itself extends to a homomorphism from the Grothendieck group of 
$R\text{-}\mathsf{Mod}^f$ to $\mathbb Z$.)


4. Presentations and resolutions 

After this excursion into the idyllic world of free modules, we can come back to earth 
and see if we have learned something that may be useful for more general situations. 
Modules over a field are necessarily free (Proposition 1.7), not so for modules over 
more general rings. In fact, this property will turn out to be a characterization of 
fields (Proposition 4.10). 

It is important that we develop some understanding of nonfree modules. In this 
section we will see that homomorphisms of free modules carry enough information 
to allow us to deal with many nonfree modules. 

4.1. Torsion. There are several ways in which a module M may fail to be free: 
the most spectacular one is that M may have torsion. 

Definition 4.1. Let $M$ be an $R$-module. An element $m \in M$ is a torsion element 
if $\{m\}$ is linearly dependent, that is, if $\exists r \in R$, $r \neq 0$, such that $rm = 0$. The 
subset of torsion elements of $M$ is denoted $\operatorname{Tor}_R(M)$. A module $M$ is torsion-free 
if $\operatorname{Tor}_R(M) = \{0\}$. A torsion module is a module $M$ in which every element is a 
torsion element. ⌟

The subscript $R$ is usually omitted if there is no uncertainty about the base 
ring.

A commutative ring is torsion-free as a module over itself if and only if it 
is an integral domain; this is a good reason to limit the discussion to integral 
domains in this chapter. Also, the reader will check (Exercise 4.1) that if R is an 
integral domain, then Tor(M) is a submodule of M. Equally easy is the following 
observation: 

Lemma 4.2. Submodules and direct sums of torsion-free modules are torsion-free. 
Free modules over an integral domain are torsion-free.

Proof. The first statement is immediate; the second follows from the first, since 
an integral domain is torsion-free as a module over itself. □

Lemma 4.2 gives a good source of torsion-free modules: for example, ideals 
in an integral domain $R$ are torsion-free (because they are submodules of the free 
module $R^1$). In fact, ideals provide us with examples of another mechanism in 
which a module may fail to be free.

Example 4.3. Let $R = \mathbb Z[x]$, and let $I = (2, x)$. Then $I$ is not a free $R$-module. 
More generally, let $I$ be any nonprincipal ideal of an integral domain $R$; then $I$ is 
a torsion-free module which is not free.

Indeed, if $I$ were free, then its rank would have to be 1 at most, by Proposition 1.9 
(a basis for $I$ would be a linearly independent subset of $R$, and $R$ has 
rank 1 over itself); thus one element would suffice to generate $I$, and $I$ would be 
principal. ⌟

What is behind this example is a characterization of PIDs in terms of ‘torsion-free 
submodules of a rank-1 free module’ (Exercise 4.3). This is a facet of the main 
result towards which we are heading, that is, the classification of finitely generated 
modules over PIDs (Theorem 5.6). The gist of this classification is that finitely 
generated modules over a PID can be decomposed into cyclic modules. We have 
essentially proved this fact already for modules over Euclidean domains (it follows 
from Proposition 2.11: see Exercise 2.19), and we have looked in great detail at the 
particular case of $\mathbb Z$-modules, a.k.a. abelian groups (§IV.6); we are almost ready to 
deal with the general case of PIDs.

Definition 4.4. An $R$-module $M$ is cyclic if it is generated by a singleton, that is, 
if $M \cong R/I$ for some ideal $I$ of $R$. ⌟

The equivalence in the definition is hopefully clear to our reader, as an immedi- 
ate consequence of the first isomorphism theorem for modules (Corollary III.5.16). 
If not, go back and (re)do Exercise III.6.16. 

Cyclic modules are witness to the difference between fields and more general 
rings: over a field $k$, a cyclic module is just a 1-dimensional vector space, that is, a 
‘copy of $k$’; over more general rings, cyclic modules may be very interesting (think 
of the many hours spent contemplating cyclic groups). In fact, we can tell that a 
ring is a field by just looking at its cyclic modules:

Lemma 4.5. Let $R$ be an integral domain. Assume that every cyclic $R$-module is 
torsion-free. Then $R$ is a field.

Proof. Let $c \in R$, $c \neq 0$; then $M = R/(c)$ is a cyclic module. Note that $\operatorname{Tor}(M) = M$: 
indeed, the class of 1 generates $R/(c)$ and belongs to $\operatorname{Tor}(M)$ since $c \cdot 1$ is $0$ 
mod $(c)$ and $c \neq 0$. However, by hypothesis $M$ is torsion-free; that is, $\operatorname{Tor}(M) = 
\{0\}$. Therefore $M = \operatorname{Tor}(M)$ is the zero module.

This shows $R/(c)$ is the zero $R$-module; that is, $(c) = (1)$. Therefore, $c$ is a 
unit. Thus every nonzero element of $R$ is a unit, proving that $R$ is a field. □

Lemma 4.5 is a simple-minded illustration of the fact that we can study a ring $R$ 
by studying the module structure over $R$, that is, the category $R\text{-}\mathsf{Mod}$, and that 
we may not even need to look at the whole of $R\text{-}\mathsf{Mod}$ to be able to draw strong 
conclusions about $R$.


4.2. Finitely presented modules and free resolutions. The ‘right’ way to 
think of a cyclic $R$-module $M$ is as a module which admits an epimorphism from $R$, 
viewing the latter as the free rank-1 $R$-module²¹:

$$R^1 \to M \to 0.$$

The fact that $M$ is surjected upon by a free $R$-module is nothing special. In fact, 
every module $M$ admits such an epimorphism:

$$R^{\oplus A} \to M \to 0$$

provided that we are willing to take $A$ large enough; if we are desperate, $A = M$ 
will surely do. This is immediate from the universal property of free modules; if 
the reader does not agree, it is time to go back and review §III.6.3. What makes 
cyclic modules special is that $A$ can be chosen to be a singleton.

We are now going to focus on a case which is also special, but not quite as 
special as cyclic modules: finitely generated modules are modules for which we can 
choose $A$ to be a finite set (cf. §III.6.4). Thus, we will assume that $M$ admits an 
epimorphism from a finite-rank free module:

$$R^m \xrightarrow{\pi} M \to 0$$

for some integer $m$. The image by $\pi$ of the $m$ vectors in a basis of $R^m$ is a set of 
generators for $M$.

Finitely generated modules are much easier to handle than arbitrary modules. 
For example, an ideal of R can tell us whether a finitely generated module is torsion. 

Definition 4.6. The annihilator of an $R$-module $M$ is

$$\operatorname{Ann}_R(M) := \{ r \in R \mid \forall m \in M,\ rm = 0 \}.$$ ⌟

The subscript is usually omitted. The reader will check (Exercise 4.4) that 
$\operatorname{Ann}(M)$ is an ideal of $R$ and that if $M$ is a finitely generated module and $R$ is an 
integral domain, then $M$ is torsion if and only if $\operatorname{Ann}(M) \neq 0$.
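
For a concrete instance of Definition 4.6, take $R = \mathbb Z$ and a finite abelian group $M = \mathbb Z/n_1 \oplus \cdots \oplus \mathbb Z/n_k$. The brute-force sketch below searches for the smallest positive integer killing every element of $M$; since ideals of $\mathbb Z$ are principal, that integer generates $\operatorname{Ann}(M)$. The function name and the sample groups are illustrative assumptions.

```python
from itertools import product

def annihilator_generator(moduli):
    """Smallest r > 0 with r * m = 0 for all m in Z/n_1 + ... + Z/n_k.

    Every n_i is assumed to be a positive integer, so such an r exists.
    """
    elements = list(product(*(range(n) for n in moduli)))
    r = 1
    while not all((r * x) % n == 0 for m in elements for x, n in zip(m, moduli)):
        r += 1
    return r

generator = annihilator_generator([4, 6])   # M = Z/4 + Z/6
```

For $\mathbb Z/4 \oplus \mathbb Z/6$ the search returns $12$, the least common multiple of $4$ and $6$; the annihilator is nonzero, in agreement with the fact that this finitely generated module is torsion.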

We would like to develop tools to deal with finitely generated modules. It 
turns out that matrices allow us to describe a comfortably large collection of such 
modules. 

Definition 4.7. An $R$-module $M$ is finitely presented if for some positive integers 
$m$, $n$ there is an exact sequence

$$R^n \xrightarrow{\varphi} R^m \to M \to 0.$$

Such a sequence is called a presentation of $M$. ⌟

In other words, finitely presented modules are cokernels (cf. III.6.2) of homomorphisms 
between finitely generated free modules. Everything about $M$ must 
be encoded in the homomorphism $\varphi$; therefore, we should be able to describe the 
module $M$ by studying the matrix corresponding to $\varphi$.

There is a gap between finitely presented modules and finitely generated mod- 
ules, but on reasonable rings the two notions coincide: 

21 In context the exactness of a sequence of $R$-modules will be understood, so the displayed 
sequence is a way to denote the fact that there exists a surjective homomorphism of $R$-modules 
from $R$ to $M$; cf. Example III.7.2. Also note the convention of denoting $R$ by $R^1$ when it is viewed 
as a module over itself.

Lemma 4.8. If $R$ is a Noetherian ring, then every finitely generated $R$-module is 
finitely presented.

Proof. If $M$ is a finitely generated module, there is an exact sequence

$$R^m \xrightarrow{\pi} M \to 0$$

for some $m$. Since $R$ is Noetherian, $R^m$ is Noetherian as an $R$-module (Corollary III.6.8). 
Thus $\ker\pi$ is finitely generated; that is, there is an exact sequence

$$R^n \to \ker\pi \to 0$$

for some $n$. Putting together the two sequences gives a presentation of $M$. □

Once we have gone one step to obtain generators and two steps to get a pre- 
sentation, we should hit upon the idea to keep going: 

Definition 4.9. A resolution of an $R$-module $M$ by finitely generated free modules 
is an exact complex

$$\cdots \to R^{m_2} \to R^{m_1} \to R^{m_0} \to M \to 0.$$ ⌟

Iterating the argument proving Lemma 4.8 shows that if $R$ is Noetherian, then 
every finitely generated module has a resolution as in Definition 4.9.

It is an important conceptual step to realize that $M$ may be studied by studying 
an exact complex of free modules

$$\cdots \to R^{m_3} \to R^{m_2} \to R^{m_1} \to R^{m_0}$$

resolving $M$, that is, such that $M$ is the cokernel of the last map. The $R^{m_0}$ piece 
keeps track of the generators of $M$; $R^{m_1}$ accounts for the relations among these 
generators; $R^{m_2}$ records relations among the relations; and so on.

Developing this idea in full generality would take us too far for now: for example, 
we would have to deal with the fact that every module admits many different 
resolutions (for example, we can bump up every $m_i$ by one by direct-summing each 
term in the complex with a copy of $R^1$, sent to itself by the maps in the complex). 
We will do this very carefully later on, in Chapter IX.

However, we can already learn something by considering coarse questions, such 
as ‘how long’ a resolution can be. A priori, there is no reason to expect a free 
resolution to be ‘finite’, that is, such that $m_i = 0$ for $i \gg 0$. Such finiteness 
conditions tell us something special about the base ring $R$.

The first natural question of this type is, for which rings $R$ is it the case that 
every finitely generated $R$-module $M$ has a free resolution ‘of length 0’, that is, 
stopping at $m_0$? That would mean that there is an exact sequence

$$0 \to R^{m_0} \to M \to 0.$$

Therefore, $M$ itself must be free. What does this say about $R$?

Proposition 4.10. Let $R$ be an integral domain. Then $R$ is a field if and only if 
every finitely generated $R$-module is free.

Proof. If $R$ is a field, then every $R$-module is free, by Proposition 1.7. For the 
converse, assume that every finitely generated $R$-module is free; in particular, every 
cyclic module is free; in particular, every cyclic module is torsion-free. But then $R$ 
is a field, by Lemma 4.5. □

The next natural question concerns rings for which finitely generated modules 
admit free resolutions of length 1. It is convenient to phrase the question in stronger 
terms, that is, to require that for every finitely generated $R$-module $M$ and every 
beginning of a free resolution

$$R^{m_0} \xrightarrow{\pi} M \to 0,$$

the resolution can be completed to a length 1 free resolution. This would amount 
to demanding that there exist an integer $m_1$ and an $R$-module homomorphism 
$R^{m_1} \to R^{m_0}$ such that the sequence

$$0 \to R^{m_1} \to R^{m_0} \xrightarrow{\pi} M \to 0$$

is exact. Equivalently, this condition requires that the module $\ker\pi$ of relations 
among the $m_0$ generators necessarily be free.

Claim 4.11. Let $R$ be an integral domain satisfying this property. Then $R$ is 
a PID.

Proof. Let $I$ be an ideal of $R$, and apply the condition to $M = R/I$. Since we 
have an epimorphism

$$R^1 \xrightarrow{\pi} R/I \to 0,$$

the condition says that $\ker\pi$ is free; that is, $I$ is free. Since $I$ is a free submodule 
of $R$, which is free of rank 1, $I$ must be free of rank $\le 1$ by Proposition 1.9. 
Therefore $I$ is generated by one element, as needed. □

The classification result for finitely generated modules over PIDs (Theorem 5.6), 
which we keep bringing up, will essentially be a converse to Claim 4.11: the mys- 
terious condition requiring free resolutions of finitely generated modules to have 
length at most 1 turns out to be a characterization of PIDs, just as the length 0 
condition is a characterization of fields (as proved in Proposition 4.10). We will 
work this out in §5.2. 

4.3. Reading a presentation. Let us return to the brilliant idea of studying a 
finitely presented module $M$ by studying a homomorphism of free modules

(*)   $\varphi : R^n \to R^m$

such that $M = \operatorname{coker}\varphi$. As we know, we can describe $\varphi$ completely by considering 
a matrix $A$ representing it, and therefore we can describe any finitely presented 
module by giving a matrix corresponding to (a homomorphism corresponding to) it.

In many cases, judicious use of the material developed in §2 allows us to determine 
the module $M$ explicitly. For example, take



this matrix corresponds to a homomorphism $\mathbb Z^2 \to \mathbb Z^3$, hence to a $\mathbb Z$-module, that is, 
a finitely generated abelian group $G$. The reader should figure out what $G$ is more 
explicitly (in terms of the classification of §IV.6; cf. Exercise 2.19) before reading 
on. In the rest of this section we will simply tie up loose ends into a more concrete 
recipe to perform these operations.

Incidentally, a number of software packages can perform sophisticated oper- 
ations on modules (say, over polynomial rings); a personal favorite is Macaulay2. 
These packages rely precisely on the correspondence between modules and matrices: 
with due care, every operation on modules (such as direct sums, tensors, quotients, 
etc.) can be executed on the corresponding matrices. For example:

Lemma 4.12. Let $A$, $B$ be matrices with entries in an integral domain $R$, and let 
$M$, $N$ denote the corresponding $R$-modules. Then $M \oplus N$ corresponds to the block 
matrix

$$\begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix}.$$

Proof. This follows immediately from Exercise 4.16. □
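
The block matrix of Lemma 4.12 is straightforward to assemble. In the sketch below matrices are lists of rows, and the helper name `block_sum` and the two sample presentations are illustrative assumptions.

```python
def block_sum(a, b):
    """Block matrix with a in the top-left corner and b in the bottom-right corner."""
    a_cols = len(a[0]) if a else 0
    b_cols = len(b[0]) if b else 0
    top = [list(row) + [0] * b_cols for row in a]
    bottom = [[0] * a_cols + list(row) for row in b]
    return top + bottom

A = [[2]]          # Z -> Z, 1 |-> 2: presents Z/2Z
B = [[3], [0]]     # Z -> Z^2, 1 |-> (3, 0): presents Z/3Z + Z
C = block_sum(A, B)
```

Here `C` is the $3 \times 2$ matrix of a homomorphism $\mathbb Z^2 \to \mathbb Z^3$ whose cokernel is $\mathbb Z/2\mathbb Z \oplus \mathbb Z/3\mathbb Z \oplus \mathbb Z$.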


Coming back to (*), note that the module $M$ cannot know which bases we 
have chosen for $R^n$ or $R^m$; that is, $M = \operatorname{coker}\varphi$ really depends on the homomorphism 
$\varphi$, not on the specific matrix representation we have chosen for $\varphi$. This is an 
issue that we have already encountered, and treated rather thoroughly, in §2.2 and 
following: ‘equivalent’ matrices represent the same homomorphism and hence the 
same module. In the context we are exploring now, Proposition 2.5 tells us that 
two matrices $A$, $B$ represent the same module $M$ if there exist invertible matrices 
$P$, $Q$ such that $B = PAQ$.

But this is not the whole story. Two different homomorphisms $\varphi_1$, $\varphi_2$ may 
have isomorphic cokernels, even if they act between different modules: the extreme 
case being any isomorphism

$$R^m \to R^m,$$

whose cokernel is 0 (regardless of the isomorphism and no matter what $m$ is). 
Therefore, if a matrix $A'$ corresponds to a module $M$, then (by Lemma 4.12) so 
does the block matrix

$$A = \begin{pmatrix} I_r & 0 \\ 0 & A' \end{pmatrix},$$

where $I_r$ is the $r \times r$ identity matrix (and $r$ is any nonnegative integer); in fact, $I_r$ 
could be replaced here by any invertible matrix.

The following proposition attempts to formalize these observations.

Proposition 4.13. Let $A$ be a matrix with entries in an integral domain $R$, and 
let $B$ be obtained from $A$ by any sequence of the following operations:

• switch two rows or two columns;

• add to one row (resp., column) a multiple of another row (resp., column);

• multiply all entries in one row (or column) by a unit of $R$;

• if a unit is the only nonzero entry in a row (or column), remove the row and 
column containing that entry.

Then $B$ represents the same $R$-module as $A$, up to isomorphism.

Proof. The first three operations are the ‘elementary operations’ of §2.3, and they 
transform a matrix into an equivalent one (by Proposition 2.7); as observed above, 
this does not affect the corresponding module, up to isomorphism.

As for the fourth operation, if $u$ is a unit and the only nonzero entry in (say) a 
row, then by applications of the second elementary operation we may assume that 
$u$ is also the only nonzero entry in its column; without loss of generality we may 
assume that $u$ is in fact the (1,1) entry of the matrix; that is, the matrix is in block 
form:

$$A = \begin{pmatrix} u & 0 \\ 0 & A' \end{pmatrix}.$$

But then $A$ and $A'$ represent the same module, as needed. □


Example 4.14. The matrix with integer entries 



determines an abelian group G. Subtract three times the first column from the 
second column, obtaining 



the (1, 1) entry is a unit and the only nonzero entry in the first row, so we can 
remove the first row and column:

$$\begin{pmatrix} -3 \\ -6 \end{pmatrix};$$

now change the sign, and subtract twice the first row from the second, leaving

$$\begin{pmatrix} 3 \\ 0 \end{pmatrix}.$$

Therefore $G$ is isomorphic to the cokernel of the homomorphism

$$\varphi : \mathbb Z \to \mathbb Z \oplus \mathbb Z$$

mapping 1 to (3,0). This homomorphism is injective and identifies $\mathbb Z$ with the 
subgroup $3\mathbb Z \oplus 0$ of the target. Therefore

$$G \cong \operatorname{coker}\varphi \cong \frac{\mathbb Z \oplus \mathbb Z}{3\mathbb Z \oplus 0} \cong \frac{\mathbb Z}{3\mathbb Z} \oplus \mathbb Z.$$ ⌟

By virtue of Gaussian elimination, the ‘algorithm’ implicitly described in Proposition 4.13 
will work without fail over Euclidean domains (e.g., over the polynomial 
ring in one variable over a field), in the sense that it will identify the finitely 
generated module corresponding to a matrix with an explicit direct sum of cyclic 
modules, as in Example 4.14. This is too much to expect over more general rings, 
since in general elementary transformations do not generate GL; cf. Remark 2.12.
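
Over $\mathbb Z$ the procedure can be sketched in a few lines. The code below uses only row and column switches and additions of integer multiples (the first two operations of Proposition 4.13) to bring an integer matrix to diagonal form, and then reads off the cokernel as a direct sum of cyclic groups. The function names are illustrative, and the sample matrix is an assumption: it is one $3 \times 2$ integer matrix consistent with the steps described in Example 4.14, not necessarily the matrix printed there.

```python
def diagonalize(matrix):
    """Absolute values of the nonzero diagonal entries of a diagonal form over Z."""
    a = [list(row) for row in matrix]
    m, n = len(a), len(a[0])
    diag = []
    t = 0
    while t < min(m, n):
        # Nonzero entry of least absolute value in the remaining block.
        candidates = [(abs(a[i][j]), i, j)
                      for i in range(t, m) for j in range(t, n) if a[i][j] != 0]
        if not candidates:
            break
        _, i, j = min(candidates)
        a[t], a[i] = a[i], a[t]                 # switch two rows
        for row in a:                           # switch two columns
            row[t], row[j] = row[j], row[t]
        pivot = a[t][t]
        for i in range(t + 1, m):               # reduce column t modulo the pivot
            q = a[i][t] // pivot
            a[i] = [x - q * y for x, y in zip(a[i], a[t])]
        for j in range(t + 1, n):               # reduce row t modulo the pivot
            q = a[t][j] // pivot
            for row in a:
                row[j] -= q * row[t]
        cleared = (all(a[i][t] == 0 for i in range(t + 1, m))
                   and all(a[t][j] == 0 for j in range(t + 1, n)))
        if cleared:
            diag.append(abs(pivot))
            t += 1
        # Otherwise a smaller nonzero remainder has appeared; repeat with it.
    return diag

def cokernel(matrix):
    """(orders of the nontrivial finite cyclic summands, rank of the free part)."""
    diag = diagonalize(matrix)
    return [d for d in diag if d != 1], len(matrix) - len(diag)

# Hypothetical sample matrix of a homomorphism Z^2 -> Z^3.
sample = [[1, 3], [2, 3], [5, 9]]
torsion, free_rank = cokernel(sample)
```

For this sample the result is one cyclic summand of order $3$ and a free part of rank $1$, that is, $\mathbb Z/3\mathbb Z \oplus \mathbb Z$. The loop terminates because each unsuccessful pass produces a remainder of strictly smaller absolute value, which is exactly where the Euclidean property is used.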


Exercises 


4.1. > Prove that if E is an integral domain and M is an f?- module, then Tor (M) 
is a submodule of M. Give an example showing that the hypothesis that E is an 
integral domain is necessary. [§4.1j 

4.2. > Let M be a module over an integral domain R, and let N be a torsion-free 
module. Prove that Homj?(M, IV) is torsion-free. In particular, Hoiur(M, E) is 
torsion-free. (We will mu into this fact again; see Proposition VIII.5.16.) [§VIII.5.5] 

4.3. > Prove that an integral domain E is a PID if and only if every submodule 
of R itself is free. [§4.1, 5.13] 

4.4. ▷ Let $R$ be a commutative ring and $M$ an $R$-module.

• Prove that $\operatorname{Ann}(M)$ is an ideal of $R$.

• If $R$ is an integral domain and $M$ is finitely generated, prove that $M$ is torsion
if and only if $\operatorname{Ann}(M) \neq 0$.

• Give an example of a torsion module $M$ over an integral domain, such that
$\operatorname{Ann}(M) = 0$. (Of course this example cannot be finitely generated!)

[§4.2, §5.3]

4.5. ¬ Let $M$ be a module over a commutative ring $R$. Prove that an ideal $I$ of $R$
is the annihilator of an element of $M$ if and only if $M$ contains an isomorphic copy
of $R/I$ (viewed as an $R$-module).

The associated primes of $M$ are the prime ideals among the ideals $\operatorname{Ann}(m)$,
for $m \in M$. The set of the associated primes of a module $M$ is denoted $\operatorname{Ass}_R(M)$.
Note that every prime in $\operatorname{Ass}_R(M)$ contains $\operatorname{Ann}_R(M)$. [4.6, 4.7, 5.16]

4.6. ¬ Let $M$ be a module over a commutative ring $R$, and consider the family
of ideals $\operatorname{Ann}(m)$, as $m$ ranges over the nonzero elements of $M$. Prove that the
maximal elements in this family are prime ideals of $R$. Conclude that if $R$ is
Noetherian, then $\operatorname{Ass}_R(M) \neq \emptyset$ (cf. Exercise 4.5). [4.7, 4.9]

4.7. ¬ Let $R$ be a commutative Noetherian ring, and let $M$ be a finitely generated
module over $R$. Prove that $M$ admits a finite series

\[ M = M_0 \supsetneq M_1 \supsetneq \cdots \supsetneq M_m = (0) \]

in which all quotients $M_i/M_{i+1}$ are of the form $R/\mathfrak{p}$ for some prime ideal $\mathfrak{p}$ of $R$.
(Hint: Use Exercises 4.5 and 4.6 to show that $M$ contains an isomorphic copy $M'$
of $R/\mathfrak{p}_1$ for some prime $\mathfrak{p}_1$. Then do the same with $M/M'$, producing an $M'' \supsetneq M'$
such that $M''/M' \cong R/\mathfrak{p}_2$ for some prime $\mathfrak{p}_2$. Why must this process stop after
finitely many steps?) [4.8]

4.8. Let $R$ be a commutative Noetherian ring, and let $M$ be a finitely generated
module over $R$. Prove that every prime in $\operatorname{Ass}_R(M)$ appears in the list of primes
produced by the procedure presented in Exercise 4.7. (If $\mathfrak{p}$ is an associated prime,
then $M$ contains an isomorphic copy $N$ of $R/\mathfrak{p}$. With notation as in the hint in
Exercise 4.7, prove that either $\mathfrak{p}_1 = \mathfrak{p}$ or $N \cap M' = 0$. In the latter case, $N$ maps
isomorphically to a copy of $R/\mathfrak{p}$ in $M/M'$; iterate the reasoning.)

In particular, if $M$ is a finitely generated module over a Noetherian ring, then
$\operatorname{Ass}(M)$ is finite.

4.9. Let M be a module over a commutative Noetherian ring R. Prove that the 
union of all annihilators of nonzero elements of M equals the union of all associated 
primes of M. (Use Exercise 4.6.) 

Deduce that the union of the associated primes of a Noetherian ring $R$ (viewed
as a module over itself) equals the set of zero-divisors of $R$.

4.10. Let $R$ be a commutative Noetherian ring. One can prove that the minimal
primes of $\operatorname{Ann}(M)$ (cf. Exercise V.1.9) are in $\operatorname{Ass}(M)$. Assuming this, prove that
the intersection of the associated primes of a Noetherian ring $R$ (viewed as a module
over itself) equals the nilradical of $R$.

4.11. Review the notion of presentation of a group (§II.8.2), and relate it to the
notion of presentation introduced in §4.2.

4.12. Let $\mathfrak{p}$ be a prime ideal of a polynomial ring $k[x_1, \dots, x_n]$ over a field $k$, and
let $R = k[x_1, \dots, x_n]/\mathfrak{p}$. Prove that every finitely generated module over $R$ has a
finite presentation.

4.13. ¬ Let $R$ be a commutative ring. A tuple $(a_1, a_2, \dots, a_n)$ of elements of $R$ is a
regular sequence if $a_1$ is a non-zero-divisor in $R$, $a_2$ is a non-zero-divisor modulo$^{22}$
$(a_1)$, $a_3$ is a non-zero-divisor modulo $(a_1, a_2)$, and so on.

For $a, b$ in $R$, consider the following complex of $R$-modules:

\[ (*) \qquad 0 \longrightarrow R \xrightarrow{\;d_2\;} R \oplus R \xrightarrow{\;d_1\;} R \xrightarrow{\;\pi\;} \frac{R}{(a,b)} \longrightarrow 0 \]

where $\pi$ is the canonical projection, $d_1(r, s) = ra + sb$, and $d_2(t) = (bt, -at)$. Put
otherwise, $d_1$ and $d_2$ correspond, respectively, to the matrices

\[ \begin{pmatrix} a & b \end{pmatrix}, \qquad \begin{pmatrix} b \\ -a \end{pmatrix}. \]

• Prove that this is indeed a complex, for every $a$ and $b$.

• Prove that if $(a, b)$ is a regular sequence, this complex is exact.

The complex (*) is called the Koszul complex of $(a, b)$. Thus, when $(a, b)$ is a
regular sequence, the Koszul complex provides us with a free resolution of the
module $R/(a, b)$. [4.14, 5.4, VIII.4.22]

22 That is, the class of $a_2$ in $R/(a_1)$ is a non-zero-divisor in $R/(a_1)$.

4.14. ¬ A Koszul complex may be defined for any sequence $a_1, \dots, a_n$ of elements
of a commutative ring $R$. The case $n = 2$ seen in Exercise 4.13 and the case $n = 3$
reviewed here will hopefully suffice to get a gist of the general construction; the
general case will be given in Exercise VIII.4.22.

Let $a, b, c \in R$. Consider the following complex:

\[ 0 \longrightarrow R \xrightarrow{\;d_3\;} R \oplus R \oplus R \xrightarrow{\;d_2\;} R \oplus R \oplus R \xrightarrow{\;d_1\;} R \xrightarrow{\;\pi\;} \frac{R}{(a,b,c)} \longrightarrow 0 \]

where $\pi$ is the canonical projection and the matrices for $d_1$, $d_2$, $d_3$ are, respectively,

\[ \begin{pmatrix} a & b & c \end{pmatrix}, \]

[the matrices for $d_2$ and $d_3$ are missing in this copy]

• Prove that this is indeed a complex, for every $a, b, c$.

• Prove that if $(a, b, c)$ is a regular sequence, this complex is exact.

Koszul complexes are very important in commutative algebra and algebraic geometry.
[VIII.4.22]


4.15. ▷ View $\mathbb{Z}$ as a module over the ring $R = \mathbb{Z}[x, y]$, where $x$ and $y$ act by $0$.
Find a free resolution of $\mathbb{Z}$ over $R$.


4.16. ▷ Let $\varphi : R^n \to R^m$ and $\psi : R^p \to R^q$ be two $R$-module homomorphisms,
and let

\[ \varphi \oplus \psi : R^n \oplus R^p \to R^m \oplus R^q \]

be the morphism induced on direct sums. Prove that

\[ \operatorname{coker}(\varphi \oplus \psi) \cong \operatorname{coker} \varphi \oplus \operatorname{coker} \psi. \]

[VIII.4.21]

4.17. Determine (as a better known entity) the module represented by the matrix

\[ \begin{pmatrix} 1 + 3x & 2x & 3x \\ 1 + 2x & 1 + 2x - x^2 & 2x \\ x & x^2 & x \end{pmatrix} \]

over the polynomial ring $k[x]$ over a field.


5. Classification of finitely generated modules over 
PIDs 

It is finally time to prove the classification theorem for finitely generated modules
over arbitrary PIDs. We have already proved this statement in the special case of
finite $\mathbb{Z}$-modules (§IV.6), and the diligent reader has worked out a proof in the less
special case of finitely generated modules over Euclidean domains, in Exercise 2.19.
Now we go for the real thing.

5.1. Submodules of free modules. Recall (Lemma 4.2, Example 4.3) that a
submodule of a free module over an arbitrary integral domain $R$ is necessarily
torsion-free but need not be free. For example, the ideal $I = (x, y)$ of $R = k[x,y]$
(with $k$ a field, for example) is torsion-free as an $R$-module, but not free: for this
to become really, really evident, it is helpful to rename $x = a$, $y = b$ when these
are viewed as elements of $I$ and observe that $a$ and $b$ are not linearly independent
over $k[x,y]$, since $ya - xb = 0$.

On the other hand, submodules of a free module over a field are automatically
free: simply because every module over a field is free (Proposition 1.7). It is
reasonable to expect that ‘some property in between’ being a field and being a
friendly UFD such as $k[x, y]$ will guarantee that a submodule of a free module is
free. We will now prove that this property is precisely that of being a principal
ideal domain.

Proposition 5.1. Let $R$ be a PID, let $F$ be a finitely generated free module over $R$,
and let $M \subseteq F$ be a submodule. Then $M$ is free.

We will actually prove a more precise result, in view of the full statement of
the classification theorem: we will show that there is a basis $(x_1, \dots, x_n)$ of $F$ and
elements $a_1, \dots, a_m$ of $R$ (with $m \le n$) such that

\[ y_1 = a_1 x_1, \quad \dots, \quad y_m = a_m x_m \]

form a basis of $M$. That is, not only do we prove that $M$ is free, but we also show
that there are ‘compatible’ bases of $F$ and $M$.

In order to do this, we may of course assume that $M \neq 0$: otherwise there is
nothing to prove. Most of the work will then go into showing that if $M \neq 0$, we can
split one direct summand off $M$; iterating this process will prove the proposition$^{23}$.
This is where the PID condition is used, so we will single out the main technical
point into the following statement.

Lemma 5.2. Let $R$ be a PID, let $F$ be a finitely generated free module over $R$, and
let $M \subseteq F$ be a nonzero submodule. Then there exist $a \in R$, $x \in F$, $y \in M$, and
submodules $F' \subseteq F$ and $M' \subseteq M$, such that $y = ax$, $M' = F' \cap M$, and

\[ F = \langle x \rangle \oplus F', \qquad M = \langle y \rangle \oplus M'. \]

The reader would find it instructive to pause a moment and try to imagine how
the PID hypothesis may enter into the proof of this lemma. The PID hypothesis is
a hypothesis on $R$, so the question is, where is the special copy of $R$ to which we
can apply it? Don’t read on until you have spent a moment thinking about this.

It is tempting to look for this copy of $R$ among the ‘factors’ of $F \cong R^n$: for
example, we could map $F$ to $R$ by projecting onto the first component:

\[ \pi(r_1, \dots, r_n) = r_1. \]

But there is nothing special about the first component of $R^n$; in fact there is
nothing special about the particular basis chosen for $F$, that is, the particular
representation of $F$ as $R^n$. Therefore, this does not look too promising. The way
out of this bind is to democratically map $F$ to $R$ in every possible way: we consider
all homomorphisms $\varphi : F \to R$. This is a set that does not depend on any extra
choice, so it is more likely to carry the information we need.

For each $\varphi$, $\varphi(M)$ is a submodule of $R$, that is, an ideal of $R$; so we have a
chance to use the PID hypothesis with profit. In fact, the much weaker fact that
$R$ is Noetherian guarantees already that there must be some homomorphism $\alpha$ for
which $\alpha(M)$ is maximal among all ideals $\varphi(M)$. This copy of $R$, that is, the target
of such a homomorphism $\alpha$, is special for a good reason, independent of inessential
choices. The fact that $R$ is a PID tells us that $\alpha(M)$ is principal, and then we are
clearly in business.

23 In a loose sense, this is the same strategy we employed in the proof of the classification
result for finite abelian groups in §IV.6.

Here is the formal argument: 

Proof. For all $\varphi \in \operatorname{Hom}_R(F, R)$, $\varphi(M)$ is a submodule of $R$, that is, an ideal.
The family of all these ideals is nonempty, and PIDs are Noetherian; therefore (by
Proposition V.1.1) there exists a maximal element in the family, say $\alpha(M)$, for a
homomorphism $\alpha : F \to R$. The fact that $M \neq 0$ implies immediately that some
$\varphi(M) \neq 0$ (for example, take for $\varphi$ the projection to a suitable factor of $R^n$); hence
$\alpha(M) \neq 0$.

Since $R$ is a PID, $\alpha(M)$ is principal: $\alpha(M) = (a)$ for some $a \in R$, $a \neq 0$. Since
$a \in \alpha(M)$, there exists an element $y \in M$ such that $\alpha(y) = a$. These are the
elements $a$, $y$ mentioned in the statement.

We claim that $a$ divides $\varphi(y)$ for all $\varphi \in \operatorname{Hom}_R(F, R)$. Indeed, let $b$ be a
generator of $(a, \varphi(y))$ (which exists since $R$ is a PID; of course $b$ is simply a gcd of
$a$ and $\varphi(y)$), and let $r, s \in R$ such that $b = ra + s\varphi(y)$; consider the homomorphism
$\psi := r\alpha + s\varphi$. Since $a \in (b)$, we have $\alpha(M) \subseteq (b)$. On the other hand

\[ b = ra + s\varphi(y) = (r\alpha + s\varphi)(y) = \psi(y) \in \psi(M); \]

therefore $(b) \subseteq \psi(M)$. It follows that $\alpha(M) \subseteq \psi(M)$, and by maximality $\alpha(M) = \psi(M)$;
hence $(a) = (b)$, and in particular $a$ divides $\varphi(y)$, as claimed.

Let $y = (s_1, \dots, s_n)$ as an element of $F = R^n$. Each $s_i$ is the image of $y$ by a
homomorphism $F \to R$ (that is, the $i$-th projection), so $a$ divides all of them by
what we just proved. Therefore there exist $r_1, \dots, r_n \in R$ such that $s_i = a r_i$; let

\[ x = (r_1, \dots, r_n) \in F. \]

This is the element $x$ mentioned in the statement. By construction, $y = ax$. Further,
$a = \alpha(y) = \alpha(ax) = a\alpha(x)$; since $R$ is an integral domain and $a \neq 0$, this
implies $\alpha(x) = 1$.

Finally, we let $F' = \ker \alpha$ and $M' = F' \cap M$, and we can proceed to verify the
stated direct sums.

First, every $z \in F$ may be written as

\[ z = \alpha(z)x + (z - \alpha(z)x); \]

by linearity

\[ \alpha(z - \alpha(z)x) = \alpha(z) - \alpha(z)\alpha(x) = \alpha(z) - \alpha(z) = 0, \]






that is, $z - \alpha(z)x \in \ker \alpha$. This implies that $F = \langle x \rangle + F'$. On the other hand,
$rx \in F' \implies \alpha(rx) = 0 \implies r\alpha(x) = 0 \implies r = 0$: that is, $\langle x \rangle \cap F' = 0$.
Therefore

\[ F = \langle x \rangle \oplus F', \]

as claimed (cf. Exercise 5.1).

Second, if $z \in M$, then $a$ divides $\alpha(z)$: indeed, $\alpha(z) \in \alpha(M) = (a)$. Writing
$\alpha(z) = ca$, we have $\alpha(z)x = cax = cy$: splitting $z$ as above, we note

\[ z - \alpha(z)x = z - cy \in M \cap F' = M', \]

and this leads as before to

\[ M = \langle y \rangle \oplus M', \]

concluding the proof. □
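
To make the splitting concrete, here is a small Python check over $R = \mathbb{Z}$. Everything specific in it is a hypothetical choice made for illustration: $F = \mathbb{Z}^2$, the submodule $M$ generated by $(2,4)$ and $(0,6)$, and $\alpha$ the first projection. Every entry of the generators is even, so every ideal $\varphi(M)$ lies in $(2)$, and $\alpha(M) = (2)$ is indeed maximal.

```python
from itertools import product

# Hypothetical example: F = Z^2, M generated by g1 = (2, 4) and g2 = (0, 6).
g1, g2 = (2, 4), (0, 6)

def alpha(z):
    """First projection F -> Z; alpha(M) = (2) is maximal among the ideals phi(M)."""
    return z[0]

a = 2          # generator of alpha(M)
y = g1         # an element of M with alpha(y) = a
x = (1, 2)     # y = a * x, and alpha(x) = 1

def split(z):
    """Write z = alpha(z) * x + k, returning (alpha(z), k) with k in ker(alpha)."""
    c = alpha(z)
    k = (z[0] - c * x[0], z[1] - c * x[1])
    return c, k

def in_M_prime(k):
    """M' = ker(alpha) ∩ M consists exactly of the vectors (0, 6v)."""
    return k[0] == 0 and k[1] % 6 == 0

# Every element of F splits, with the second component in F' = ker(alpha).
for z in product(range(-3, 4), repeat=2):
    c, k = split(z)
    assert alpha(k) == 0

# Every element of M splits as a multiple of y plus an element of M'.
for u, v in product(range(-3, 4), repeat=2):
    z = (u * g1[0] + v * g2[0], u * g1[1] + v * g2[1])
    c, k = split(z)
    assert c % a == 0 and in_M_prime(k)
```

The two loops mirror the two halves of the proof: the first checks $F = \langle x \rangle \oplus F'$ on a sample of vectors, the second checks $M = \langle y \rangle \oplus M'$ on a sample of elements of $M$.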

Once Lemma 5.2 is established, the proof of Proposition 5.1 is mere busywork:

Proof of Proposition 5.1. If $M = 0$, we are done. If not, applying Lemma 5.2
to $M \subseteq F$ produces an element $y_1 \in M$ and a submodule $M^{(1)} \subseteq M$ such that

\[ M = \langle y_1 \rangle \oplus M^{(1)}. \]

If $M^{(1)} = 0$, we are done; otherwise, apply Lemma 5.2 again (to $M^{(1)} \subseteq F$) to
obtain $y_2 \in M^{(1)}$ and $M^{(2)} \subseteq M^{(1)}$ so that

\[ M = \langle y_1 \rangle \oplus (\langle y_2 \rangle \oplus M^{(2)}). \]

This process may be continued, producing elements $y_1, \dots, y_m \in M$ such that

\[ M = \langle y_1 \rangle \oplus \cdots \oplus \langle y_m \rangle \oplus M^{(m)}, \]

so long as the module $M^{(m)}$ is nonzero. However, by Proposition 1.9 we know
$m \le n$, since $y_1, \dots, y_m$ are linearly independent in $F$. It follows that the process
must stop: that is, $M^{(m)} = 0$ for some $m \le n$. That is,

\[ M = \langle y_1 \rangle \oplus \cdots \oplus \langle y_m \rangle \]

is free, as needed. □

Note that the proof has not used part of the result of Lemma 5.2, that is, the
fact that the ‘factor’ $\langle y \rangle$ of $M$ is a submodule of a corresponding factor $\langle x \rangle$ of $F$.
This is needed in order to upgrade Proposition 5.1 along the lines mentioned after
its statement. Here is that stronger statement:

Corollary 5.3. Let $R$ be a PID, let $F$ be a finitely generated free module over $R$,
and let $M \subseteq F$ be a submodule. Then there exist a basis $(x_1, \dots, x_n)$ of $F$ and
nonzero elements $a_1, \dots, a_m$ of $R$ ($m \le n$) such that $(a_1 x_1, \dots, a_m x_m)$ is a basis
of $M$. Further, we may assume $a_1 \mid a_2 \mid \cdots \mid a_m$.

This statement should be compared with Proposition 2.11: it amounts to a
‘Smith normal form over PIDs’; cf. Remark 2.12.

Proof. Now that we know that submodules of a free module are free, we see that
the submodule $F' \subseteq F$ produced in Lemma 5.2 is free. The first part of the
statement then follows from Lemma 5.2, by an inductive argument analogous to
the proof of Proposition 5.1, and is left to the reader.

The most delicate part of the statement is the divisibility condition. By induction
it suffices to prove $a_1 \mid a_2$, and for this we refer back to the proof of Lemma 5.2:
$(a_1)$ is maximal among the ideals $\varphi(M)$ of $R$, as $\varphi$ ranges over all homomorphisms
$F \to R$. Consider then any$^{24}$ homomorphism $\varphi$ such that $\varphi(x_1) = \varphi(x_2) = 1$.
Since $\varphi(y_1) = \varphi(a_1 x_1) = a_1$, we have $(a_1) \subseteq \varphi(M)$; by maximality $(a_1) = \varphi(M)$.
Therefore $a_2 = \varphi(a_2 x_2) = \varphi(y_2) \in \varphi(M) = (a_1)$, proving $a_1 \mid a_2$ as needed. □

The logic of the argument given in this section is somewhat tricky: it takes
Lemma 5.2 to prove Proposition 5.1; but then it takes Proposition 5.1 (proving
that $F'$ is free) to revisit Lemma 5.2 and squeeze from it the stronger Corollary 5.3.

5.2. PIDs and resolutions. Proposition 5.1 allows us to complete the circle of
ideas begun in §4.2.

Proposition 5.4. Let $R$ be an integral domain. Then $R$ is a PID if and only if
for every finitely generated $R$-module $M$ and every epimorphism

\[ R^{m_0} \xrightarrow{\;\pi_0\;} M \longrightarrow 0, \]

there exist a free $R$-module $R^{m_1}$ and a homomorphism $\pi_1 : R^{m_1} \to R^{m_0}$ such that
the sequence

\[ 0 \longrightarrow R^{m_1} \xrightarrow{\;\pi_1\;} R^{m_0} \xrightarrow{\;\pi_0\;} M \longrightarrow 0 \]

is exact.

Proof. The fact that the stated condition implies that $R$ is a PID was proved
in Claim 4.11. For the converse, let $\pi_0 : R^{m_0} \to M$ be an epimorphism; then
$\ker \pi_0$ is free by Proposition 5.1; the result follows by choosing any isomorphism
$\pi_1 : R^{m_1} \to \ker(\pi_0)$. □

The content of Proposition 5.4 is possibly simpler than its awkward formulation:
Proposition 5.4 is simply a characterization for PIDs analogous to the
characterization of fields worked out in Proposition 4.10. This is another example of the fact
that we can study a ring $R$ by looking at how $R$-Mod is put together.

The reader can well imagine the ‘length-$n$’ version of the condition appearing
in these results: we may ask for the class of integral domains $R$ with the property
that for all finitely generated modules $M$ and all partial free resolutions of $M$,

\[ R^{m_{n-1}} \xrightarrow{\;\pi_{n-1}\;} \cdots \xrightarrow{\;\pi_1\;} R^{m_0} \xrightarrow{\;\pi_0\;} M \longrightarrow 0, \]

there exists a homomorphism $\pi_n : R^{m_n} \to R^{m_{n-1}}$ such that

\[ 0 \longrightarrow R^{m_n} \xrightarrow{\;\pi_n\;} R^{m_{n-1}} \xrightarrow{\;\pi_{n-1}\;} \cdots \xrightarrow{\;\pi_1\;} R^{m_0} \xrightarrow{\;\pi_0\;} M \longrightarrow 0 \]

is exact. We have proved that this condition characterizes fields for $n = 0$ and PIDs
for $n = 1$.

24 In order to define a homomorphism on a free module $F$, one may define it on a basis of $F$
and then extend it by linearity; there is no restriction on the possible choices for the images of
basis elements. The reader who is not sure about this should review the universal property of free
modules!

The alert reader should now have a déjà vu, as we have already encountered
a notion that is 0 precisely for fields and 1 for PIDs: the Krull dimension of a
ring; cf. Example III.4.14. Of course this is not a coincidence, but the full relation
between the Krull dimension and the bound on the length of free resolutions is
beyond the scope of this book$^{25}$. ‘Most’ rings (even rings with finite Krull dimension)
do not satisfy any such bound for all modules. The local rings satisfying the
finiteness condition described above for some $n$ correspond in the language of algebraic
geometry to ‘smooth’ points on varieties of dimension $\le n$.

5.3. The classification theorem. The notion of the rank of a free module
(Definition 1.14) extends naturally to every finitely generated module $M$ over an integral
domain $R$.

Definition 5.5. Let $R$ be an integral domain. The rank $\operatorname{rk} M$ of a finitely generated
$R$-module $M$ is the maximum number of linearly independent elements in $M$.

It should be clear that this number is finite (Exercise 5.6).

Theorem 5.6. Let $R$ be a PID, and let $M$ be a finitely generated $R$-module. Then
the following hold:

• There exist distinct prime ideals $(q_1), \dots, (q_n) \subseteq R$, positive integers $r_{ij}$, and
an isomorphism

\[ M \cong R^{\operatorname{rk} M} \oplus \Bigl( \bigoplus_{i,j} \frac{R}{(q_i^{r_{ij}})} \Bigr). \]

• There exist nonzero, nonunit ideals $(a_1), \dots, (a_m)$ of $R$, such that $(a_1) \supseteq
(a_2) \supseteq \cdots \supseteq (a_m)$, and an isomorphism

\[ M \cong R^{\operatorname{rk} M} \oplus \frac{R}{(a_1)} \oplus \cdots \oplus \frac{R}{(a_m)}. \]

These decompositions are unique (in the evident sense).

After all our preparatory work, this statement practically proves itself. We 
will describe the arguments in general terms, leaving the details to the enterprising 
reader. 

The two forms taken by the theorem go under the name of invariant factors
and elementary divisors, as in the case of abelian groups. As in the case of abelian
groups, the equivalence between the two formulations amounts to careful bookkeeping,
and we will not prove it formally; see §IV.6.2 for a reminder of how to go from
one to the other. The role played by Lemma IV.6.1 in that discussion is taken over
by the Chinese remainder theorem, Theorem V.6.1, in this more general setting.

25 The most persistent of readers will take a more sophisticated look at these bounds in the
exercises to §IX.8.
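
The bookkeeping can be sketched for $R = \mathbb{Z}$ in Python. The function names below are illustrative, and the factorization is plain trial division, which is only sensible for small integers; the lists stand for generators of the ideals $(a_i)$ and $(q_i^{r_{ij}})$ of the torsion part.

```python
from collections import defaultdict

def factorize(n):
    """Prime factorization of n > 1 as {prime: exponent}, by trial division."""
    out, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            out[p] = out.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        out[n] = out.get(n, 0) + 1
    return out

def to_elementary_divisors(invariant_factors):
    """Split each invariant factor into prime powers (Chinese remainder theorem)."""
    divisors = []
    for a in invariant_factors:
        divisors += [p ** e for p, e in factorize(a).items()]
    return sorted(divisors)

def to_invariant_factors(elementary_divisors):
    """Regroup prime powers: the largest power of each prime goes into a_m, and so on."""
    by_prime = defaultdict(list)
    for q in elementary_divisors:
        p = next(iter(factorize(q)))      # q is a prime power: a single prime occurs
        by_prime[p].append(q)
    for powers in by_prime.values():
        powers.sort(reverse=True)
    length = max((len(v) for v in by_prime.values()), default=0)
    factors = []
    for i in range(length):
        a = 1
        for powers in by_prime.values():
            if i < len(powers):
                a *= powers[i]
        factors.append(a)
    return factors[::-1]                  # ascending, so that a_1 | a_2 | ... | a_m

print(to_elementary_divisors([2, 12]))   # Z/2 + Z/12
print(to_invariant_factors([2, 3, 4]))   # Z/2 + Z/3 + Z/4
```

Both calls describe the same group: $\mathbb{Z}/2 \oplus \mathbb{Z}/12 \cong \mathbb{Z}/2 \oplus \mathbb{Z}/4 \oplus \mathbb{Z}/3$, since $\mathbb{Z}/12 \cong \mathbb{Z}/4 \oplus \mathbb{Z}/3$.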

As the two formulations are equivalent, it suffices to prove the existence and 
uniqueness of the decompositions given in Theorem 5.6 for any one of the two. 

The existence is a direct consequence of the results in §5.1. Indeed, let $M$ be a
finitely generated module; thus there is an epimorphism

\[ R^n \xrightarrow{\;\pi\;} M \longrightarrow 0 \]

where $n$ is the number of generators of $M$. Apply Corollary 5.3 to the submodule
$\ker \pi \subseteq R^n$: there exist a basis $(x_1, \dots, x_n)$ of $R^n$ and nonzero elements $a_1, \dots, a_m$
of $R$ such that $(a_1 x_1, \dots, a_m x_m)$ is a basis of $\ker \pi$ and further $a_1 \mid \cdots \mid a_m$.

That is, $M$ is presented by the $n \times m$ matrix

\[ \begin{pmatrix}
a_1 & \cdots & 0 \\
\vdots & \ddots & \vdots \\
0 & \cdots & a_m \\
0 & \cdots & 0 \\
\vdots & & \vdots \\
0 & \cdots & 0
\end{pmatrix}. \]

By Proposition 4.13, we may assume that $a_1, \dots, a_m$ are not units: if any one of
them is, the corresponding row and column may be omitted from the matrix (and
$n$ corrected accordingly). It follows that

\[ M \cong R^{n-m} \oplus \frac{R}{(a_1)} \oplus \cdots \oplus \frac{R}{(a_m)} \]

with $a_1 \mid \cdots \mid a_m$ nonzero nonunits, as prescribed by Theorem 5.6, and the existence
is proved.

As for the uniqueness of the representations, if

\[ M \cong R^r \oplus T, \]

with $T$ a torsion module$^{26}$, then $r = \operatorname{rk} M$ and $T \cong \operatorname{Tor}_R(M)$ (Exercise 5.10). Thus,
the ‘free part’ and the ‘torsion part’ in the decompositions given in Theorem 5.6
are determined uniquely.

This reduces the uniqueness question to the structure of torsion modules:
assuming that

\[ (*) \qquad \bigoplus_{i,j} \frac{R}{(p_i^{r_{ij}})} \cong \bigoplus_{k,\ell} \frac{R}{(q_k^{s_{k\ell}})}, \]

with $p_i$, $q_k$ irreducible elements of $R$, the task is to show that the range of the
indices is the same and, up to reordering, $(p_i) = (q_i)$ and $r_{ij} = s_{ij}$ for all $i$, $j$.

Since a finitely generated module is torsion if and only if its annihilator is 
nonzero (Exercise 4.4), it is reasonable that Ann(M) will be of some use here. 


26 Recall that this means that every element of $T$ is a torsion element; cf. §4.1.

Lemma 5.7. Let $M$ be a torsion module, expressed as in Theorem 5.6 (with $\operatorname{rk} M = 0$).
Then $\operatorname{Ann}(M) = (a_m)$. Further, the prime ideals $(q_i)$ are precisely the prime
ideals of $R$ containing $\operatorname{Ann}(M)$.

Proof. By hypothesis

\[ M \cong \frac{R}{(a_1)} \oplus \cdots \oplus \frac{R}{(a_m)}, \]

with $a_1 \mid \cdots \mid a_m$. If $r \in \operatorname{Ann}(M)$, then

\[ 0 = r(1, \dots, 1) = (r, \dots, r). \]

In particular $r = 0$ modulo $(a_m)$: that is, $r \in (a_m)$. Thus $\operatorname{Ann}(M) \subseteq (a_m)$.
For the reverse inclusion, assume $r \in (a_m)$ and $y \in M$. Identifying $M$ with its
decomposition, write $y = (y_1, \dots, y_m)$, with $y_i \in R/(a_i)$. Since $r \in (a_m) \subseteq (a_i)$, we
have $r y_i = 0$ for all $i$; therefore $ry = 0$, and $r \in \operatorname{Ann}(M)$ as needed.

For the second part of the statement, tracing the equivalence between the two
decompositions in Theorem 5.6 shows that the $q_i$'s are precisely the irreducible
factors of $a_m$, so the assertion follows from the first part. □


By Lemma 5.7, the sets $\{p_i\}$, $\{q_k\}$ of irreducibles appearing in (*) must
coincide (up to inessential units): the ideals they generate are precisely the prime
ideals containing $\operatorname{Ann}(M)$. The fact that the sets coincide also follows from the
observation that the isomorphism in (*) must match like primes, in the sense that
an element of a factor

\[ \frac{R}{(p_i^{r_{ij}})} \]

in the source must land in a combination of quotients in the target by powers of the
same prime $p_i$; this follows by comparing annihilators (cf. Exercise 5.11). Therefore,
uniqueness is reduced to the case of a single irreducible $q \in R$: it suffices to show
that if

\[ \frac{R}{(q^{r_1})} \oplus \cdots \oplus \frac{R}{(q^{r_m})} \cong \frac{R}{(q^{s_1})} \oplus \cdots \oplus \frac{R}{(q^{s_n})}, \]

with $r_1 \ge \cdots \ge r_m$ and $s_1 \ge \cdots \ge s_n$, then $m = n$ and $r_i = s_i$ for all $i$. This
situation reproduces precisely the key step the reader used in the proof of uniqueness
for the particular case of abelian groups, in Exercise IV.6.1. Of course we will not
spoil the reader’s fun by giving any more details this time (Exercise 5.12).


Remark 5.8. To summarize, the information carried by a torsion module over a
PID is equivalent to the choice of a selection of nonzero, nonunit ideals $(a_1), \dots, (a_m)$
such that

\[ (a_1) \supseteq (a_2) \supseteq \cdots \supseteq (a_m). \]

We have seen that $(a_m)$ is in fact the annihilator ideal of the module; it would be
nice if we had a similarly explicit description of all invariants $(a_1), \dots, (a_m)$. As a
preview of coming attractions, we could call the ideal

\[ (a_1 \cdots a_m) \]

the characteristic ideal of the module$^{27}$. In situations in which we can compute
both the annihilator and the characteristic ideal, comparing them may lead to
strong conclusions about the module. For example, a torsion module is cyclic if
and only if its annihilator and characteristic ideals coincide.

27 Warning: This does not seem to be standard terminology.

Further, the prime ideals appearing in Theorem 5.6 have been characterized
as the prime ideals containing the annihilator ideal. They may just as well be
characterized as those prime ideals containing the characteristic ideal, as the reader
will check (Exercise 5.15).

Finally, note that from this point of view it is immediate that the characteristic
ideal is contained in the annihilator ideal. We will run into this fact again, in an
important application (Theorem 6.11).
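
For finite abelian groups ($R = \mathbb{Z}$) the comparison is easy to experiment with. In the sketch below, a group is given by a list of orders of cyclic summands (not necessarily the invariant factors); the annihilator is then generated by their least common multiple and the characteristic ideal by their product, which is the order of the group. The function names are illustrative.

```python
from functools import reduce
from math import gcd

def annihilator(orders):
    """Generator of Ann(M) for M = Z/n_1 + ... + Z/n_k: the lcm of the n_i."""
    return reduce(lambda a, b: a * b // gcd(a, b), orders, 1)

def characteristic(orders):
    """Generator of the characteristic ideal: the product of the n_i."""
    return reduce(lambda a, b: a * b, orders, 1)

def is_cyclic(orders):
    """Cyclic if and only if the annihilator and characteristic ideals coincide."""
    return annihilator(orders) == characteristic(orders)

for orders in ([2, 3], [2, 6], [4, 9, 5]):
    print(orders, annihilator(orders), characteristic(orders), is_cyclic(orders))
```

In each case the annihilator divides the characteristic generator, reflecting the containment of the characteristic ideal in the annihilator ideal.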


Exercises 

5.1. ▷ Let $N$, $P$ be submodules of a module $M$, such that $N \cap P = \{0\}$ and
$M = N + P$. Prove that $M \cong N \oplus P$. (This is a word-for-word repetition of
Proposition IV.5.3 for modules.) [§5.1]

5.2. Let $R$ be an integral domain, and let $M$ be a finitely generated $R$-module.
Prove that $M$ is torsion if and only if $\operatorname{rk} M = 0$.

5.3. Complete the proof of Corollary 5.3. 

5.4. Let $R$ be an integral domain, and assume that $a, b \in R$ are such that $a \neq 0$,
$b \notin (a)$, and $R/(a)$, $R/(a, b)$ are both integral domains.

• Prove that the Krull dimension of $R$ is at least 2.

• Prove that if $R$ satisfies the finiteness condition discussed in §5.2 for some $n$,
then $n \ge 2$.

You can prove this second point by appealing to Proposition 5.4. For a more
concrete argument, you should look for an $R$-module admitting a free resolution of
length 2 which cannot be shortened.

• Prove that $(a, b)$ is a regular sequence in $R$ (Exercise 4.13).

• Prove that the $R$-module $R/(a, b)$ has a free resolution of length exactly 2.

Can you see how to construct analogous situations with $n \ge 3$ elements $a_1, \dots, a_n$?

5.5. ¬ Recall (Exercise V.4.11) that a commutative ring is local if it has a single
maximal ideal $\mathfrak{m}$. Let $R$ be a local ring, and let $M$ be a direct summand of a finitely
generated free $R$-module; that is, there exists an $R$-module $N$ such that $M \oplus N$ is
a free $R$-module.

• Choose elements $m_1, \dots, m_r \in M$ whose cosets mod $\mathfrak{m}M$ are a basis of $M/\mathfrak{m}M$
as a vector space over the field $R/\mathfrak{m}$. By Nakayama’s lemma, $M = \langle m_1, \dots, m_r \rangle$
(Exercise 3.10).

• Obtain a surjective homomorphism $\pi : F = R^{\oplus r} \to M$.

• Show that $\pi$ splits, giving an isomorphism $F \cong M \oplus \ker \pi$. (Apply Exercise III.6.9
to the surjective homomorphism $\pi$ and the free module $M \oplus N$ to obtain a
splitting $M \to F$; then use Proposition III.7.5.)

• Show $\ker \pi / \mathfrak{m} \ker \pi = 0$. Use Nakayama’s lemma (Exercise 3.8) to deduce that
$\ker \pi = 0$.

• Conclude that $M \cong F$ is in fact free. [VIII.2.24, VIII.6.8, VIII.6.11]

Summarizing, over a local ring, every direct summand of a finitely generated$^{28}$
free $R$-module is free. Using the terminology we will introduce in Chapter VIII, we
would say that ‘projective modules over local rings are free’. This result has strong
implications in algebraic geometry, since it underlies the notion of vector bundle.

Contrast this fact with Proposition 5.1, which shows that, over a PID, every
submodule of a finitely generated free module is free.

5.6. ▷ Let $R$ be an integral domain, and let $M = \langle m_1, \dots, m_r \rangle$ be a finitely
generated module. Prove that $\operatorname{rk} M \le r$. (Use Exercise 3.12.) [§5.3]

5.7. Let $R$ be an integral domain, and let $M$ be a finitely generated module over $R$.
Prove that $\operatorname{rk} M = \operatorname{rk}(M/\operatorname{Tor}(M))$.

5.8. Let $R$ be an integral domain, and let $M$ be a finitely generated module over
$R$. Prove that $\operatorname{rk} M = r$ if and only if $M$ has a free submodule $N \cong R^r$, such that
$M/N$ is torsion.

If $R$ is a PID, then $N$ may be chosen so that $0 \to N \to M \to M/N \to 0$ splits.

5.9. Let $R$ be an integral domain, and let

\[ 0 \longrightarrow M_1 \longrightarrow M_2 \longrightarrow M_3 \longrightarrow 0 \]

be an exact sequence of finitely generated $R$-modules. Prove that $\operatorname{rk} M_2 = \operatorname{rk} M_1 +
\operatorname{rk} M_3$.

Deduce that ‘rank’ defines a homomorphism from the Grothendieck group of
the category of finitely generated $R$-modules to $\mathbb{Z}$ (cf. §3.4).

5.10. ▷ Let $R$ be an integral domain, $M$ an $R$-module, and assume $M \cong R^r \oplus T$,
with $T$ a torsion module. Prove directly (that is, without using Theorem 5.6) that
$r = \operatorname{rk} M$ and $T \cong \operatorname{Tor}_R(M)$. [§5.3]

5.11. ▷ Let $R$ be an integral domain, let $M$, $N$ be $R$-modules, and let $\varphi : M \to N$
be a homomorphism. For $m \in M$, show that $\operatorname{Ann}(\langle m \rangle) \subseteq \operatorname{Ann}(\langle \varphi(m) \rangle)$. [§5.3]

5.12. ▷ Complete the proof of uniqueness in Theorem 5.6. (The hint in
Exercise IV.6.1 may be helpful.) [§5.3]

5.13. Let $M$ be a finitely generated module over an integral domain $R$.

Prove that if $R$ is a PID, then $M$ is torsion-free if and only if it is free. Prove
that this property characterizes PIDs. (Cf. Exercise 4.3.)

5.14. Give an example of a finitely generated module over an integral domain which
is not isomorphic to a direct sum of cyclic modules.

28 The finite rank hypothesis is actually unnecessary, but the proof is harder without this
condition.

5.15. ▷ Prove that the prime ideals appearing in the elementary divisor version of
the classification theorem for a torsion module $M$ over a PID are the prime ideals
containing the characteristic ideal of $M$, as defined in Remark 5.8. [§5.3]

5.16. Prove that the prime ideals appearing in the elementary divisor version of
the classification theorem for a module $M$ over a PID are the associated primes
of $M$, as defined in Exercise 4.5.

5.17. Let $R$ be a PID. Prove that the Grothendieck group (cf. §3.4) of the category
of finitely generated $R$-modules is isomorphic to $\mathbb{Z}$.


6. Linear transformations of a free module 

One beautiful application of the classification theorem for finitely generated mod- 
ules over PIDs is the determination of ‘special’ forms for matrices of linear maps of 
a vector space to itself. 

Several fundamental concepts are associated with the general notion of a linear 
map from a vector space to itself, and we will review these concepts in this section. 
Not surprisingly, much of the discussion may be carried out for free modules over 
any integral domain R, and we will stay at this level of generality in (most of) the 
section. Theorem 5.6 will be used with great profit when R is a field, in the next 
section. 

6.1. Endomorphisms and similarity. Let $R$ be an integral domain. We have
considered in some detail the module $\operatorname{Hom}_R(F, G)$ of $R$-module homomorphisms

\[ F \to G \]

between two free modules. For example, we have argued that describing such
homomorphisms for finitely generated free modules $F$, $G$ amounts to describing
matrices with entries in $R$, up to ‘equivalence’: the equivalence relation introduced
in §2.2 accounts for the (arbitrary) choice of bases for $F$ and $G$.

We now shift the focus a little and consider the special case in which $F = G$,
that is, the $R$-module $\operatorname{End}_R(F)$ of endomorphisms $\alpha$ of a fixed free $R$-module $F$:

\[ F \xrightarrow{\;\alpha\;} F. \]

Note that $\operatorname{End}_R(F)$ is in fact an $R$-algebra: the operation of composition makes it
a ring, compatibly with the $R$-module structure (cf. Exercise 2.3).

From the point of view championed in §2, the two copies of $F$ appearing in
$\operatorname{End}_R(F) = \operatorname{Hom}_R(F, F)$ are unrelated; they are just (any) isomorphic
representatives of a free module of a given rank. There are circumstances, however, in which
we need to really choose one representative $F$ and stick with it: that is, view $\alpha$ as
acting from a selected free module $F$ to itself, not just to an isomorphic copy of
itself. In this situation we also say that $\alpha$ is a linear transformation of $F$, or an
operator on $F$.

From this different point of view it makes sense to compare elements of $F$
‘before and after’ we apply $\alpha$; for example, we could ask whether for some $v \in F$
we may have $\alpha(v) = \lambda v$ for some $\lambda \in R$, or more generally whether a submodule
$M$ of $F$ may be sent to itself by $\alpha$. In other words, we can compare the action of
$\alpha$ with the identity$^{29}$ $I : F \to F$.

In terms of matrix representations (in the finite rank case) the description of a 
can be carried out as we did in §2, but with one interesting twist. In §2.2 we dealt 
with how the matrix representation of a homomorphism changes when we change 
bases in the source and in the target. As we are now identifying source and target, 
we must choose the same basis for both source and target. This leads to a different 
notion of equivalence of matrices: 

Definition 6.1. Two square matrices A, B € Mn(R) are similar if they represent 
the same homomorphism F — > F of a free rank-n module F to itself, up to the 
choice of a basis for F. -j 

This is clearly an equivalence relation. The analog of Proposition 2.5 for this 
new notion is readily obtained: 

Proposition 6.2. Two matrices $A, B \in \mathcal{M}_n(R)$ are similar if and only if there
exists an invertible matrix $P$ such that

$$B = PAP^{-1}.$$

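The reader who has really understood Proposition 2.5 will not need any detailed
proof of this statement: it should be apparent from staring at the butterfly diagram

\begin{tikzcd}
R^n \arrow[rrr, "A"] \arrow[dd, "\pi"'] \arrow[dr, "\varphi"] & & & R^n \arrow[dd, "\pi"] \arrow[dl, "\varphi"'] \\
& F \arrow[r, "\alpha"] & F & \\
R^n \arrow[rrr, "B"'] \arrow[ur, "\psi"'] & & & R^n \arrow[ul, "\psi"]
\end{tikzcd}

which we are essentially copying from §2.2. The difference here is that we are
choosing the same basis for source and target; hence the two triangles keeping
track of the change of basis are the same. Suppose $A$ (resp., $B$) represents $\alpha$ with
respect to the choice of basis dictated by $\varphi$ (resp., $\psi$); that is, $A$ (resp., $B$) is the
matrix of the ‘top’ (resp., bottom) composition

$$\varphi^{-1} \circ \alpha \circ \varphi : R^n \to R^n$$

(resp., $\psi^{-1} \circ \alpha \circ \psi$). If $P$ is the matrix representing the change of basis $\pi$, then
$B = PAP^{-1}$ simply because the diagram commutes:

$$\psi^{-1} \circ \alpha \circ \psi = (\pi \circ \varphi^{-1}) \circ \alpha \circ (\varphi \circ \pi^{-1}) = \pi \circ (\varphi^{-1} \circ \alpha \circ \varphi) \circ \pi^{-1}.$$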

Proposition 6.2 suggests a useful equivalence relation among endomorphisms:

Definition 6.3. Two $R$-module homomorphisms of a free module $F$ to itself,

$$\alpha, \beta : F \to F,$$

are similar if there exists an automorphism $\pi : F \to F$ such that

$$\beta = \pi \circ \alpha \circ \pi^{-1}.$$

29 The existence of the identity is one of the axioms of a category; every now and then, it is
handy to be able to invoke it.

In the finite rank case, similar endomorphisms are represented by similar matrices,
and two endomorphisms $\alpha$, $\beta$ are similar if and only if they may be represented
by the same matrix by choosing appropriate (possibly different) bases on $F$. The
interesting invariants determined by similar endomorphisms will turn out (not
surprisingly) to be the same.

From a group-theoretic viewpoint, similarity is an eminently natural notion to
study: the group $\mathrm{GL}(F) = \mathrm{Aut}_R(F)$ of automorphisms of a free module acts on
$\mathrm{End}_R(F)$ by conjugation, and two endomorphisms $\alpha$, $\beta$ are similar if and only if
they are in the same orbit under this action. If the reader prefers matrices, the
group $\mathrm{GL}_n(R)$ of invertible $n \times n$ matrices with entries in $R$ acts by conjugation on
the module $\mathcal{M}_n(R)$ of square $n \times n$ matrices, and two matrices are similar if and
only if they are in the same orbit.

Natural questions arise in this context. For example, we should look for ways
to distinguish different orbits: given two endomorphisms $\alpha$, $\beta$, can we effectively
decide whether $\alpha$ and $\beta$ are similar? One way to approach this question is to
determine invariants which can distinguish different orbits. Better still, we can
look for ‘special’ representatives within each orbit, that is, of a given similarity
class. In other words, given an endomorphism $\alpha : F \to F$, find a basis of $F$ with
respect to which the matrix description of $\alpha$ has a particular, predictable shape.
Then $\alpha$, $\beta$ are similar if these distinguished representatives coincide.

This is what we are eventually going to squeeze out of Theorem 5.6, in the 
friendly case of vector spaces over a fixed field. 

6.2. The characteristic and minimal polynomials of an endomorphism.
Let $\alpha \in \mathrm{End}_R(F)$ be an endomorphism of a free $R$-module; henceforth we are often
tacitly going to assume that $F$ is finitely generated (this is necessary anyway as
we are aiming to translate what we do into the language of matrices). We want
to identify invariants of the similarity class of $\alpha$, that is, quantities that will not
change if we replace $\alpha$ with a similar linear map $\beta \in \mathrm{End}_R(F)$.

For example, the determinant is such a quantity: 

Definition 6.4. Let $\alpha \in \mathrm{End}_R(F)$. The determinant of $\alpha$ is $\det \alpha := \det A$, where
$A$ is the matrix representing $\alpha$ with respect to any choice of basis of $F$.

Of course there is something to check here, that is, that $\det A$ does not depend
on the choice of basis. But we know (Proposition 6.2) that $A$, $B$ represent the
same endomorphism $\alpha$ if and only if there exists an invertible matrix $P$ such that
$B = PAP^{-1}$; then

$$\det(B) = \det(PAP^{-1}) = \det(P)\det(A)\det(P^{-1}) = \det(A)$$

by Proposition 3.3. Thus the determinant is indeed independent of the choice of
basis. Essentially the same argument shows that if $\alpha$ and $\beta$ are similar linear
transformations, then $\det(\alpha) = \det(\beta)$ (Exercise 6.3).

By Proposition 3.3, a linear transformation $\alpha$ is invertible if and only if $\det \alpha$
is a unit in $R$. If $R$ is a field, this of course means simply $\det \alpha \neq 0$. Even if $R$ is
not a field, $\det \alpha \neq 0$ says something interesting:

Proposition 6.5. Let $\alpha$ be a linear transformation of a free $R$-module $F = R^n$.
Then $\det \alpha \neq 0$ if and only if $\alpha$ is injective.

Proof. Embed $R$ in its field of fractions $K$, and view $\alpha$ as a linear transformation
of $K^n$; note that the determinant of $\alpha$ is the same whether it is computed over $R$
or over $K$. Then $\alpha$ is injective as a linear transformation $R^n \to R^n$ if and only if it
is injective as a linear transformation $K^n \to K^n$, if and only if it is invertible as a
linear transformation $K^n \to K^n$, if and only if $\det \alpha \neq 0$. □

Of course over integral domains other than fields $\alpha$ may fail to be surjective
even if $\det \alpha \neq 0$; and care is required even over fields, if we abandon the hypothesis
that the free modules are finitely generated (cf. Exercises 6.4 and 6.5).

Another quantity that is invariant under similarity is the trace. The trace of a
square matrix $A = (a_{ij}) \in \mathcal{M}_n(R)$ is

$$\mathrm{tr}(A) := \sum_{i=1}^{n} a_{ii},$$

that is, the sum of its diagonal entries.

Definition 6.6. Let $\alpha \in \mathrm{End}_R(F)$. The trace of $\alpha$ is defined to be $\mathrm{tr}\,\alpha := \mathrm{tr}\,A$,
where $A$ is the matrix representing $\alpha$ with respect to any choice of basis of $F$.

Again, we have to check that this is independent of the choice of basis; the key 
is the following computation. 

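Lemma 6.7. Let $A, B \in \mathcal{M}_n(R)$. Then $\mathrm{tr}(AB) = \mathrm{tr}(BA)$.

Proof. Let $A = (a_{ij})$, $B = (b_{ij})$. Then $AB = \left(\sum_{k=1}^{n} a_{ik} b_{kj}\right)$; hence

$$\mathrm{tr}(AB) = \sum_{i=1}^{n} \sum_{k=1}^{n} a_{ik} b_{ki}.$$

This expression is symmetric in $A$, $B$, so it must equal $\mathrm{tr}(BA)$. □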

With this understood, if $B = PAP^{-1}$, then

$$\mathrm{tr}(B) = \mathrm{tr}((PA)P^{-1}) = \mathrm{tr}(P^{-1}(PA)) = \mathrm{tr}((P^{-1}P)A) = \mathrm{tr}(A),$$

showing (by Proposition 6.2) that similar matrices have the same trace, as needed.

Again, the reader will check that, by the same token, similar linear
transformations have the same trace.
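The two invariance statements can be tested on a small example. The following Python sketch is only an illustration: the matrices `A`, `P`, `C`, `D` are arbitrary choices that do not come from the text, and the helper names `matmul`, `det`, and `trace` are hypothetical.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def det(M):
    """Determinant by expansion along the first row (fine for small n)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def trace(M):
    """Sum of the diagonal entries."""
    return sum(M[i][i] for i in range(len(M)))


# Arbitrary illustrative data over Z; P has determinant 1, so it is invertible.
A = [[2, 1], [0, 3]]
P = [[1, 1], [0, 1]]
P_inv = [[1, -1], [0, 1]]
assert matmul(P, P_inv) == [[1, 0], [0, 1]]

B = matmul(matmul(P, A), P_inv)  # B = P A P^{-1}, a matrix similar to A
assert det(B) == det(A)
assert trace(B) == trace(A)

# Lemma 6.7 on a pair of matrices that do not commute.
C = [[1, 2], [3, 4]]
D = [[0, 1], [5, -2]]
assert matmul(C, D) != matmul(D, C)
assert trace(matmul(C, D)) == trace(matmul(D, C))
```

Integer entries keep all arithmetic exact, so the equalities are checked on the nose rather than up to rounding.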

The trace and determinant of an endomorphism $\alpha$ of a rank-$n$ free module are
in fact just two of a sequence of $n$ invariants, which can be nicely collected together
into the characteristic polynomial of $\alpha$.

Definition 6.8. Let $F$ be a free $R$-module, and let $\alpha \in \mathrm{End}_R(F)$. Denote by $I$
the identity map $F \to F$. The characteristic polynomial of $\alpha$ is the polynomial

$$P_\alpha(t) := \det(tI - \alpha) \in R[t].$$

Proposition 6.9. Let $F$ be a free $R$-module of rank $n$, and let $\alpha \in \mathrm{End}_R(F)$.

• The characteristic polynomial $P_\alpha(t)$ is a monic polynomial of degree $n$.

• The coefficient of $t^{n-1}$ in $P_\alpha(t)$ equals $-\mathrm{tr}(\alpha)$.

• The constant term of $P_\alpha(t)$ equals $(-1)^n \det(\alpha)$.

• If $\alpha$ and $\beta$ are similar, then $P_\alpha(t) = P_\beta(t)$.


Proof. The first point is immediate, and the third is checked by setting $t = 0$. To
verify the second assertion, let $A = (a_{ij})$ be a matrix representing $\alpha$ with respect
to any basis for $F$, so that

$$P_\alpha(t) = \det\begin{pmatrix}
t - a_{11} & -a_{12} & \cdots & -a_{1n} \\
-a_{21} & t - a_{22} & \cdots & -a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
-a_{n1} & -a_{n2} & \cdots & t - a_{nn}
\end{pmatrix}.$$

Expanding the determinant according to Definition 3.1 (or in any other way), we
see that the only contributions to the coefficient of $t^{n-1}$ come from the diagonal
entries, in the form

$$\sum_{i=1}^{n} (-a_{ii})\, t^{n-1},$$

and the statement follows.

Finally, assume that $\alpha$ and $\beta$ are similar. Then there exists an invertible $\pi$
such that $\beta = \pi \circ \alpha \circ \pi^{-1}$, and hence

$$tI - \beta = \pi \circ (tI - \alpha) \circ \pi^{-1};$$

that is, $tI - \alpha$ and $tI - \beta$ are similar (as endomorphisms of $R[t]^n$). By Exercise 6.3
these two transformations must have the same determinant, and this proves the fourth point. □


By Proposition 6.9, all coefficients in the characteristic polynomial

$$t^n - (\mathrm{tr}\,\alpha)\, t^{n-1} + \cdots + (-1)^n (\det \alpha)$$

are invariant under similarity; as far as we know, trace and determinant are the
only ones that have special names.


Determinants, traces, and more generally the characteristic polynomial can
show very quickly that two linear transformations are not similar; but do they tell
us unfailingly when two transformations are similar? In general, the answer is no,
even over fields. For example, the two matrices

$$I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$$

both have characteristic polynomial $(t - 1)^2$, but they are not similar. Indeed, $I$ is
the identity and clearly only the identity is similar to the identity: $PIP^{-1} = I$ for
all invertible matrices $P$.

Therefore, the equivalence relation defined by prescribing that two transformations
have the same characteristic polynomial is coarser than similarity. This hints
that there must be other interesting quantities associated with a linear transformation
and invariant under similarity.
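For small matrices the characteristic polynomial can be computed straight from the definition $\det(tI - A)$. The sketch below is one possible way to do so, not an efficient algorithm: polynomials are stored as coefficient lists, lowest degree first, and the helper names (`poly_add`, `poly_mul`, `poly_det`, `char_poly`) are hypothetical.

```python
def poly_add(p, q):
    """Sum of two polynomials given as coefficient lists (lowest degree first)."""
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
            for i in range(n)]


def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_det(M):
    """Determinant of a square matrix with polynomial entries (first-row expansion)."""
    if len(M) == 1:
        return M[0][0]
    total = [0]
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        term = poly_mul(entry, poly_det(minor))
        if j % 2 == 1:
            term = [-c for c in term]
        total = poly_add(total, term)
    return total


def char_poly(A):
    """Coefficients of det(tI - A), lowest degree first."""
    n = len(A)
    t_minus_a = [[[-A[i][j], 1] if i == j else [-A[i][j]] for j in range(n)]
                 for i in range(n)]
    return poly_det(t_minus_a)


# The identity and the matrix A of the example share (t - 1)^2 = t^2 - 2t + 1.
print(char_poly([[1, 0], [0, 1]]))
print(char_poly([[1, 1], [0, 1]]))
```

Both calls return the same coefficient list, although the two matrices are not similar; this is the phenomenon described above.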

We are going to understand this much more thoroughly in a short while (at least
over fields), but we can already gain some insight by contemplating another kind
of ‘polynomial’ information, which also turns out to be invariant under similarity.
As $\mathrm{End}_R(F)$ is an $R$-algebra, we can evaluate every polynomial

$$f(t) = r_m t^m + r_{m-1} t^{m-1} + \cdots + r_0 \in R[t]$$

at any $\alpha \in \mathrm{End}_R(F)$:

$$f(\alpha) = r_m \alpha^m + r_{m-1} \alpha^{m-1} + \cdots + r_0 \in \mathrm{End}_R(F).$$

In other words, we can perform these operations in the ring $\mathrm{End}_R(F)$: multiplication
by $r \in R$ amounts to composition with $rI \in \mathrm{End}_R(F)$, and $\alpha^k$ stands for the
$k$-fold composition $\alpha \circ \cdots \circ \alpha$ of $\alpha$ with itself. The set of polynomials such that
$f(\alpha) = 0$ is an ideal of $R[t]$ (Exercise 6.7), which we will denote $\mathscr{I}_\alpha$ and call the
annihilator ideal of $\alpha$.

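Lemma 6.10. If $\alpha$ and $\beta$ are similar, then $\mathscr{I}_\alpha = \mathscr{I}_\beta$.

Proof. By hypothesis there exists an invertible $\pi$ such that $\beta = \pi \circ \alpha \circ \pi^{-1}$. As

$$\beta^k = (\pi \circ \alpha \circ \pi^{-1})^k = (\pi \circ \alpha \circ \pi^{-1}) \circ (\pi \circ \alpha \circ \pi^{-1}) \circ \cdots \circ (\pi \circ \alpha \circ \pi^{-1}) = \pi \circ \alpha^k \circ \pi^{-1},$$

we see that for all $f(t) \in R[t]$ we have

$$f(\beta) = \pi \circ f(\alpha) \circ \pi^{-1}.$$

It follows immediately that $f(\alpha) = 0 \iff f(\beta) = 0$, which is the statement. □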

Going back to the simple example shown above, the polynomial $t - 1$ is in the
annihilator ideal of the identity, while it is not in the annihilator ideal of the matrix

$$A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.$$
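Membership in the annihilator ideal is easy to test for a given matrix: evaluate the polynomial at the matrix and compare with zero. The sketch below does this by Horner's rule; the names `matmul` and `poly_at_matrix` are hypothetical, and polynomials are coefficient lists with the constant term first.

```python
def matmul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def poly_at_matrix(coeffs, A):
    """Evaluate f(A), where f = coeffs[0] + coeffs[1] t + ... (Horner's rule).

    The constant term r_0 is interpreted as r_0 * I, as in the text.
    """
    n = len(A)
    result = [[0] * n for _ in range(n)]
    for c in reversed(coeffs):
        result = matmul(result, A)
        for i in range(n):
            result[i][i] += c  # add c * I
    return result


identity = [[1, 0], [0, 1]]
A = [[1, 1], [0, 1]]
zero = [[0, 0], [0, 0]]

t_minus_1 = [-1, 1]             # the polynomial t - 1
t_minus_1_squared = [1, -2, 1]  # the polynomial (t - 1)^2

print(poly_at_matrix(t_minus_1, identity) == zero)   # t - 1 annihilates I
print(poly_at_matrix(t_minus_1, A) == zero)          # but it does not annihilate A
print(poly_at_matrix(t_minus_1_squared, A) == zero)  # (t - 1)^2 does
```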

An optimistic reader might now guess that two linear transformations are similar
if and only if both their characteristic polynomials and annihilator ideals coincide.
This is unfortunately not the case in general, but we promise that the
situation will be considerably clarified in short order (cf. Exercise 7.3).

In any case, even the simple example given above allows us to point out a
remarkable fact. Note that the (common) characteristic polynomial $(t - 1)^2$ of $I$
and $A$ annihilates both:

$$(A - I)^2 = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}^2 = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}.$$

This is not a coincidence: $P_\alpha(t) \in \mathscr{I}_\alpha$ for all linear transformations $\alpha$. That is,

Theorem 6.11 (Cayley-Hamilton). Let $P_\alpha(t)$ be the characteristic polynomial of
the linear transformation $\alpha \in \mathrm{End}_R(F)$. Then

$$P_\alpha(\alpha) = 0.$$

This beautiful observation can be proved directly by judicious use of Cramer’s
rule$^{30}$, in the form of Corollary 3.5; cf. Exercise 6.9. In any case, the
Cayley-Hamilton theorem will become essentially evident once we connect these linear
algebra considerations with the classification theorem for finitely generated modules
over a PID; the adventurous reader can already look back at Remark 5.8 and figure
out why the Cayley-Hamilton theorem is obvious.
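In rank 2 the theorem can be verified by brute force. By Proposition 6.9 the characteristic polynomial of a $2 \times 2$ matrix $A$ is $t^2 - \mathrm{tr}(A)\,t + \det(A)$, so the claim is that $A^2 - \mathrm{tr}(A)A + \det(A)I = 0$. The Python sketch below checks this on a range of integer matrices; it is an experiment, not a proof, and the function name `cayley_hamilton_residual` is hypothetical.

```python
from itertools import product


def cayley_hamilton_residual(A):
    """Return A^2 - tr(A) * A + det(A) * I for a 2x2 matrix A."""
    (a, b), (c, d) = A
    tr = a + d
    dt = a * d - b * c
    a_squared = [[a * a + b * c, a * b + b * d],
                 [c * a + d * c, c * b + d * d]]
    return [[a_squared[0][0] - tr * a + dt, a_squared[0][1] - tr * b],
            [a_squared[1][0] - tr * c, a_squared[1][1] - tr * d + dt]]


# Check every 2x2 matrix with entries in {-2, ..., 2}.
all_vanish = all(
    cayley_hamilton_residual([[a, b], [c, d]]) == [[0, 0], [0, 0]]
    for a, b, c, d in product(range(-2, 3), repeat=4)
)
print(all_vanish)
```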

If $R$ is an arbitrary integral domain, we cannot expect too much of $R[t]$, and it
seems hard to say something a priori concerning $\mathscr{I}_\alpha$. However, consider the field of
fractions $K$ of $R$ (§V.4.2); viewing $\alpha$ as an element of $\mathrm{End}_K(K^n)$ (that is, viewing
the entries of a matrix representation of $\alpha$ as elements of $K$, rather than $R$), $\alpha$ will
have an annihilator ideal $\mathscr{I}_\alpha^{(K)}$ ‘over $K$’, and it is clear that

$$\mathscr{I}_\alpha = \mathscr{I}_\alpha^{(K)} \cap R[t].$$

The advantage of considering $\mathscr{I}_\alpha^{(K)} \subseteq K[t]$ is that $K[t]$ is a PID, and it follows that
$\mathscr{I}_\alpha^{(K)}$ has a (unique) monic generator.

Definition 6.12. Let $F$ be a free $R$-module, and let $\alpha \in \mathrm{End}_R(F)$. Let $K$ be
the field of fractions of $R$. The minimal polynomial of $\alpha$ is the monic generator
$m_\alpha(t) \in K[t]$ of $\mathscr{I}_\alpha^{(K)}$.

With this terminology, the Cayley-Hamilton theorem amounts to the assertion
that the minimal polynomial divides the characteristic polynomial: $m_\alpha(t) \mid P_\alpha(t)$.

Of course the situation is simplified, at least from an expository point of view,
if $R$ is itself a field: then $K = R$, $m_\alpha(t) \in R[t]$, and $\mathscr{I}_\alpha = (m_\alpha(t))$. This is one
reason why we will eventually assume that $R$ is a field.

6.3. Eigenvalues, eigenvectors, eigenspaces.

Definition 6.13. Let $F$ be a free $R$-module, and let $\alpha \in \mathrm{End}_R(F)$ be a linear
transformation of $F$. A scalar $\lambda \in R$ is an eigenvalue for $\alpha$ if there exists $\mathbf{v} \in F$,
$\mathbf{v} \neq 0$, such that

$$\alpha(\mathbf{v}) = \lambda \mathbf{v}.$$

For example, 0 is an eigenvalue for $\alpha$ precisely when $\alpha$ has a nontrivial kernel.
The notion of eigenvalue is one of the most important in linear algebra, if not in
algebra, if not in mathematics, if not in the whole of science. The set of eigenvalues
of a linear transformation is called its spectrum. Spectra of operators show up
everywhere, from number theory to differential equations to quantum mechanics.
The spectrum of a ring, encountered briefly in §III.4.3, was so named because it
may be interpreted (in important motivating contexts) as a spectrum in the sense
of Definition 6.13. The ‘spectrum of the hydrogen atom’ is also a spectrum in this
sense.

30 Here is an even more direct ‘proof’: $P_\alpha(\alpha) = \det(\alpha I - \alpha) = \det(0) = 0$. Unfortunately
this does not work: in the definition $\det(tI - \alpha)$ of the characteristic polynomial, $t$ is assumed to
act as a scalar and cannot be replaced by $\alpha$. Applying Cramer’s rule circumvents this obstacle.

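It is hopefully immediate that similar transformations have the same spectrum:
for if $\beta = \pi \circ \alpha \circ \pi^{-1}$ and $\alpha(\mathbf{v}) = \lambda\mathbf{v}$, then

$$\beta(\pi(\mathbf{v})) = \pi \circ \alpha \circ \pi^{-1}(\pi(\mathbf{v})) = \pi(\alpha(\mathbf{v})) = \pi(\lambda\mathbf{v}) = \lambda\pi(\mathbf{v});$$

as $\pi(\mathbf{v}) \neq 0$ if $\mathbf{v} \neq 0$, this shows that every eigenvalue of $\alpha$ is an eigenvalue of $\beta$.
Since similarity is symmetric, the same argument shows that every eigenvalue of $\beta$ is an
eigenvalue of $\alpha$.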

If F is finitely generated, then we have the following useful translation of the 
notion of eigenvalue: 

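Lemma 6.14. Let $F$ be a finitely generated free $R$-module, and let $\alpha \in \mathrm{End}_R(F)$. Then
the set of eigenvalues of $\alpha$ is precisely the set of roots in $R$ of the characteristic
polynomial $P_\alpha(t)$.

Proof. This is a straightforward consequence of Proposition 6.5:

$$\begin{aligned}
\lambda \text{ is an eigenvalue for } \alpha &\iff \exists\, \mathbf{v} \neq 0 \text{ such that } \alpha(\mathbf{v}) = \lambda I(\mathbf{v}) \\
&\iff \exists\, \mathbf{v} \neq 0 \text{ such that } (\lambda I - \alpha)(\mathbf{v}) = 0 \\
&\iff \lambda I - \alpha \text{ is not injective} \\
&\iff \det(\lambda I - \alpha) = 0 \\
&\iff P_\alpha(\lambda) = 0
\end{aligned}$$

as claimed. □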

It is an evident consequence of Lemma 6.14 that eigenvalue considerations 
depend very much on the base ring R. 

Example 6.15. The matrix

$$\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$$

has no eigenvalues over $\mathbb{R}$, while it has eigenvalues over $\mathbb{C}$: indeed, the characteristic
polynomial $t^2 + 1$ has no real roots and two complex roots. The reader should
observe that, as a linear transformation of the real plane $\mathbb{R}^2$, this matrix corresponds
to a 90° counterclockwise rotation; the reason why this transformation has no (real)
eigenvalues is that no direction in the plane is preserved through a 90° rotation.

Example 6.16. An example with a different flavor is given by the matrix

$$\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}.$$

This has characteristic polynomial $t^2 - 2$; hence it has eigenvalues over $\mathbb{R}$, but
not over $\mathbb{Q}$. Geometrically, the corresponding transformation flips the plane about
a line; but that line has irrational slope, so it contains no nonzero vectors with
rational components.
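Lemma 6.14 turns the search for eigenvalues in the base ring into a root-finding problem. For matrices with integer entries and $R = \mathbb{Z}$ this can be carried out by exhaustive search, as in the following sketch. The function names are hypothetical; the search range relies on the standard fact that every eigenvalue $\lambda$ of a matrix satisfies $|\lambda| \le \max_i \sum_j |a_{ij}|$.

```python
def det(M):
    """Determinant by expansion along the first row (fine for small n)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def is_eigenvalue(A, lam):
    """Test whether det(lam * I - A) = 0, i.e. whether lam is a root of P_A(t)."""
    n = len(A)
    shifted = [[(lam if i == j else 0) - A[i][j] for j in range(n)]
               for i in range(n)]
    return det(shifted) == 0


def integer_eigenvalues(A):
    """All eigenvalues of the integer matrix A that lie in Z."""
    bound = max(sum(abs(x) for x in row) for row in A)
    return [lam for lam in range(-bound, bound + 1) if is_eigenvalue(A, lam)]


rotation = [[0, -1], [1, 0]]  # characteristic polynomial t^2 + 1
flip = [[1, 1], [1, -1]]      # a matrix with characteristic polynomial t^2 - 2
upper = [[2, 1], [0, 3]]      # arbitrary example with eigenvalues 2 and 3

print(integer_eigenvalues(rotation))
print(integer_eigenvalues(flip))
print(integer_eigenvalues(upper))
```

The first two lists are empty, in agreement with Examples 6.15 and 6.16: neither $t^2 + 1$ nor $t^2 - 2$ has a root in $\mathbb{Z}$.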

As another benefit of Lemma 6.14, we may now introduce the following notion: 

Definition 6.17. The algebraic multiplicity of an eigenvalue of a linear transformation
$\alpha$ of a finitely generated free module is its multiplicity as a root of the
characteristic polynomial of $\alpha$.

For example, the identity on $F$ has the single eigenvalue 1, with (algebraic)
multiplicity equal to the rank of $F$.

The sum of the algebraic multiplicities is bounded by the degree of the charac- 
teristic polynomial, that is, the dimension of the space. In particular, 

Corollary 6.18. The number of eigenvalues of a linear transformation of $R^n$ is
at most $n$. If the base ring $R$ is an algebraically closed field, then every linear
transformation has exactly $n$ eigenvalues (counted with algebraic multiplicity).

Proof. Immediate from Lemmas 6.14, V.5.1, and V.5.10. □ 

There is a different notion of multiplicity of an eigenvalue, related to how big
the corresponding$^{31}$ eigenspace may be.

Definition 6.19. Let $\lambda$ be an eigenvalue of a linear transformation $\alpha$ of a free
$R$-module $F$. Then a nonzero $\mathbf{v} \in F$ is an eigenvector for $\alpha$, corresponding to the
eigenvalue $\lambda$, if $\alpha(\mathbf{v}) = \lambda\mathbf{v}$, that is, if $\mathbf{v} \in \ker(\lambda I - \alpha)$. The submodule $\ker(\lambda I - \alpha)$
is the eigenspace corresponding to $\lambda$.

Definition 6.20. The geometric multiplicity of an eigenvalue $\lambda$ is the rank of its
eigenspace.

This is clearly invariant under similarity (Exercise 6.14). Geometric and algebraic
multiplicities do not necessarily coincide: for example, the matrix

$$\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$$

has characteristic polynomial $(t - 1)^2$, and hence the single eigenvalue 1, with
algebraic multiplicity 2. However, for a vector $\mathbf{v} = \begin{pmatrix} v_1 \\ v_2 \end{pmatrix}$,

$$\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} v_1 \\ v_2 \end{pmatrix} = \begin{pmatrix} v_1 + v_2 \\ v_2 \end{pmatrix}$$

equals $1\mathbf{v}$ if and only if $v_2 = 0$: that is, the eigenspace corresponding to the single
eigenvalue 1 consists of the span of the vector $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and has dimension 1. Thus, the
geometric multiplicity of the eigenvalue is 1 in this example.
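Over a field, the geometric multiplicity of $\lambda$ is $n$ minus the rank of the matrix $\lambda I - A$, and that rank can be found by Gaussian elimination. The sketch below works over $\mathbb{Q}$ with exact fractions; the function names `rank` and `geometric_multiplicity` are hypothetical.

```python
from fractions import Fraction


def rank(M):
    """Rank of a matrix over Q, computed by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in M]
    r = 0  # number of pivots found so far
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def geometric_multiplicity(A, lam):
    """Dimension of ker(lam * I - A) over Q; it is 0 if lam is not an eigenvalue."""
    n = len(A)
    shifted = [[(lam if i == j else 0) - A[i][j] for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)


print(geometric_multiplicity([[1, 1], [0, 1]], 1))  # the example above
print(geometric_multiplicity([[1, 0], [0, 1]], 1))  # the identity
```

For the matrix of the example the result is 1, while for the identity it is 2, although both have the eigenvalue 1 with algebraic multiplicity 2.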

Contemplating this example will convince the reader that the geometric multiplicity
of an eigenvalue is always less than or equal to its algebraic multiplicity. In the neatest
possible situation, an operator $\alpha$ on a free module $F$ of rank $n$ may have all its $n$
eigenvalues in the base ring $R$, and each algebraic multiplicity may agree with the
corresponding geometric multiplicity. If this is the case, $F$ may then be expressed
as a direct sum of the eigenspaces, producing the so-called spectral decomposition of
$F$ determined by $\alpha$ (cf. Exercise 6.15 for a concrete instance of this situation). The
action of $\alpha$ on $F$ is then completely transparent, as it amounts to simply applying
a (possibly) different scaling factor on each piece of the spectral decomposition.

31 If we were serious about pursuing the theory over arbitrary integral domains, we would feel
compelled to call this object the eigenmodule of $\alpha$. However, we are eventually going to restrict
our attention to the case in which the base ring is a field, so we will not bother.

If $A$ is a matrix representing $\alpha \in \mathrm{End}_R(F)$ with respect to any basis, then $\alpha$
admits a spectral decomposition if and only if $A$ is similar to a diagonal matrix:

$$A = P \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \lambda_n
\end{pmatrix} P^{-1},$$

where $\lambda_1, \ldots, \lambda_n$ are the eigenvalues of $\alpha$. In this case we say that $A$ (or $\alpha$) is
diagonalizable. A moment’s thought reveals that the columns of $P$ are then a basis
of $F$ consisting of eigenvectors of $\alpha$.
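A concrete instance may help. In the sketch below the matrix `A` is an arbitrary illustrative choice (it does not come from the text) with eigenvalues 1 and 3 and eigenvectors $(1, -1)$ and $(1, 1)$; placing these eigenvectors in the columns of `P` diagonalizes `A`. The helper names are hypothetical, and exact fractions are used because `P` is invertible over $\mathbb{Q}$ but not over $\mathbb{Z}$.

```python
from fractions import Fraction


def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def inverse_2x2(P):
    """Inverse of an invertible 2x2 matrix over Q."""
    (a, b), (c, d) = P
    dt = Fraction(a * d - b * c)
    return [[d / dt, -b / dt], [-c / dt, a / dt]]


A = [[2, 1], [1, 2]]
P = [[1, 1], [-1, 1]]  # columns: eigenvectors for the eigenvalues 1 and 3
P_inv = inverse_2x2(P)

D = matmul(matmul(P_inv, A), P)  # D = P^{-1} A P
print(D == [[1, 0], [0, 3]])
print(matmul(matmul(P, D), P_inv) == A)  # equivalently, A = P D P^{-1}
```

Here `P` has determinant 2, which is not a unit in $\mathbb{Z}$: this `A` is diagonalized over $\mathbb{Q}$ by this particular choice of `P`, and the computation says nothing about diagonalizability over $\mathbb{Z}$.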

Once more, we promise that the situation will become (even) clearer once we 
bring Theorem 5.6 into the picture. 


Exercises 


In these exercises, R denotes an integral domain. 

6.1. Let $k$ be an infinite field, and let $n$ be any positive integer.

• Prove that there are finitely many equivalence classes of matrices in $\mathcal{M}_n(k)$.

• Prove that there are infinitely many similarity classes of matrices in $\mathcal{M}_n(k)$.

6.2. Let $F$ be a free $R$-module of rank $n$, and let $\alpha, \beta \in \mathrm{End}_R(F)$. Prove that
$\det(\alpha \circ \beta) = \det(\alpha)\det(\beta)$. Prove that $\alpha$ is invertible if and only if $\det(\alpha)$ is
invertible.

6.3. ▷ Prove that if $\alpha$ and $\beta$ are similar in the sense of Definition 6.3, then $\det(\alpha) =
\det(\beta)$ and $\mathrm{tr}(\alpha) = \mathrm{tr}(\beta)$. (Do this without using Proposition 6.9!) [§6.2]

6.4. ▷ Let $F$ be a finitely generated free $R$-module, and let $\alpha$ be a linear
transformation of $F$. Give an example of an injective $\alpha$ which is not surjective; in fact,
prove that $\alpha$ is not surjective precisely when $\det \alpha$ is not a unit. [§6.2]

6.5. ▷ Let $k$ be a field, and view $k[t]$ as a vector space over $k$ in the evident way.
Give an example of a $k$-linear transformation $k[t] \to k[t]$ which is injective but not
surjective; give an example of a linear transformation which is surjective but not
injective. [§6.2, §VII.4.1]

6.6. Prove that two $2 \times 2$ matrices have the same characteristic polynomial if and
only if they have the same trace and determinant. Find two $3 \times 3$ matrices with
the same trace and determinant but different characteristic polynomials.

6.7. ▷ Let $\alpha \in \mathrm{End}_R(F)$ be a linear transformation on a free $R$-module $F$. Prove
that the set of polynomials $f(t) \in R[t]$ such that $f(\alpha) = 0$ is an ideal of $R[t]$. [§6.2]

6.8. ▷ Let $A \in \mathcal{M}_n(R)$ be a square matrix, and let $A^t$ be its transpose. Prove that
$A$ and $A^t$ have the same characteristic polynomial and the same annihilator ideals.

6.9. ▷ Prove the Cayley-Hamilton theorem, as follows. Recall that every square
matrix $M$ has an adjoint matrix, which we will denote $\mathrm{adj}(M)$, and that we proved
(Corollary 3.5) that $\mathrm{adj}(M) \cdot M = \det(M) \cdot I$. Applying this to $M = tI - A$ (with
$A$ a matrix realization of $\alpha \in \mathrm{End}_R(F)$) gives

$$(*) \qquad \mathrm{adj}(tI - A) \cdot (tI - A) = P_\alpha(t) \cdot I.$$

Prove that there exist matrices $B_k \in \mathcal{M}_n(R)$ such that $\mathrm{adj}(tI - A) = \sum_{k=0}^{n-1} B_k t^k$;
then use (*) to obtain $P_\alpha(A) = 0$, proving the Cayley-Hamilton theorem. [§6.2]

6.10. ▷ Let $F_1$, $F_2$ be free $R$-modules of finite rank, and let $\alpha_1$, resp., $\alpha_2$, be linear
transformations of $F_1$, resp., $F_2$. Let $F = F_1 \oplus F_2$, and let $\alpha = \alpha_1 \oplus \alpha_2$ be the
linear transformation of $F$ restricting to $\alpha_1$ on $F_1$ and $\alpha_2$ on $F_2$.

• Prove that $P_\alpha(t) = P_{\alpha_1}(t) P_{\alpha_2}(t)$. That is, the characteristic polynomial is
multiplicative under direct sums.

• Find an example showing that the minimal polynomial is not multiplicative
under direct sums.

[6.11, §7.2]

6.11. ¬ Let $\alpha$ be a linear transformation of a finite-dimensional vector space $V$,
and let $V_1$ be an invariant subspace, that is, such that $\alpha(V_1) \subseteq V_1$. Let $\alpha_1$ be
the restriction of $\alpha$ to $V_1$, and let $V_2 = V/V_1$. Prove that $\alpha$ induces a linear
transformation $\alpha_2$ on $V_2$, and (in the same vein as Exercise 6.10) show that
$P_\alpha(t) = P_{\alpha_1}(t) P_{\alpha_2}(t)$. Also, prove that $\mathrm{tr}\,\alpha^r = \mathrm{tr}\,\alpha_1^r + \mathrm{tr}\,\alpha_2^r$, for all $r > 0$. [6.12]

6.12. Let $\alpha$ be a linear transformation of a finite-dimensional $\mathbb{C}$-vector space $V$.
Prove the identity of formal power series with coefficients in $\mathbb{C}$:

$$\frac{1}{\det(I - \alpha t)} = \exp\left(\sum_{r=1}^{\infty} \mathrm{tr}(\alpha^r)\, \frac{t^r}{r}\right).$$

(Hint: The left-hand side is essentially the inverse of the characteristic polynomial
of $\alpha$. Use Exercise 6.11 to show that both sides are multiplicative with respect
to exact sequences $0 \to V_1 \to V \to V_2 \to 0$, where $V_1$ is an invariant subspace.
Use this and the fact that $\alpha$ admits nontrivial invariant subspaces since $\mathbb{C}$ is
algebraically closed (why?) to reduce to the case $\dim V = 1$, for which the identity is
an elementary calculus exercise.)

With due care, the identity can be stated and proved (in the same way) over
arbitrary fields of characteristic 0. It is an ingredient in the cohomological
interpretation of the ‘Weil conjectures’.

6.13. Let $A$ be a square matrix with integer entries. Prove that if $\lambda$ is a rational
eigenvalue of $A$, then in fact $\lambda \in \mathbb{Z}$. (Hint: Proposition V.5.5.)

6.14. ▷ Let $\lambda$ be an eigenvalue of two similar transformations $\alpha$, $\beta$. Prove that the
geometric multiplicities of $\lambda$ with respect to $\alpha$ and $\beta$ coincide. [§6.3]

6.15. ▷ Let $\alpha$ be a linear transformation on a free $R$-module $F$, and let $\mathbf{v}_1, \ldots, \mathbf{v}_n$
be eigenvectors corresponding to pairwise distinct eigenvalues $\lambda_1, \ldots, \lambda_n$. Prove
that $\mathbf{v}_1, \ldots, \mathbf{v}_n$ are linearly independent. (Hint: If not, there is a shortest linear
combination $r_1 \mathbf{v}_{i_1} + \cdots + r_m \mathbf{v}_{i_m} = 0$ with all $r_j \in R$, $r_j \neq 0$. Compare the action
of $\alpha$ on this linear combination with the product by $\lambda_{i_1}$.)

It follows that if $F$ has rank $n$ and $\alpha$ has $n$ distinct eigenvalues, then $\alpha$ induces
a spectral decomposition of $F$. [§6.3, VII.6.14]

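6.16. ¬ The standard inner product on $V = \mathbb{R}^n$ is the map $V \times V \to \mathbb{R}$ defined by

$$(\mathbf{v}, \mathbf{w}) := \mathbf{v}^t \cdot \mathbf{w}$$

(viewing elements $\mathbf{v} \in V$ as column vectors). The standard hermitian product on
$W = \mathbb{C}^n$ is the map $W \times W \to \mathbb{C}$ defined by

$$(\mathbf{v}, \mathbf{w}) := \mathbf{v}^\dagger \cdot \mathbf{w},$$

where for any matrix $M$, $M^\dagger$ stands for the matrix obtained by taking the complex
conjugates of the entries of the transpose $M^t$.

These products satisfy evident linearity properties: for example, for $\lambda \in \mathbb{C}$ and
$\mathbf{v}, \mathbf{w} \in W$

$$(\lambda\mathbf{v}, \mathbf{w}) = \overline{\lambda}\,(\mathbf{v}, \mathbf{w}).$$

Prove$^{32}$ that a matrix $M \in \mathcal{M}_n(\mathbb{R})$ belongs to $\mathrm{O}_n(\mathbb{R})$ if and only if it preserves
the standard inner product on $\mathbb{R}^n$:

$$(\forall \mathbf{v}, \mathbf{w} \in \mathbb{R}^n) \quad (M\mathbf{v}, M\mathbf{w}) = (\mathbf{v}, \mathbf{w}).$$

Likewise, prove that a matrix $M \in \mathcal{M}_n(\mathbb{C})$ belongs to $\mathrm{U}_n(\mathbb{C})$ if and only if it
preserves the standard hermitian product on $\mathbb{C}^n$. [6.18]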

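6.17. ¬ We say that two vectors $\mathbf{v}$, $\mathbf{w}$ of $\mathbb{R}^n$ or $\mathbb{C}^n$ are orthogonal if $(\mathbf{v}, \mathbf{w}) = 0$.
The orthogonal complement $\mathbf{v}^\perp$ of $\mathbf{v}$ is the set of vectors $\mathbf{w}$ that are orthogonal to $\mathbf{v}$.
Prove that if $\mathbf{v} \neq 0$ in $V = \mathbb{R}^n$ or $\mathbb{C}^n$, then $\mathbf{v}^\perp$ is a subspace of $V$ of dimension $n - 1$.
[7.16, VIII.5.15]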

6.18. ¬ Let $V = \mathbb{R}^n$, endowed with the standard inner product defined in
Exercise 6.16. A set of distinct vectors $\mathbf{v}_1, \ldots, \mathbf{v}_r$ is orthonormal if $(\mathbf{v}_i, \mathbf{v}_j) = 0$ for
$i \neq j$ and 1 for $i = j$. Geometrically, this means that each vector has length 1,
and different vectors are orthogonal. The same terminology may be used in $\mathbb{C}^n$,
w.r.t. the standard hermitian product.

Prove that $M \in \mathrm{O}_n(\mathbb{R})$ if and only if the columns of $M$ are orthonormal, if and
only if the rows of $M$ are orthonormal$^{33}$.

Formulate and prove an analogous statement for $\mathrm{U}_n(\mathbb{C})$. (The group $\mathrm{U}_n(\mathbb{C})$ is
called the unitary group. Note that, for $n = 1$, it consists of the complex numbers
of norm 1.) [6.19, 7.16, VIII.5.9]

6.19. ¬ Let $\mathbf{v}_1, \ldots, \mathbf{v}_r$ form an orthonormal set of vectors in $\mathbb{R}^n$.

• Prove that $\mathbf{v}_1, \ldots, \mathbf{v}_r$ are linearly independent; so they form an orthonormal
basis of the space $V$ they span.

• Let $\mathbf{w} = a_1 \mathbf{v}_1 + \cdots + a_r \mathbf{v}_r$ be a vector of $V$. Prove that $a_i = (\mathbf{v}_i, \mathbf{w})$.

• More generally, prove that if $\mathbf{w} \in \mathbb{R}^n$, then $(\mathbf{v}_i, \mathbf{w})$ is the component of $\mathbf{v}_i$ in
the orthogonal projection $\mathbf{w}_V$ of $\mathbf{w}$ onto $V$. (That is, prove that $\mathbf{w} - \mathbf{w}_V$ is
orthogonal to all vectors of $V$, where $\mathbf{w}_V = \sum_{i=1}^{r} (\mathbf{v}_i, \mathbf{w})\, \mathbf{v}_i$.)

For reasons such as these, it is convenient to work with orthonormal bases. Note
that, by Exercise 6.18, a matrix is in $\mathrm{O}_n(\mathbb{R})$ if and only if its columns form an
orthonormal basis of $\mathbb{R}^n$. Again, the reader should formulate parallel statements
for hermitian products in $\mathbb{C}^n$. [6.20]

32 See Exercise II.6.1 for the definitions of $\mathrm{O}_n(\mathbb{R})$ and $\mathrm{U}_n(\mathbb{C})$. We trust that basic facts on
inner products are not new to the reader. Among these, recall that for $\mathbf{v}, \mathbf{w} \in \mathbb{R}^n$, $(\mathbf{v}, \mathbf{v})$ equals the
square of the length of $\mathbf{v}$, and $(\mathbf{v}, \mathbf{w})$ equals the product of the lengths of $\mathbf{v}$ and $\mathbf{w}$ and the cosine
of the angle formed by $\mathbf{v}$ and $\mathbf{w}$. Thus, the reader is essentially asked to verify that $M \in \mathrm{O}_n(\mathbb{R})$ if
and only if $M$ preserves lengths and angles, and to check an analogous statement in the complex
environment.

33 The group $\mathrm{O}_n(\mathbb{R})$ is called the orthogonal group; orthonormal group would probably be a
more appropriate terminology.

6.20. The Gram-Schmidt process takes as input a choice of linearly independent
vectors $\mathbf{v}_1, \ldots, \mathbf{v}_r$ of $\mathbb{R}^n$ and returns vectors $\mathbf{w}_1, \ldots, \mathbf{w}_r$ spanning the same space $V$
spanned by $\mathbf{v}_1, \ldots, \mathbf{v}_r$ and such that $(\mathbf{w}_i, \mathbf{w}_j) = 0$ for $i \neq j$. Scaling each $\mathbf{w}_j$ by its
length, this yields an orthonormal basis for $V$.

Here is how the Gram-Schmidt process works:

• $\mathbf{w}_1 := \mathbf{v}_1$.

• For $k > 1$, $\mathbf{w}_k := \mathbf{v}_k - {}$orthogonal projection of $\mathbf{v}_k$ onto $\langle \mathbf{w}_1, \ldots, \mathbf{w}_{k-1} \rangle$.

(Cf. Exercise 6.19.) Prove that this process accomplishes the stated goal.
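The two steps of the process translate directly into code. The sketch below is one possible implementation over $\mathbb{Q}$ with exact arithmetic; the names `dot` and `gram_schmidt` are hypothetical. It takes for granted that the orthogonal projection of $\mathbf{v}$ onto the span of pairwise orthogonal vectors $\mathbf{w}_j$ is $\sum_j \frac{(\mathbf{w}_j, \mathbf{v})}{(\mathbf{w}_j, \mathbf{w}_j)} \mathbf{w}_j$ (compare Exercise 6.19), and it omits the final rescaling, which would in general require square roots.

```python
from fractions import Fraction


def dot(v, w):
    """Standard inner product of two vectors given as lists."""
    return sum(a * b for a, b in zip(v, w))


def gram_schmidt(vectors):
    """Return pairwise orthogonal vectors spanning the same space.

    The input vectors are assumed to be linearly independent; otherwise some
    output vector is zero and a later step divides by zero.
    """
    ws = []
    for v in vectors:
        v = [Fraction(x) for x in v]
        w = v[:]
        for u in ws:
            coeff = dot(u, v) / dot(u, u)  # component of v along u
            w = [wi - coeff * ui for wi, ui in zip(w, u)]
        ws.append(w)
    return ws


ws = gram_schmidt([[1, 1, 0], [1, 0, 1], [0, 1, 1]])
for w in ws:
    print(w)
```

Running this code is of course no substitute for the proof requested in the exercise; it only lets the reader watch the process at work on an example.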

6.21. ¬ A matrix $M \in \mathcal{M}_n(\mathbb{R})$ is symmetric if $M^t = M$. Prove that $M$ is
symmetric if and only if $(\forall \mathbf{v}, \mathbf{w} \in \mathbb{R}^n)$, $(M\mathbf{v}, \mathbf{w}) = (\mathbf{v}, M\mathbf{w})$.

A matrix $M \in \mathcal{M}_n(\mathbb{C})$ is hermitian if $M^\dagger = M$. Prove that $M$ is hermitian if
and only if $(\forall \mathbf{v}, \mathbf{w} \in \mathbb{C}^n)$, $(M\mathbf{v}, \mathbf{w}) = (\mathbf{v}, M\mathbf{w})$.

In both cases, one may say that $M$ is self-adjoint; this means that shuttling it
from one side of the product to the other does not change the result of the operation.

A hermitian matrix with real entries is symmetric. It is in fact useful to think
of real symmetric matrices as particular cases of hermitian matrices. [6.22]

6.22. ¬ Prove that the eigenvalues of a hermitian matrix (Exercise 6.21) are real.
Also, prove that if $\mathbf{v}$, $\mathbf{w}$ are eigenvectors of a hermitian matrix, corresponding to
different eigenvalues, then $(\mathbf{v}, \mathbf{w}) = 0$. (Thus, eigenvectors with distinct eigenvalues
for a real symmetric matrix are orthogonal.) [7.20]


7. Canonical forms 

7.1. Linear transformations of free modules; actions of polynomial rings.
We have promised to use Theorem 5.6 to better understand the similarity relation
and to obtain special forms for square matrices with entries in a field. We are now
ready to make good on our promise.

The questions the reader should be anxiously asking are, where is the PID? 
Where is the finitely generated module? Why would a classification theorem for 
these things have anything to tell us about matrices? Once these questions are 
answered, everything else will follow easily. In a sense, this is the one issue that 
we keep in our mind concerning all these considerations: once we remember how to 
get an interesting module out of a linear transformation of vector spaces, the rest 
is essentially an immediate consequence of standard notions. 

Once more, while the main application will be to vector spaces and matrices
over a field, we do not need to preoccupy ourselves with specializing the situation
before we get to the key point. Therefore, let’s keep working for a while longer on
our given integral domain$^{34}$ $R$.

Claim 7.1. Giving a linear transformation on a free $R$-module $F$ is the same as
giving an $R[t]$-module structure on $F$, compatible with its $R$-module structure.

Here $R[t]$ is the polynomial ring in one indeterminate $t$ over the ring $R$. The
claim is much less impressive than it sounds; in fact, it is completely tautological.
Giving an $R[t]$-module structure on $F$ compatible with the $R$-module structure is
the same as giving a homomorphism of $R$-algebras

$$\varphi : R[t] \to \mathrm{End}_R(F)$$

(this is an insignificant upgrade of the very definition of module from §III.5.1).
By the universal property satisfied by polynomial rings (§III.2.2, especially
Example III.2.3), giving $\varphi$ as an extension of the basic $R$-algebra structure of $\mathrm{End}_R(F)$ is
just the same as specifying the image $\varphi(t)$, that is, choosing an element of $\mathrm{End}_R(F)$.
This is precisely what the claim says.

Our propensity for the use of a certain language may obfuscate the matter,
which is very simple. Given a linear transformation $\alpha$ of a free module $F$, we can
define the action of a polynomial

$$f(t) = r_m t^m + r_{m-1} t^{m-1} + \cdots + r_0 \in R[t]$$

on $F$ as follows: for every $\mathbf{v} \in F$, set

$$f(t)(\mathbf{v}) := r_m \alpha^m(\mathbf{v}) + r_{m-1} \alpha^{m-1}(\mathbf{v}) + \cdots + r_0 \mathbf{v},$$

where $\alpha^k$ denotes the $k$-fold composition of $\alpha$ with itself; we have already run into
this in §6.2. Conversely, an action of $R[t]$ on $F$ determines in particular a map
$\alpha : F \to F$, that is, ‘multiplication by $t$’:

$$\mathbf{v} \mapsto t\mathbf{v};$$

the compatibility with the $R$-module structure on $F$ tells us precisely that $\alpha$ is a
linear transformation of $F$. Again, this is the content of Claim 7.1.
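The action just described is easy to make concrete when $F = \mathbb{Z}^n$ and $\alpha$ is given by an integer matrix. In the sketch below the names `apply` and `act` are hypothetical, and a polynomial is a list of coefficients $[r_0, r_1, \ldots, r_m]$.

```python
def apply(A, v):
    """Image of the vector v under the linear map with matrix A."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def act(f, A, v):
    """Action of the polynomial f on v, where t acts on vectors as the matrix A."""
    result = [0] * len(v)
    power = v[:]  # holds A^k v, starting with k = 0
    for r in f:
        result = [x + r * y for x, y in zip(result, power)]
        power = apply(A, power)
    return result


A = [[1, 1], [0, 1]]
v = [0, 1]

print(act([0, 1], A, v))      # t acts as A itself
print(act([-1, 1], A, v))     # (t - 1) v = A v - v
print(act([1, -2, 1], A, v))  # (t - 1)^2 v
```

The last vector is zero: $(t - 1)^2$ annihilates this module, in agreement with the computation of $(A - I)^2$ in §6.2.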

Tautological as it is, Claim 7.1 packs a good amount of information. Indeed,
the $R[t]$-module knows everything about similarity:

Lemma 7.2. Let $\alpha$, $\beta$ be linear transformations of a free $R$-module $F$. Then the
corresponding $R[t]$-module structures on $F$ are isomorphic if and only if $\alpha$ and $\beta$
are similar.

34 In fact, the requirements that $R$ be an integral domain and that $F$ be free are immaterial
here; cf. Exercise III.5.11.

Proof. Denote by $F_\alpha$, $F_\beta$ the two $R[t]$-modules defined on $F$ by $\alpha$, $\beta$ as per
Claim 7.1.

Assume first that $\alpha$ and $\beta$ are similar. Then there exists an invertible $R$-linear
transformation $\pi : F \to F$ such that

$$\beta = \pi \circ \alpha \circ \pi^{-1};$$

that is, $\pi \circ \alpha = \beta \circ \pi$. We can view $\pi$ as an $R$-linear map

$$F_\alpha \to F_\beta.$$

We claim that it is $R[t]$-linear: indeed, multiplication by $t$ is $\alpha$ in $F_\alpha$ and $\beta$ in $F_\beta$,
so

$$\pi(tv) = \pi \circ \alpha(v) = \beta \circ \pi(v) = t\pi(v).$$

Thus $\pi$ is an invertible $R[t]$-linear map $F_\alpha \to F_\beta$, proving that $F_\alpha$ and $F_\beta$ are
isomorphic as $R[t]$-modules.

The converse implication is obtained essentially by running this argument in
reverse and is left to the reader (Exercise 7.1). □
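
The key identity in the first half of the proof can be checked on a small example. This is only a sketch under our own assumptions: $R = \mathbb{Z}$, $F = \mathbb{Z}^2$, and the matrices `alpha`, `pi`, `pi_inv` below are illustrative choices, not data from the text.

```python
def mat_mul(a, b):
    """Product of two square matrices of the same size (lists of rows)."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Illustrative data: pi has determinant 1, so it is invertible over Z.
alpha = [[1, 1], [0, 1]]
pi = [[2, 1], [1, 1]]
pi_inv = [[1, -1], [-1, 2]]

# beta = pi o alpha o pi^{-1} is similar to alpha by construction.
beta = mat_mul(mat_mul(pi, alpha), pi_inv)

def intertwines(p, a, b):
    """True if p o a = b o p, i.e. p carries the action of a to that of b."""
    return mat_mul(p, a) == mat_mul(b, p)

# pi commutes with 'multiplication by t' (alpha on the source, beta on the
# target), and therefore also with multiplication by t^2.
t_linear = intertwines(pi, alpha, beta)
t2_linear = intertwines(pi, mat_mul(alpha, alpha), mat_mul(beta, beta))
```

Both flags come out true, which is the statement that $\pi$ is $R[t]$-linear on the monomials $t$ and $t^2$; linearity in general follows in the same way.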

Corollary 7.3. There is a one-to-one correspondence between the similarity classes
of $R$-linear transformations of a free $R$-module $F$ and the isomorphism classes of
$R[t]$-module structures on $F$.

Of course the same statement holds for square matrices with entries in R, in 
the finite rank case. 

Corollary 7.3 is the tool we were looking for, bridging between classifying sim- 
ilarity classes of linear transformations or matrices and classifying modules. The 
task now becomes that of translating the notions we have encountered in §6 into 
the module language and seeing if this teaches us anything new. 

In any case, since we have classified finitely generated modules over PIDs,
it is clear that we will be able to classify similarity classes of transformations of
finite-rank free modules, provided that $R[t]$ is a PID. This is where the additional
hypothesis that $R$ be a field enters the discussion (cf. Exercise V.2.12).

7.2. $k[t]$-modules and the rational canonical form. It is finally time to
specialize to the case in which $R = k$ is a field and $F$ has finite rank; so $F = V$ is
simply a finite-dimensional vector space. Let $n = \dim V$.

By the preceding discussion, choosing a linear transformation of $V$ is the same
as giving $V$ a $k[t]$-module structure (compatible with its vector space structure);
similar linear transformations correspond to isomorphic $k[t]$-modules. Then $V$ is
a finitely generated module over $k[t]$, and $k[t]$ is a PID since $k$ is a field
(Exercise III.4.4 and §V.2); therefore, we are precisely in the situation covered by the
classification theorem.

Even before spelling out the result of applying Theorem 5.6, we can make use
of its slogan version: every finitely generated module over a PID is a direct sum
of cyclic modules. This tells us that we can understand all linear transformations
of finite-dimensional vector spaces, up to similarity, if we understand the linear
transformation given by multiplication by $t$ on a cyclic $k[t]$-module

$$V = \frac{k[t]}{(f(t))},$$

where $f(t)$ is a nonconstant monic polynomial:

$$f(t) = t^n + r_{n-1} t^{n-1} + \cdots + r_0.$$


It is worthwhile obtaining a good matrix representation of this poster case. We
choose the basis

$$1,\ t,\ \dots,\ t^{n-1}$$

of $V$ (cf. Proposition III.4.6). Recall that the columns of the matrix corresponding
to a transformation consist of the images of the chosen basis (cf. the comments
preceding Corollary 2.2). Since multiplication by $t$ on $V$ acts as

$$\begin{cases}
1 \mapsto t, \\
t \mapsto t^2, \\
\quad\vdots \\
t^{n-1} \mapsto t^n = -r_{n-1} t^{n-1} - \cdots - r_0,
\end{cases}$$


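the matrix corresponding to this linear transformation is

$$\begin{pmatrix}
0 & 0 & 0 & \cdots & 0 & -r_0 \\
1 & 0 & 0 & \cdots & 0 & -r_1 \\
0 & 1 & 0 & \cdots & 0 & -r_2 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & 0 & -r_{n-2} \\
0 & 0 & 0 & \cdots & 1 & -r_{n-1}
\end{pmatrix}.$$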


Definition 7.4. This is called the companion matrix of the polynomial $f(t)$,
denoted $C_{f(t)}$.
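
The shape of the companion matrix is simple enough to generate mechanically. The sketch below assumes that the monic polynomial is given by the list of its lower coefficients $r_0, \dots, r_{n-1}$; the function name `companion` and the sample polynomial are ours.

```python
def companion(r):
    """Companion matrix of t^n + r[n-1] t^(n-1) + ... + r[0].

    r lists the coefficients r_0, ..., r_{n-1}; the leading coefficient 1
    is implicit. Column j (for j < n-1) records t * t^j = t^(j+1); the last
    column records t^n = -r_{n-1} t^(n-1) - ... - r_0.
    """
    n = len(r)
    c = [[0] * n for _ in range(n)]
    for i in range(1, n):
        c[i][i - 1] = 1          # subdiagonal of 1's
    for i in range(n):
        c[i][n - 1] = -r[i]      # last column: minus the coefficients
    return c

# Illustrative polynomial (our choice): f(t) = t^3 - 2 t^2 + 3 t - 5.
c_f = companion([-5, 3, -2])
```

For this choice the result is the matrix with rows $(0, 0, 5)$, $(1, 0, -3)$, $(0, 1, 2)$.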


Theorem 5.6 tells us (among other things) that every linear transformation 
admits a matrix representation into blocks, each of which is the companion matrix 
of a polynomial. Here is the statement of Theorem 5.6 in this context: 


Theorem 7.5. Let $k$ be a field, and let $V$ be a finite-dimensional vector space. Let
$\alpha$ be a linear transformation on $V$, and endow $V$ with the corresponding $k[t]$-module
structure, as in Claim 7.1. Then the following hold:

• There exist distinct monic irreducible polynomials $p_1(t), \dots, p_s(t) \in k[t]$ and
positive integers $r_{ij}$ such that

$$V \cong \bigoplus_{i,j} \frac{k[t]}{(p_i(t)^{r_{ij}})}$$

as $k[t]$-modules.

• There exist monic nonconstant polynomials $f_1(t), \dots, f_m(t) \in k[t]$ such that
$f_1(t) \mid \cdots \mid f_m(t)$ and

$$V \cong \frac{k[t]}{(f_1(t))} \oplus \cdots \oplus \frac{k[t]}{(f_m(t))}$$

as $k[t]$-modules.

Via these isomorphisms, the action of $\alpha$ on $V$ corresponds to multiplication by $t$.

Further, two linear transformations $\alpha$, $\beta$ are similar if and only if they have
the same collections of invariants $p_i(t)^{r_{ij}}$ (‘elementary divisors’), $f_i(t)$ (‘invariant
factors’).


Proof. Since $\dim V$ is finite, $V$ is finitely generated as a $k$-module and a fortiori as
a $k[t]$-module. The two isomorphisms are then obtained by applying Theorem 5.6.
All the relevant polynomials may be chosen to be monic since every polynomial
over a field is the associate of a (unique) monic polynomial (Exercise III.4.7). The
fact that the action of $\alpha$ corresponds to multiplication by $t$ is precisely what
defines the corresponding $k[t]$-module structure on $V$. The statement about similar
transformations follows from Corollary 7.3. □


Theorem 7.5 answers (over fields) the question raised in §6.1: we now have a 
list of invariants which describe completely the similarity class of a linear transfor- 
mation. Further, these invariants provide us with a special matrix representation 
of a given linear transformation. 


Definition 7.6. The rational canonical form of a linear transformation $\alpha$ of a
vector space $V$ is the block matrix

$$\begin{pmatrix}
C_{f_1(t)} & & \\
 & \ddots & \\
 & & C_{f_m(t)}
\end{pmatrix}$$

where $f_1(t), \dots, f_m(t)$ are the invariant factors of $\alpha$.


The rational canonical form of a square matrix is (of course) the rational canon- 
ical form of the corresponding linear transformation. The following statement is an 
immediate consequence of Theorem 7.5. 

Corollary 7.7. Every linear transformation admits a rational canonical form. Two 
linear transformations have the same rational canonical form if and only if they are 
similar. 

Remark 7.8. The ‘rational’ in rational canonical form has nothing to do with
$\mathbb{Q}$: it is meant to remind the reader that this form can be found without leaving
the base field. The other canonical form we will encounter will have entries in
a possibly larger field, where the characteristic polynomial of the transformation
factors completely.

Corollary 7.7 fulfills explicitly another wish expressed at the end of §6.1, that
is, to find one distinguished representative in each similarity class of matrices/linear
transformations. The rational canonical form is such a representative.

The next obvious question is how the invariants we just found relate to our 
more naive attempts to produce invariants of similar transformations in §6. 

Proposition 7.9. Let $f_1(t) \mid \cdots \mid f_m(t)$ be the invariant factors of a linear
transformation $\alpha$ on a vector space $V$. Then the minimal polynomial $m_\alpha(t)$ equals $f_m(t)$,
and the characteristic polynomial $P_\alpha(t)$ equals the product $f_1(t) \cdots f_m(t)$.

Proof. Tracing definitions, $(m_\alpha(t))$ is the annihilator ideal of $V$ when this is viewed
as a $k[t]$-module via $\alpha$ (as in Claim 7.1). Therefore the equality of $m_\alpha(t)$ and $f_m(t)$
is a restatement of Lemma 5.7.

Concerning the characteristic polynomial, by Exercise 6.10 it suffices to prove
the statement in the cyclic case: that is, it suffices to prove that if $f(t)$ is a monic
polynomial, then $f(t)$ equals the characteristic polynomial of the companion matrix
$C_{f(t)}$. Explicitly, let

$$f(t) = t^n + r_{n-1} t^{n-1} + \cdots + r_0$$

be a monic polynomial; then the task amounts to showing that

$$\det\begin{pmatrix}
t & 0 & 0 & \cdots & 0 & r_0 \\
-1 & t & 0 & \cdots & 0 & r_1 \\
0 & -1 & t & \cdots & 0 & r_2 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & t & r_{n-2} \\
0 & 0 & 0 & \cdots & -1 & t + r_{n-1}
\end{pmatrix} = t^n + r_{n-1} t^{n-1} + \cdots + r_0.$$


This can be done by an easy induction and is left to the reader (Exercise 7.2). □ 
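
While the induction is left to the reader, the statement is easy to test numerically. The sketch below computes $\det(tI - A)$ by the Faddeev–LeVerrier recursion, which is our choice of method (it is not the argument suggested in the text) and which divides by $1, \dots, n$, so it assumes a field of characteristic 0; here we work over $\mathbb{Q}$. The names `companion`, `mat_mul`, `char_poly` are ours.

```python
from fractions import Fraction

def companion(r):
    """Companion matrix of t^n + r[n-1] t^(n-1) + ... + r[0]."""
    n = len(r)
    c = [[0] * n for _ in range(n)]
    for i in range(1, n):
        c[i][i - 1] = 1
    for i in range(n):
        c[i][n - 1] = -r[i]
    return c

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def char_poly(a):
    """Coefficients of det(tI - a), lowest degree first (Faddeev-LeVerrier).

    M_1 = I; c_{n-k} = -tr(a M_k) / k; M_{k+1} = a M_k + c_{n-k} I.
    """
    n = len(a)
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    m = identity
    coeffs = [Fraction(1)]                       # leading coefficient of t^n
    for k in range(1, n + 1):
        am = mat_mul(a, m)
        c = -sum(am[i][i] for i in range(n)) / k
        coeffs.append(c)
        m = [[am[i][j] + c * identity[i][j] for j in range(n)]
             for i in range(n)]
    return coeffs[::-1]

# Illustrative polynomial (our choice): f(t) = t^3 - 2 t^2 + 3 t - 5.
r = [-5, 3, -2]
p = char_poly(companion(r))
```

The list `p` reproduces the coefficients $-5, 3, -2, 1$ of $f(t)$, in agreement with the claim for this particular polynomial.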

Corollary 7.10 (Cayley-Hamilton). The minimal polynomial of a linear
transformation divides its characteristic polynomial.

Proof. This has now become evident, as promised in §6.2. □ 

Finding the rational canonical form of a given matrix $A$ amounts to working
out the classification theorem for the specific $k[t]$-module corresponding to $A$ as in
Claim 7.1. As $k[t]$ is in fact a Euclidean domain, we know that this can be done by
Gaussian elimination (cf. §2.4). In practice, the process consists of compiling the
information obtained while diagonalizing $tI - A$ over $k[t]$ by elementary operations.
The reader is invited to either produce an original algorithm or at least look it up
in a standard reference to see what is involved.

Major shortcuts may come to our aid. For example, if the minimal and char- 
acteristic polynomials coincide, then one knows a priori that the corresponding 
module is cyclic (cf. Remark 5.8), and it follows that the rational canonical form is 
simply the companion matrix of the characteristic polynomial. 






In any case, the strength of a canonical form rests in the fact that it allows us
to reduce general facts to specific, standard cases. For example, the following
statement is somewhat mysterious if all one knows are the definitions, but it becomes
essentially trivial with a pinch of rational canonical forms:

Proposition 7.11. Let $A \in M_n(k)$ be a square matrix. Then $A$ is similar to its
transpose.

Proof. If $B$ is similar to $A$ and we can prove that $B$ is similar to its transpose $B^t$,
then $A$ is similar to its transpose $A^t$: because $B = PAP^{-1}$, $B^t = QBQ^{-1}$ give

$$A^t = (P^t Q P) A (P^t Q P)^{-1}.$$

Therefore, it suffices to prove the statement for matrices in rational canonical form.

Further, to prove the statement for a block matrix, it clearly suffices to prove it
for each block; so we may assume that $A$ is the companion matrix $C$ of a polynomial
$f(t)$. Since the characteristic and minimal polynomials of the transpose $C^t$ coincide
with those of $C$ (Exercise 6.8), they are both equal to $f(t)$. It follows that the
rational canonical form of $C^t$ is again the companion matrix to $f(t)$; therefore $C^t$
and $C$ are similar, as needed. □


7.3. Jordan canonical form. We have derived the rational canonical form from
the invariant factors of the module corresponding to a linear transformation. A
useful alternative can be obtained from the elementary divisors, at least in the case
in which the characteristic polynomial factors completely over the field $k$. If this is
not the case, one can enlarge $k$ so as to include all the roots of the characteristic
polynomial: the reader proved this in Exercise V.5.13. The price to pay will be that
the Jordan canonical form of a linear transformation $\alpha \in \mathrm{End}_k(V)$ may be a matrix
with entries in a field larger than $k$. In any case, whether two transformations are
similar or not is independent of the base field (Exercise 7.4), so this does not affect
the issue at hand.


Given $\alpha \in \mathrm{End}_k(V)$, obtain the elementary divisor decomposition of the
corresponding $k[t]$-module, as in Theorem 7.5:

$$V \cong \bigoplus_{i,j} \frac{k[t]}{(p_i(t)^{r_{ij}})}.$$

It is an immediate consequence of Proposition 7.9 and the equivalence between
the elementary divisor and invariant factor formulations that the characteristic
polynomial $P_\alpha(t)$ equals the product

$$\prod_{i,j} p_i(t)^{r_{ij}}.$$




Lemma 7.12. Assume that the characteristic polynomial $P_\alpha(t)$ factors completely;
that is,

$$P_\alpha(t) = \prod_{i=1}^{s} (t - \lambda_i)^{m_i}$$

where $\lambda_i$, $i = 1, \dots, s$, are the distinct eigenvalues of $\alpha$ (and $m_i$ are their algebraic
multiplicities; cf. §6.3). Then $p_i(t) = (t - \lambda_i)$, and $m_i = \sum_j r_{ij}$.

In this situation, the minimal polynomial of $\alpha$ equals

$$m_\alpha(t) = \prod_{i=1}^{s} (t - \lambda_i)^{\max_j r_{ij}}.$$

Proof. The first statement follows from uniqueness of factorizations. The
statement about the minimal polynomial is immediate from Proposition 7.9 and the
bookkeeping giving the equivalence of the two formulations in Theorem 7.5. □

The elementary divisor decomposition splits $V$ into a different collection of
cyclic modules than the decomposition in invariant factors: the basic cyclic bricks
are now of the form $k[t]/(p(t)^r)$ for a monic prime $p(t)$; by Lemma 7.12, assuming
that the characteristic polynomial factors completely over $k$, they are in fact of the
form

$$\frac{k[t]}{((t - \lambda)^r)}$$

for some $\lambda \in k$ (which equals an eigenvalue of $\alpha$) and $r > 0$. Just as we did for
‘companion matrices’, we now look for a basis with respect to which $\alpha$ (= multiplication
by $t$; keep in mind the fine print in Theorem 7.5) has a particularly simple
matrix representation. This time we choose the basis

$$(t - \lambda)^{r-1},\ (t - \lambda)^{r-2},\ \dots,\ (t - \lambda)^0 = 1.$$

Multiplication by $t$ on $V$ acts (in the first line, use the fact that $(t - \lambda)^r = 0$ in $V$) as

$$\begin{cases}
(t - \lambda)^{r-1} \mapsto t(t - \lambda)^{r-1} = \lambda (t - \lambda)^{r-1} + (t - \lambda)^r = \lambda (t - \lambda)^{r-1}, \\
(t - \lambda)^{r-2} \mapsto t(t - \lambda)^{r-2} = (t - \lambda)^{r-1} + \lambda (t - \lambda)^{r-2}, \\
\quad\vdots \\
1 \mapsto t = (t - \lambda) + \lambda.
\end{cases}$$

Therefore, with respect to this basis the linear transformation has matrix


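$$\begin{pmatrix}
\lambda & 1 & 0 & \cdots & 0 & 0 \\
0 & \lambda & 1 & \cdots & 0 & 0 \\
0 & 0 & \lambda & \cdots & 0 & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & \lambda & 1 \\
0 & 0 & 0 & \cdots & 0 & \lambda
\end{pmatrix}.$$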


Definition 7.13. This matrix is the Jordan block of size $r$ corresponding to $\lambda$,
denoted $J_{\lambda,r}$.
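
A Jordan block is determined by its eigenvalue and its size, and the relation $(t - \lambda)^r = 0$ used in its construction can be observed directly. The following sketch uses integer entries for simplicity; the helper names `jordan_block` and `nilpotency_index` are ours.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def jordan_block(lam, r):
    """The r x r Jordan block: lam on the diagonal, 1 just above it."""
    return [[lam if j == i else 1 if j == i + 1 else 0 for j in range(r)]
            for i in range(r)]

def nilpotency_index(lam, r):
    """Smallest e >= 1 with (J - lam*I)^e = 0, where J = jordan_block(lam, r)."""
    j = jordan_block(lam, r)
    shifted = [[j[i][c] - (lam if i == c else 0) for c in range(r)]
               for i in range(r)]
    power, e = shifted, 1
    while any(any(row) for row in power):   # stop once the power is zero
        power = mat_mul(power, shifted)
        e += 1
    return e

block = jordan_block(5, 3)
index = nilpotency_index(5, 3)
```

The index equals the size of the block: $(J - \lambda I)^r = 0$ while $(J - \lambda I)^{r-1} \neq 0$, reflecting the fact that the block corresponds to the cyclic module $k[t]/((t - \lambda)^r)$.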

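We can put several blocks together for a given $\lambda$: for $(r_j) = (r_1, \dots, r_\ell)$, let

$$J_{\lambda,(r_j)} := \begin{pmatrix}
J_{\lambda,r_1} & & \\
 & \ddots & \\
 & & J_{\lambda,r_\ell}
\end{pmatrix}.$$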


With this notation in hand, we get new distinguished representatives of the
similarity class of a given linear transformation:

Definition 7.14. The Jordan canonical form of a linear transformation $\alpha$ of a
vector space $V$ is the block matrix

$$\begin{pmatrix}
J_{\lambda_1,(r_{1j})} & & \\
 & \ddots & \\
 & & J_{\lambda_s,(r_{sj})}
\end{pmatrix}$$

where $(t - \lambda_i)^{r_{ij}}$ are the elementary divisors of $\alpha$ (cf. Lemma 7.12).

This canonical form is ‘a little less canonical’ than the rational canonical form,
in the sense that while the rational canonical form is really unique, some ambiguity
is left in the Jordan canonical form: there is no special way to choose the order
of the different blocks any more than there is a way to order$^{35}$ the eigenvalues
$\lambda_1, \dots, \lambda_s$.

Therefore, the analog of Corollary 7.7 should state that two linear
transformations are similar if and only if they have the same Jordan canonical form up to a
reordering of the blocks. As we have seen, every linear transformation $\alpha \in \mathrm{End}_k(V)$
admits a Jordan canonical form if $P_\alpha(t)$ factors completely over $k$; for example, we
can always find a Jordan canonical form with entries in the algebraic closure of $k$.

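Example 7.15. One use of the Jordan canonical form is the enumeration of all
possible similarity classes of transformations with given eigenvalues. For example,
there are 5 similarity classes of linear transformations with a single eigenvalue $\lambda$
with algebraic multiplicity 4, over a 4-dimensional vector space: indeed, there are 5
different ways to stack together Jordan blocks corresponding to the same eigenvalue,
within a $4 \times 4$ square matrix:

$$\begin{pmatrix}
\lambda & 0 & 0 & 0 \\
0 & \lambda & 0 & 0 \\
0 & 0 & \lambda & 0 \\
0 & 0 & 0 & \lambda
\end{pmatrix}, \quad
\begin{pmatrix}
\lambda & 1 & 0 & 0 \\
0 & \lambda & 0 & 0 \\
0 & 0 & \lambda & 0 \\
0 & 0 & 0 & \lambda
\end{pmatrix}, \quad
\begin{pmatrix}
\lambda & 1 & 0 & 0 \\
0 & \lambda & 0 & 0 \\
0 & 0 & \lambda & 1 \\
0 & 0 & 0 & \lambda
\end{pmatrix},$$

$$\begin{pmatrix}
\lambda & 1 & 0 & 0 \\
0 & \lambda & 1 & 0 \\
0 & 0 & \lambda & 0 \\
0 & 0 & 0 & \lambda
\end{pmatrix}, \quad
\begin{pmatrix}
\lambda & 1 & 0 & 0 \\
0 & \lambda & 1 & 0 \\
0 & 0 & \lambda & 1 \\
0 & 0 & 0 & \lambda
\end{pmatrix}.$$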
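
The count in Example 7.15 can be reproduced by listing the possible multisets of block sizes, that is, the ways of writing 4 as a sum of positive integers with the order disregarded. The generator below is a small sketch of this enumeration; the name `block_sizes` is ours.

```python
def block_sizes(n, largest=None):
    """Yield every non-increasing tuple of positive integers with sum n.

    Each tuple lists the sizes of the Jordan blocks for a single eigenvalue
    of algebraic multiplicity n; reordering the blocks gives nothing new.
    """
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in block_sizes(n - first, first):
            yield (first,) + rest

shapes = list(block_sizes(4))
```

The five tuples in `shapes` are $(4)$, $(3,1)$, $(2,2)$, $(2,1,1)$, $(1,1,1,1)$, matching the five matrices displayed above.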


The Jordan canonical form clarifies the difference between algebraic and
geometric multiplicities of an eigenvalue. The algebraic multiplicity of $\lambda$ as an
eigenvalue of a linear transformation $\alpha$ is (of course) the number of times $\lambda$ appears in
the Jordan canonical form of $\alpha$. Recall that the geometric multiplicity of $\lambda$ equals
the dimension of the eigenspace of $\lambda$.

Proposition 7.16. The geometric multiplicity of $\lambda$ as an eigenvalue of $\alpha$ equals
the number of Jordan blocks corresponding to $\lambda$ in the Jordan canonical form of $\alpha$.

35 Of course if the ground field is $\mathbb{Q}$ or $\mathbb{R}$, then we can order the $\lambda_i$’s in, for example, increasing
order. However, this is not an option on fields like $\mathbb{C}$.





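Proof. As the geometric multiplicity is clearly additive in direct sums, it suffices
to show that the geometric multiplicity of $\lambda$ for the transformation corresponding
to a single Jordan block

$$J = \begin{pmatrix}
\lambda & 1 & \cdots & 0 & 0 \\
0 & \lambda & \cdots & 0 & 0 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & \cdots & \lambda & 1 \\
0 & 0 & \cdots & 0 & \lambda
\end{pmatrix}$$

is 1.

Let $v = (v_1, v_2, \dots, v_r)^t$ be an eigenvector corresponding to $\lambda$. Then

$$\begin{pmatrix} \lambda v_1 \\ \lambda v_2 \\ \vdots \\ \lambda v_r \end{pmatrix}
= J \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_r \end{pmatrix}
= \begin{pmatrix} \lambda v_1 + v_2 \\ \lambda v_2 + v_3 \\ \vdots \\ \lambda v_r \end{pmatrix},$$

yielding

$$v_2 = \cdots = v_r = 0.$$

That is, the eigenspace of $\lambda$ is generated by

$$\begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}$$

and has dimension 1, as needed. □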
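
Proposition 7.16 can be illustrated by computing the dimension of an eigenspace as $n - \operatorname{rank}(A - \lambda I)$. The sketch below does this by Gaussian elimination over $\mathbb{Q}$; the function names and the two sample matrices are our own illustrative choices.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix over Q, by Gaussian elimination on a copy."""
    m = [[Fraction(x) for x in row] for row in matrix]
    rows, cols = len(m), len(m[0])
    rk = 0
    for col in range(cols):
        pivot = next((i for i in range(rk, rows) if m[i][col] != 0), None)
        if pivot is None:
            continue                       # no pivot in this column
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, rows):
            factor = m[i][col] / m[rk][col]
            m[i] = [x - factor * y for x, y in zip(m[i], m[rk])]
        rk += 1
        if rk == rows:
            break
    return rk

def geometric_multiplicity(a, lam):
    """Dimension of the eigenspace of lam: n - rank(a - lam*I)."""
    n = len(a)
    shifted = [[a[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)

# Blocks of sizes 2, 1, 1 for the eigenvalue 2: three Jordan blocks.
three_blocks = [[2, 1, 0, 0],
                [0, 2, 0, 0],
                [0, 0, 2, 0],
                [0, 0, 0, 2]]
# A single block of size 4 for the same eigenvalue.
one_block = [[2, 1, 0, 0],
             [0, 2, 1, 0],
             [0, 0, 2, 1],
             [0, 0, 0, 2]]

g3 = geometric_multiplicity(three_blocks, 2)
g1 = geometric_multiplicity(one_block, 2)
```

In both cases the algebraic multiplicity is 4, while the geometric multiplicities `g3` and `g1` are 3 and 1, the respective numbers of blocks.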


7.4. Diagonalizability. A linear transformation of a vector space $V$ is
diagonalizable if it can be represented by a diagonal matrix.

The question of whether a given linear transformation is or is not diagonalizable
is interesting and important: as we pointed out already in §6.3, when $\alpha \in \mathrm{End}_k(V)$
is diagonalizable, then the vector space $V$ admits a corresponding spectral
decomposition. Indeed, $\alpha$ is diagonalizable if and only if $V$ admits a basis of eigenvectors of $\alpha$. Much
more could be said on this important theme, but we are already in the position of
giving useful criteria for diagonalizability.

For example, the diagonal matrix representing $\alpha$ will necessarily be a Jordan
canonical form for $\alpha$, consisting of blocks of size 1. Proposition 7.16 tells us that
this can be detected by the difference between algebraic and geometric multiplicity,
so we get the following characterization of diagonalizable transformations:

Corollary 7.17. Assume the characteristic polynomial of $\alpha \in \mathrm{End}_k(V)$ factors
completely over $k$. Then $\alpha$ is diagonalizable if and only if the geometric and
algebraic multiplicities of all its eigenvalues coincide.

Another view of the same result is the following:

Proposition 7.18. Assume the characteristic polynomial of $\alpha \in \mathrm{End}_k(V)$ factors
completely over $k$. Then $\alpha$ is diagonalizable if and only if the minimal polynomial
of $\alpha$ has no multiple roots.

Proof. Again, diagonalizability is equivalent to having all Jordan blocks of size 1
in the Jordan canonical form of $\alpha$. Therefore, if the characteristic polynomial of $\alpha$
factors completely, then $\alpha$ is diagonalizable if and only if all exponents $r_{ij}$ appearing
in Theorem 7.5 equal 1. By the second part of Lemma 7.12, this is equivalent to
the requirement that the minimal polynomial of $\alpha$ have no multiple roots. □

The condition of factorizability in these statements is necessary in order to 
guarantee that a Jordan canonical form exists. Not surprisingly, whether a matrix 
is diagonalizable or not depends on the ground field. For example, 

G o ) 

is not diagonalizable over R (cf. Example 6.15), while it is diagonalizable over C. 
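
To see the dependence on the ground field in a computation, here is a short worked instance. The matrix $A$ below is an illustrative choice of ours (a rotation by a right angle), used only as an assumption for the sake of the example.

```latex
\[
A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \qquad
P_A(t) = \det(tI - A) = \det\begin{pmatrix} t & 1 \\ -1 & t \end{pmatrix} = t^2 + 1 .
\]
% Over R: t^2 + 1 has no real roots, so A has no eigenvalues, hence no
% eigenvectors, and it cannot be diagonalizable.
% Over C: t^2 + 1 = (t - i)(t + i) factors completely with distinct roots.
% The minimal polynomial divides P_A(t), so it has no multiple roots, and
% Proposition 7.18 applies:
\[
A \ \text{is similar over } \mathbb{C} \text{ to } \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}.
\]
```

The hypothesis of Proposition 7.18 fails over $\mathbb{R}$ for this matrix, while over $\mathbb{C}$ both the hypothesis and the criterion are satisfied.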

Endowing the vector space V with additional structure (such as an inner prod- 
uct) singles out important classes of operators, for which one can obtain precise and 
delicate results on the existence of spectral decompositions. For example, this can 
be used to prove that real symmetric matrices are diagonalizable (Exercise 7.20).
The reader will get a taste of these ‘spectral theorems’ in the exercises. 


Exercises


As above, k denotes a field. 

7.1. ▷ Complete the proof of Lemma 7.2. [§7.1]

7.2. ▷ Prove that the characteristic polynomial of the companion matrix of a monic
polynomial $f(t)$ equals $f(t)$. [§7.2]

7.3. ▷ Prove that two linear transformations of a vector space of dimension $\le 3$ are
similar if and only if they have the same characteristic and minimal polynomials.
Is this true in dimension 4? [§6.2]

7.4. ▷ Let $k$ be a field, and let $K$ be a field containing $k$. Two square matrices
$A, B \in M_n(k)$ may be viewed as matrices with entries in the larger field $K$. Prove
that $A$ and $B$ are similar over $k$ if and only if they are similar over $K$. [§7.3]

7.5. Find the rational canonical form of a diagonal matrix

$$\begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \lambda_n
\end{pmatrix},$$

assuming that the $\lambda_i$ are all distinct.

7.6. Let

$$A = \begin{pmatrix}
6 & -10 & -10 \\
3 & -5 & -6 \\
-1 & 2 & 3
\end{pmatrix}.$$

• Compute the characteristic polynomial of $A$.

• Find the minimal polynomial of $A$ (use the Cayley-Hamilton theorem!).

• Find the invariant factors of $A$.

• Find the rational canonical form of $A$.

7.7. Let $V$ be a $k$-vector space of dimension $n$, and let $\alpha \in \mathrm{End}_k(V)$. Prove that
the minimal and characteristic polynomials of $\alpha$ coincide if and only if there is a
vector $v \in V$ such that

$$v,\ \alpha(v),\ \dots,\ \alpha^{n-1}(v)$$

is a basis of $V$.

7.8. Let $V$ be a $k$-vector space of dimension $n$, and let $\alpha \in \mathrm{End}_k(V)$. Prove that the
characteristic polynomial $P_\alpha(t)$ divides a power of the minimal polynomial $m_\alpha(t)$.

7.9. What is the number of distinct similarity classes of linear transformations on
an $n$-dimensional vector space with one fixed eigenvalue $\lambda$ with algebraic
multiplicity $n$?

7.10. Classify all square matrices $A \in M_n(k)$ such that $A^2 = A$, up to similarity.
Describe the action of such matrices ‘geometrically’.

7.11. A square matrix $A \in M_n(k)$ is nilpotent (cf. Exercise V.4.19) if $A^k = 0$ for
some integer $k$.

• Characterize nilpotent matrices in terms of their Jordan canonical form.

• Prove that if $A^k = 0$ for some integer $k$, then $A^k = 0$ for some integer $k$ no larger
than $n$ (= the size of the matrix).

• Prove that the trace of a nilpotent matrix is 0.

7.12. ¬ Let $V$ be a finite-dimensional $k$-vector space, and let $\alpha \in \mathrm{End}_k(V)$ be a
diagonalizable linear transformation. Assume that $W \subseteq V$ is an invariant subspace,
so that $\alpha$ induces a linear transformation $\alpha|_W \in \mathrm{End}_k(W)$. Prove that $\alpha|_W$ is also
diagonalizable. (Use Proposition 7.18.) [7.14]

7.13. ¬ Let $R$ be an integral domain. Assume that $A \in M_n(R)$ is diagonalizable,
with distinct eigenvalues. Let $B \in M_n(R)$ be such that $AB = BA$. Prove that
$B$ is also diagonalizable, and in fact it is diagonal w.r.t. a basis of eigenvectors
of $A$. (If $P$ is such that $PAP^{-1}$ is diagonal, note that $PAP^{-1}$ and $PBP^{-1}$ also
commute.) [7.14]

7.14. Prove that ‘commuting transformations may be simultaneously diagonalized’,
in the following sense. Let $V$ be a finite-dimensional vector space, and let $\alpha, \beta \in
\mathrm{End}_k(V)$ be diagonalizable transformations. Assume that $\alpha\beta = \beta\alpha$. Prove that $V$
has a basis consisting of eigenvectors of both $\alpha$ and $\beta$. (Argue as in Exercise 7.13
to reduce to the case in which $V$ is an eigenspace for $\alpha$; then use Exercise 7.12.)

7.15. A complete flag of subspaces of a vector space $V$ of dimension $n$ is a sequence
of nested subspaces

$$0 = V_0 \subsetneq V_1 \subsetneq \cdots \subsetneq V_{n-1} \subsetneq V_n = V$$

with $\dim V_i = i$. In other words, a complete flag is a composition series in the sense
of Exercise 1.16.

Let $V$ be a finite-dimensional vector space over an algebraically closed field.
Prove that every linear transformation $\alpha$ of $V$ preserves a complete flag: that is,
there is a complete flag as above and such that $\alpha(V_i) \subseteq V_i$.

Find a linear transformation of $\mathbb{R}^2$ that does not preserve a complete flag.


7.16. (Schur form) Let $A \in M_n(\mathbb{C})$. Prove that there exists a unitary matrix
$P \in U_n(\mathbb{C})$ such that

$$A = P \begin{pmatrix}
\lambda_1 & * & * & \cdots & * \\
0 & \lambda_2 & * & \cdots & * \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & \cdots & \lambda_{n-1} & * \\
0 & 0 & \cdots & 0 & \lambda_n
\end{pmatrix} P^{-1}.$$

So not only is $A$ similar to an upper triangular matrix (this is already guaranteed
by the Jordan canonical form), but a matrix of change of basis $P$ ‘triangularizing’
the matrix $A$ can in fact be chosen to be in $U_n(\mathbb{C})$. (Argue inductively on $n$. Since
$\mathbb{C}$ is algebraically closed, $A$ has at least one eigenvector $v_1$, and we may assume
$(v_1, v_1) = 1$. For the induction step, consider the complement $v_1^\perp$; cf. Exercise 6.17,
and keep in mind Exercise 6.18.)

The upper triangular matrix is a Schur form for $A$. It is (of course) not unique.


7.17. ¬ A matrix $M \in M_n(\mathbb{C})$ is normal if $MM^\dagger = M^\dagger M$. Note that unitary
matrices and hermitian matrices are both normal.

Prove that triangular normal matrices are diagonal. [7.18]


7.18. ¬ Prove the spectral theorem for normal operators: if $M$ is a normal matrix,
then there exists an orthonormal basis of eigenvectors of $M$. Therefore, normal
operators are diagonalizable; they determine a spectral decomposition of the vector
space on which they act. (Consider the Schur form of $M$; cf. Exercise 7.16. Use
Exercise 7.17.) [7.19, 7.20]

7.19. Prove that a matrix $M \in M_n(\mathbb{C})$ is normal if and only if it admits an
orthonormal basis of eigenvectors. (Exercise 7.18 gives one direction; prove the
converse.)

7.20. ▷ Prove that real symmetric matrices are diagonalizable and admit an
orthonormal basis of eigenvectors. (This is an immediate consequence of Exercise 7.18
and Exercise 6.22.) [§7.4]

The minuscule amount of algebra we have developed so far allows us to scratch the
surface of the unfathomable subject of field theory. Fields are of basic importance 
in many subjects — number theory and algebraic geometry come immediately to 
mind — and it is not surprising that their study has been developed to a particularly 
high level of sophistication. While our profoundly limited knowledge will prevent us 
from even hinting at this sophistication, even a cursory overview of the basic notions 
will allow us to deal with remarkable applications, such as the famous problems of 
constructibility of geometric figures. We will also get a glimpse of the beautiful 
subject of Galois theory, an amazing interplay of the theory of field extensions, the 
solvability of polynomials, and group theory. 

1. Field extensions, I 

We first deal with several basic notions in field theory, mostly inspired by easy linear 
algebra considerations. The keywords here are finite, simple, finitely generated, 
algebraic. 

1.1. Basic definitions. The study of fields is (of course) the study of the category 
$\mathsf{Fld}$ of fields (Definition III.1.14), with ring homomorphisms as morphisms. The
task is to understand what fields are and above all what they are in relation to one
another.

The place to start is a reminder from elementary ring theory$^1$: every ring
homomorphism from a field to a nonzero ring is injective. Indeed, the kernel of
a ring homomorphism to a nonzero ring is a proper ideal (because a homomorphism
maps 1 to 1 by definition and $1 \ne 0$ in nonzero rings), and the only proper
ideal in a field is $(0)$. In particular, every ring homomorphism of fields is injective
(fields are nonzero rings by definition!); every morphism in $\mathsf{Fld}$ is a monomorphism
(cf. Proposition III.2.4).

1 The last time we made this point was back in §V.5.2.

Thus, every morphism $k \to K$ between two fields identifies the first with a
subfield of the second. In other words, one can view $K$ as a particular way to
enlarge $k$, that is, as an extension of $k$. Field theory is first and foremost the
study of field extensions. We will denote a field extension by $k \subseteq K$ (which is less
than optimal, because there may be many ways to embed a field into another);
other popular choices are $K/k$ (which we do not like, as it suggests a quotienting
operation) and

$$\begin{array}{c} K \\ | \\ k \end{array}$$

(which we will avoid most of the time, as it is hard to typeset).

The coarsest invariant of a field $k$ is its characteristic; cf. Exercise V.4.17. We
have a unique ring homomorphism $i : \mathbb{Z} \to k$ ($\mathbb{Z}$ is initial in Ring: 1 must go to 1
by definition of homomorphism, and this fixes the value of $i(n)$ for all $n \in \mathbb{Z}$); the
characteristic of $k$, $\operatorname{char} k$, is defined to be the nonnegative generator of the ideal
$\ker i$; that is, $\operatorname{char} k = 0$ if $i$ is injective, and $\operatorname{char} k = p > 0$ if $\ker i = (p) \ne (0)$.

Put otherwise, either $n \cdot 1$ is only 0 in $k$ for $n = 0$, in which case $\operatorname{char} k = 0$,
or $n \cdot 1 = 0$ for some $n \ne 0$; $\operatorname{char} k = p > 0$ is then the smallest positive integer for
which $p \cdot 1 = 0$ in $k$. Since fields are integral domains, the image $i(\mathbb{Z})$ must be an
integral domain; hence $\ker i$ must be a prime ideal. Therefore, the characteristic of
a field is either 0 or a prime number.
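
The second description of the characteristic (the smallest positive $n$ with $n \cdot 1 = 0$) is easy to run on the rings $\mathbb{Z}/m\mathbb{Z}$. The sketch below is only an illustration on these finite rings; the function name is ours.

```python
def characteristic_mod(m):
    """Least positive n with n * 1 = 0 in Z/mZ, found by repeated addition."""
    one = 1 % m            # the class of 1 (it is 0 in the zero ring Z/1Z)
    total, n = one, 1
    while total != 0:
        total = (total + one) % m
        n += 1
    return n

char_f7 = characteristic_mod(7)     # Z/7Z is a field
char_z6 = characteristic_mod(6)     # Z/6Z is not an integral domain
zero_divisors = (2 * 3) % 6         # 2 and 3 are nonzero, their product is 0
```

The field $\mathbb{Z}/7\mathbb{Z}$ has prime characteristic 7, while $\mathbb{Z}/6\mathbb{Z}$ has the composite value 6 and visibly fails to be an integral domain, in line with the argument above.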

If $k \subseteq K$ is an extension, then $\operatorname{char} k = \operatorname{char} K$ (Exercise 1.1). Thus, we could
define and study categories $\mathsf{Fld}_0$, $\mathsf{Fld}_p$ of fields of a given characteristic, without
losing any information: these categories live within $\mathsf{Fld}$, without interacting with
each other$^2$. Further, each of these categories has an initial object: $\mathbb{Q}$ is initial
in $\mathsf{Fld}_0$, and$^3$ $\mathbb{F}_p := \mathbb{Z}/p\mathbb{Z}$ in $\mathsf{Fld}_p$; the reader has checked this easy fact in
Exercise V.4.17, and now is an excellent time to contemplate it again. In each case,
the initial object is called the prime subfield of the given field. Thus, every field is
in a canonical way an extension of its prime subfield: studying fields really means
studying field extensions. To a large extent, the ‘small’ field $k$ will be fixed in what
follows, and our main object of study will be the category $\mathsf{Fld}_k$ of extensions of $k$,
with the evident (and by now familiar to the reader) notions of morphisms.

The first general remark concerning field extensions is that the larger field is 
an algebra, and hence a vector space over the smaller one (by the very definition of 
algebra; cf. Example III.5.6). 

Definition 1.1. A field extension $k \subseteq F$ is finite, of degree $n$, if $F$ has (finite)
dimension $\dim F = n$ as a vector space over $k$. The extension is infinite otherwise.

The degree of a finite extension $k \subseteq F$ is denoted by $[F : k]$ (and we write
$[F : k] = \infty$ if the extension is infinite).


2 They do interact in other ways, however; for example, because $\mathbb{Z}$ is initial in Ring, $\mathbb{Z}$
maps to all fields.

3 We are going to denote the field $\mathbb{Z}/p\mathbb{Z}$ by $\mathbb{F}_p$ in this chapter, to stress its role as a field
(rather than ‘just’ as a group).

In §V.5.2 we have encountered a prototypical example of finite field extension:
a procedure starting from an irreducible polynomial $f(x) \in k[x]$ with coefficients
in a field and producing an extension $K$ of $k$ in which $f(t)$ has a root. Explicitly,

$$K := \frac{k[t]}{(f(t))}$$

is such an extension (Proposition V.5.7). The quotient is a field because $k[t]$ is
a PID; hence irreducible elements generate maximal ideals: such ideals are prime because of the
UFD property (cf. Theorem V.2.5) and then maximal because nonzero prime ideals
are maximal in a PID (cf. Proposition III.4.13). The coset of $t$ is a root of $f(x)$
when this is viewed as an element of $K[x]$. The degree of the extension $k \subseteq K$
equals the degree of the polynomial $f(x)$ (this is hopefully clear at this point and
was checked carefully in Proposition III.4.6).
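
Arithmetic in such a quotient is concrete: elements are polynomials of degree less than $\deg f$, multiplied modulo $f$. The following sketch works in $\mathbb{Q}[t]/(t^2 - 2)$, an example chosen by us for illustration; polynomials are lists of coefficients, lowest degree first, and the helper names are ours.

```python
from fractions import Fraction

def reduce_mod(p, f):
    """Remainder of p modulo the monic polynomial f (lowest degree first)."""
    p = [Fraction(c) for c in p]
    n = len(f) - 1                      # degree of f
    while len(p) > n:
        lead = p.pop()                  # leading coefficient of p
        shift = len(p) - n              # rewrite t^deg(p) using t^n = -(f - t^n)
        for i in range(n):
            p[shift + i] -= lead * f[i]
    return p + [Fraction(0)] * (n - len(p))

def mul_mod(a, b, f):
    """Product of the cosets of a and b in k[t]/(f)."""
    prod = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            prod[i + j] += x * y
    return reduce_mod(prod, f)

def evaluate_at(g, x, f):
    """Evaluate the polynomial g at the element x of k[t]/(f), by Horner's rule."""
    result = [Fraction(0)] * (len(f) - 1)
    for c in reversed(g):
        result = mul_mod(result, x, f)
        result[0] += c
    return result

f = [-2, 0, 1]                  # f(t) = t^2 - 2, irreducible over Q
alpha = [0, 1]                  # the coset of t
value = evaluate_at(f, alpha, f)            # f evaluated at the coset of t
inverse_check = mul_mod([1, 1], [-1, 1], f)  # (1 + alpha)(alpha - 1)
```

Here `value` is the zero element, so the coset of $t$ is indeed a root of $f$; and `inverse_check` is the element 1, exhibiting $\alpha - 1$ as the inverse of $1 + \alpha$. Elements are recorded by two coordinates, in agreement with the degree of the extension being $2 = \deg f$.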


The reader may wonder whether perhaps all finite extensions are of this type. 
This is unfortunately not the case, but as it happens, it is the case in a large
class of examples. As we often do, we promise that the situation will clarify itself 
considerably once we have accumulated more general knowledge; in this case, the 
relevant fact will be Proposition 5.19. 


Also recall that we proved that these extensions are almost universal with
respect to the problem of extending $k$ so that the polynomial $f(t)$ acquires a root.
We pointed out, however, that the uni in universal is missing; that is, if $k \subseteq F$ is
an extension of $k$ in which $f(t)$ has a root, there may be many different ways to
put $K$ in between:

$$k \subseteq K \subseteq F.$$

Such questions are central to the study of extensions, so we begin by giving a second 
look at this situation. 


1.2. Simple extensions. Let $k \subseteq F$ be a field extension, and let $\alpha \in F$. The
smallest subfield of $F$ containing both $k$ and $\alpha$ is denoted $k(\alpha)$; that is, $k(\alpha)$ is the
intersection of all subfields of $F$ containing $k$ and $\alpha$.

Definition 1.2. A field extension $k \subseteq F$ is simple if there exists an element $\alpha \in F$
such that $F = k(\alpha)$.

The extensions recalled above are of this kind: if $K = k[t]/(f(t))$ and $\alpha$ denotes
the coset of $t$, then $K = k(\alpha)$: indeed, if a subfield of $K$ contains the coset of $t$,
then it must contain (the coset of) every polynomial expression in $t$, and hence it
must be the whole of $K$.

We have always found the notation $k(\alpha)$ somewhat unfortunate, since it suggests
that all such extensions are in some way isomorphic and possibly all isomorphic
to the field $k(t)$ of rational functions in one indeterminate $t$ (cf. Example V.4.12).
This is not true, although it is clear that every element of $k(\alpha)$ may be written as
a rational function in $\alpha$ with coefficients in $k$ (Exercise 1.3). In any case, it is easy
to classify simple extensions: they are either isomorphic to $k(t)$ or they are of the
prototypical kind recalled above. Here is the precise statement.

Proposition 1.3. Let $k \subseteq k(\alpha)$ be a simple extension. Consider the evaluation
map $\epsilon : k[t] \to k(\alpha)$, defined by $f(t) \mapsto f(\alpha)$. Then we have the following:

• $\epsilon$ is injective if and only if $k \subseteq k(\alpha)$ is an infinite extension. In this case,
$k(\alpha)$ is isomorphic to the field of rational functions $k(t)$.

• $\epsilon$ is not injective if and only if $k \subseteq k(\alpha)$ is finite. In this case there exists
a unique monic irreducible nonconstant polynomial $p(t) \in k[t]$ of degree
$n = [k(\alpha) : k]$ such that

$$k(\alpha) \cong \frac{k[t]}{(p(t))}.$$

Via this isomorphism, $\alpha$ corresponds to the coset of $t$. The polynomial $p(t)$ is
the monic polynomial of smallest degree in $k[t]$ such that $p(\alpha) = 0$ in $k(\alpha)$.


The polynomial $p(t)$ appearing in the second part of the statement is called the
minimal polynomial of $\alpha$ over $k$.

Of course the minimal polynomial of an element $\alpha$ of a (‘large’) field depends
on the base (‘small’) field $k$. For example, $\sqrt{2} \in \mathbb{C}$ has minimal polynomial $t^2 - 2$
over $\mathbb{Q}$, but $t - \sqrt{2}$ over $\mathbb{R}$.


Proof. Let $F = k(\alpha)$. By the ‘first isomorphism theorem’, the image of $\epsilon : k[t] \to F$
is isomorphic to $k[t]/\ker(\epsilon)$. Since $F$ is an integral domain, so is $k[t]/\ker(\epsilon)$; hence
$\ker(\epsilon)$ is a prime ideal in $k[t]$.

• Assume $\ker(\epsilon) = 0$; that is, $\epsilon$ is an injective map from the integral domain
$k[t]$ to the field $F$. By the universal property of fields of fractions (cf. §V.4.2), $\epsilon$
extends to a unique homomorphism

$$k(t) \to F.$$

The (isomorphic) image of $k(t)$ in $F$ is a field containing $k$ and $\alpha$; hence it equals $F$
by definition of simple extension.

Since $\epsilon$ is injective, the powers $\alpha^0 = 1, \alpha, \alpha^2, \alpha^3, \dots$ (that is, the images $\epsilon(t^i)$)
are all distinct and linearly independent over $k$ (because the powers $1, t, t^2, \dots$ are
linearly independent over $k$); therefore the extension $k \subseteq F$ is infinite in this case.

• If $\ker(\epsilon) \neq 0$, then $\ker(\epsilon) = (p(t))$ for a unique monic irreducible nonconstant
polynomial $p(t)$, which has smallest degree (cf. Exercise III.4.4!) among all nonzero
polynomials in $\ker(\epsilon)$. As $(p(t))$ is then maximal in $k[t]$, the image of $\epsilon$ is a subfield
of $F$ containing $\alpha = \epsilon(t)$. By definition of simple extension, $F$ = the image of $\epsilon$;
that is, the induced homomorphism

$$\frac{k[t]}{(p(t))} \to F$$

is an isomorphism. In this case $[F : k] = \deg p(t)$, as recalled in §1.1, and in
particular the extension is finite, as claimed. □


The alert reader will have noticed that the proof of Proposition 1.3 is essentially 
a rehash of the argument proving the ‘versality’ part of Proposition V.5.7. 

Example 1.4. Consider the extension $\mathbb{Q} \subseteq \mathbb{R}$.
The polynomial $x^2 - 2 \in \mathbb{Q}[x]$ has roots in $\mathbb{R}$: therefore, by Proposition V.5.7
there exists a homomorphism (hence a field extension)

$$\bar\epsilon : \frac{\mathbb{Q}[t]}{(t^2 - 2)} \to \mathbb{R},$$

such that the image of (the coset of) $t$ is a root $\alpha$ of $x^2 - 2$. Proposition 1.3 simply
identifies the image of this homomorphism with $\mathbb{Q}(\alpha) \subseteq \mathbb{R}$.

This is hopefully crystal clear; however, note that even this simple example
shows that the induced morphism $\bar\epsilon$ is not unique (hence the ‘lack of uni’), because
there is more than one root of $x^2 - 2$ in $\mathbb{R}$. Concretely, there are two possible
choices for $\alpha$: $\alpha = +\sqrt{2}$ and $\alpha = -\sqrt{2}$. The choice of $\alpha$ determines the evaluation
map $\epsilon$ and therefore the specific realization of $\mathbb{Q}[t]/(t^2 - 2)$ as a subfield of $\mathbb{R}$.

The reader may be misled by one feature of this example: clearly $\mathbb{Q}(\sqrt{2}) =
\mathbb{Q}(-\sqrt{2})$, and this may seem to compensate for the lack of uniqueness lamented
above. The morphism may not be unique, but the image of the morphism surely
is? No. The reader will check that there are three distinct subfields of $\mathbb{C}$ isomorphic
to $\mathbb{Q}[t]/(t^3 - 2)$ (Exercise 1.5).

Ay, there’s the rub. One of our main goals in this chapter will be to single out
a condition on an extension $k \subseteq F$ that guarantees that no matter how we embed $F$
in a larger extension, the images of these (possibly many different) embeddings will
all coincide. Up to further technicalities, this is what makes an extension Galois.
Thus, $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2})$ will be a Galois extension, while $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$ will not be a Galois
extension. But Galois extensions will have to wait until §6, and the reader can put
this issue aside for now.


One way to recover uniqueness is to incorporate the choice of the root in the 
data. This leads to the following refinement of the (uni)versality statement, adapted 
for future applications. 

Proposition 1.5. Let $k_1 \subseteq F_1 = k_1(\alpha_1)$, $k_2 \subseteq F_2 = k_2(\alpha_2)$ be two finite simple
extensions. Let $p_1(t) \in k_1[t]$, resp., $p_2(t) \in k_2[t]$, be the minimal polynomials of $\alpha_1$,
resp., $\alpha_2$. Let $i : k_1 \to k_2$ be an isomorphism, such that$^4$

$$i(p_1(t)) = p_2(t).$$

Then there exists a unique isomorphism $j : F_1 \to F_2$ agreeing with $i$ on $k_1$ and such
that $j(\alpha_1) = \alpha_2$.

Proof. Since every element of $k_1(\alpha_1)$ is a linear combination of powers of $\alpha_1$ with
coefficients in $k_1$, $j$ is determined by its action on $k_1$ (which agrees with $i$) and by
$j(\alpha_1)$, which is prescribed to be $\alpha_2$. Thus an isomorphism $j$ as in the statement is
uniquely determined.

To see that the isomorphism $j$ exists, note that as $i$ maps $p_1(t)$ to $p_2(t)$, it
induces an isomorphism

$$\frac{k_1[t]}{(p_1(t))} \xrightarrow{\ \sim\ } \frac{k_2[t]}{(p_2(t))}.$$

Composing with the isomorphisms found in Proposition 1.3 gives $j$:

$$j : k_1(\alpha_1) \xrightarrow{\ \sim\ } \frac{k_1[t]}{(p_1(t))} \xrightarrow{\ \sim\ } \frac{k_2[t]}{(p_2(t))} \xrightarrow{\ \sim\ } k_2(\alpha_2),$$

as needed. □

$^4$ Of course any homomorphism of rings $f : R \to S$ induces a unique ring homomorphism
$f : R[t] \to S[t]$ sending $t$ to $t$, named in the same way by a harmless abuse of language.


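Thus, isomorphisms lift uniquely to simple extensions, so long as they preserve
minimal polynomials. We may say that $j$ extends $i$, and draw the following diagram
of extensions:

$$\begin{array}{ccc}
k_1(\alpha_1) & \xrightarrow{\ j\ } & k_2(\alpha_2) \\
\cup & & \cup \\
k_1 & \xrightarrow{\ i\ } & k_2
\end{array}$$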

We will be particularly interested in considering isomorphisms of a field to
itself, subject to the condition of fixing a specified subfield $k$ (that is, extending the
identity on $k$), that is, automorphisms in $\mathsf{Fld}_k$. Explicitly, for $k \subseteq F$ an extension,
we will analyze isomorphisms $j : F \to F$ such that $\forall c \in k$, $j(c) = c$.

We will denote the group of such automorphisms by $\mathrm{Aut}_k(F)$. This is probably
the most important object of study in this chapter, so we should make its definition
official:

Definition 1.6. Let $k \subseteq F$ be a field extension. The group of automorphisms of
the extension, denoted $\mathrm{Aut}_k(F)$, is the group of field automorphisms $j : F \to F$
such that $j|_k = \mathrm{id}_k$.

Corollary 1.7. Let $k \subseteq F = k(\alpha)$ be a simple finite extension, and let $p(x)$ be
the minimal polynomial of $\alpha$ over $k$. Then $|\mathrm{Aut}_k(F)|$ equals the number of distinct
roots of $p(x)$ in $F$; in particular,

$$|\mathrm{Aut}_k(F)| \le [F : k],$$

with equality if and only if $p(x)$ factors over $F$ as a product of distinct linear
polynomials.


Proof. Let $j \in \mathrm{Aut}_k(F)$. Since every element of $F$ is a polynomial expression in $\alpha$
with coefficients in $k$, and $j$ extends the identity on $k$, $j$ is determined by $j(\alpha)$.
Now

$$p(j(\alpha)) = j(p(\alpha)) = j(0) = 0:$$

therefore, $j(\alpha)$ is necessarily a root of $p(x)$. This shows that $|\mathrm{Aut}_k(F)|$ is no larger
than the number of roots of $p(x)$.

On the other hand, by Proposition 1.5 every choice of a root of $p(t)$ determines a
lift of the identity to an element $j \in \mathrm{Aut}_k(F)$, establishing the other inequality. □
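As a quick illustration of both possibilities allowed by Corollary 1.7, consider the two extensions of $\mathbb{Q}$ already met in Example 1.4 and the discussion following it. The computation below is a worked instance added for orientation, not part of the original argument; it uses only that $x^3 - 2$ has a single real root and that $\mathbb{Q}(\sqrt[3]{2}) \subseteq \mathbb{R}$.

```latex
\begin{align*}
F = \mathbb{Q}(\sqrt{2}),\quad p(x) = x^2 - 2 = (x - \sqrt{2})(x + \sqrt{2})
  &\implies |\mathrm{Aut}_{\mathbb{Q}}(F)| = 2 = [F : \mathbb{Q}], \\
F = \mathbb{Q}(\sqrt[3]{2}) \subseteq \mathbb{R},\quad p(x) = x^3 - 2
  \text{ has only the root } \sqrt[3]{2} \text{ in } F
  &\implies |\mathrm{Aut}_{\mathbb{Q}}(F)| = 1 < 3 = [F : \mathbb{Q}].
\end{align*}
```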

Corollary 1.7 is our first hint of the powerful interaction between group theory 
and the theory of field extensions; this theme will haunt us throughout the chapter. 

The proof of Corollary 1.7 really sets up a bijection between the roots of an
irreducible polynomial $p(t) \in k[t]$ in a larger field $F$ and the group $\mathrm{Aut}_k(F)$. A
particularly optimistic reader may then hope that the roots of $p(t)$ come endowed
with a group structure, but this is hoping for too much: the bijection depends on
the choice of one root $\alpha$, and no one root of $p(t)$ is ‘more beautiful’ than the others.
However, we have established that if $F = k(\alpha)$ is a simple extension of $k$, then the
group $\mathrm{Aut}_k(F)$ acts faithfully and transitively on the set of roots of $p(t)$ in $F$. In
other words, $\mathrm{Aut}_k(F)$ can be identified with a certain subgroup of the symmetric
group acting on the set of roots of $p(t)$.

More generally (Exercise 1.6) the automorphisms of any extension act on roots
of polynomials. One conclusion we can draw from these preliminary considerations
is that the analysis of $\mathrm{Aut}_k(F)$ is substantially simplified if $k \subseteq F$ is a simple
extension $k(\alpha)$ and further if the minimal polynomial $p(x)$ of $\alpha$ factors into $\deg p(x)$
distinct linear terms in $F$. It will not come as a surprise that we will focus our
attention on such extensions in later sections: an extension is Galois precisely if it
is of this type. (The reader is invited to remember this fact, but its import will not
be fully appreciated until we develop substantially more material.)

1.3. Finite and algebraic extensions. The dichotomy encountered in
Proposition 1.3 provides us with one of the most important definitions in field theory:

Definition 1.8. Let $k \subseteq F$ be a field extension, and let $\alpha \in F$. Then $\alpha$ is algebraic
over $k$, of degree $n$, if $n = [k(\alpha) : k]$ is finite; $\alpha$ is transcendental over $k$ otherwise.
The extension $k \subseteq F$ is algebraic if every $\alpha \in F$ is algebraic over $k$.

By Proposition 1.3, $\alpha \in F$ is algebraic over $k$ if and only if there exists a
nonzero polynomial $f(x) \in k[x]$ such that $f(\alpha) = 0$. The minimal polynomial of
$\alpha$ is the monic polynomial of smallest degree satisfying this condition; as we have
seen, it is necessarily irreducible.

Also note that if $\alpha$ is algebraic over $k$, then every element of $k(\alpha)$ may in fact
be written as a polynomial in $\alpha$ with coefficients in $k$.

Finite extensions are necessarily algebraic: 

Lemma 1.9. Let $k \subseteq F$ be a finite extension. Then every $\alpha \in F$ is algebraic over
$k$, of degree $\le [F : k]$.

Proof. Since $k \subseteq k(\alpha) \subseteq F$, the dimension of $k(\alpha)$ as a $k$-vector space is bounded
by $\dim_k F = [F : k]$. □

Concretely, if $k \subseteq F$ is finite and $\alpha \in F$, then the powers $1, \alpha, \alpha^2, \dots$ are
necessarily linearly dependent; and any nontrivial linear dependence relation among
them provides us with a nonzero polynomial $f(x) \in k[x]$ such that $f(\alpha) = 0$.
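The remark above can be carried out by machine in a small case. The following Python sketch is an illustration of ours, under the assumption that elements of $\mathbb{Q}(\sqrt{2})$ are stored as pairs $(a, b)$ standing for $a + b\sqrt{2}$; it finds the linear dependence among $1, \alpha, \alpha^2$ for $\alpha = 1 + \sqrt{2}$, and hence a polynomial vanishing at $\alpha$.

```python
from fractions import Fraction

def mul(x, y):
    """Product in Q(sqrt 2); the pair (a, b) stands for a + b*sqrt(2)."""
    a, b = x
    c, d = y
    return (a * c + 2 * b * d, a * d + b * c)

alpha = (Fraction(1), Fraction(1))     # alpha = 1 + sqrt(2)
alpha2 = mul(alpha, alpha)             # alpha^2 = 3 + 2*sqrt(2)

# Look for rationals q1, q0 with alpha^2 + q1*alpha + q0 = 0.
# The sqrt(2)-coordinate determines q1; the rational coordinate then gives q0.
q1 = -alpha2[1] / alpha[1]
q0 = -(alpha2[0] + q1 * alpha[0])

# Both coordinates of alpha^2 + q1*alpha + q0 should vanish.
residual = (alpha2[0] + q1 * alpha[0] + q0, alpha2[1] + q1 * alpha[1])
print(q1, q0, residual)
```

The relation found is $\alpha^2 - 2\alpha - 1 = 0$, so $\alpha$ is a root of $x^2 - 2x - 1 \in \mathbb{Q}[x]$. For larger extensions the same idea requires solving a genuine linear system, for instance by Gaussian elimination over $\mathbb{Q}$.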

The literal converse of the claim that ‘finite extensions are algebraic’ is not true; 
we will see an example in a moment. However, something along these lines holds; 
understanding the situation requires a more careful look at finiteness conditions. 

First of all, compositions of finite extensions are finite extensions, and the 
degree behaves nicely with respect to this operation: 

Proposition 1.10. Let $k \subseteq E \subseteq F$ be field extensions. Then $k \subseteq F$ is finite if and
only if both $k \subseteq E$ and $E \subseteq F$ are finite. In this case,

$$[F : k] = [F : E][E : k].$$

Proof. If $F$ is finite-dimensional as a vector space over $k$, then so is its subspace $E$;
and any linear dependence relation of elements of $F$ over the field $k$ gives one over
the larger field $E$. It follows that if $k \subseteq F$ is finite, then so are $k \subseteq E$ and $E \subseteq F$.

Conversely, assume $k \subseteq E$ and $E \subseteq F$ are both finite. Let $(e_1, \dots, e_m)$ be a
basis for $E$ over $k$, and let $(f_1, \dots, f_n)$ be a basis for $F$ over $E$. It will suffice to
show that the $mn$ products

$$(e_1 f_1, e_1 f_2, \dots, e_m f_n)$$

form a basis for $F$ over $k$.

Let $g \in F$. Then $\exists d_1, \dots, d_n \in E$ such that

$$g = \sum_{j=1}^{n} d_j f_j,$$

since the elements $f_j$ span $F$ over $E$. Since $E$ is spanned by the elements $e_i$ over
$k$, $\exists c_{1j}, \dots, c_{mj} \in k$ such that $\forall j$

$$d_j = \sum_{i=1}^{m} c_{ij} e_i.$$

It follows that

$$g = \sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} e_i f_j:$$

therefore the products $e_i f_j$ span $F$ over $k$. (This suffices to prove that $k \subseteq F$ is
finite.)

To verify that these elements are linearly independent, assume

$$\sum_{i,j} \lambda_{ij} e_i f_j = 0$$

for $\lambda_{ij} \in k$. Then we have

$$\sum_{j} \Big( \sum_{i} \lambda_{ij} e_i \Big) f_j = 0,$$

implying $\sum_i \lambda_{ij} e_i = 0$ for all $j$, since the elements $f_j$ are linearly independent
over $E$. As the elements $e_i$ are linearly independent over $k$, this shows that all $\lambda_{ij}$
equal 0, as needed. □
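A small instance of the basis constructed in this proof may help fix ideas. The example is ours, chosen because the only fact it needs is that $i \notin \mathbb{Q}(\sqrt{2})$, which holds since $\mathbb{Q}(\sqrt{2}) \subseteq \mathbb{R}$.

```latex
\begin{align*}
k = \mathbb{Q} \;\subseteq\; E = \mathbb{Q}(\sqrt{2}) \;\subseteq\; F = E(i):
  \qquad & (e_1, e_2) = (1, \sqrt{2}), \quad (f_1, f_2) = (1, i), \\
  & (e_1 f_1,\, e_1 f_2,\, e_2 f_1,\, e_2 f_2) = (1,\; i,\; \sqrt{2},\; \sqrt{2}\, i), \\
  & [F : \mathbb{Q}] = [F : E]\,[E : \mathbb{Q}] = 2 \cdot 2 = 4 .
\end{align*}
```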

The formula given in the statement should look familiar to the reader: glance
at the end of §II.8.5. Of course this is not accidental; the reader should take it as
another hint that the world of groups and the world of fields have deep interaction.
As in the case of groups, we draw the immediate (but powerful) consequence,
reminiscent of Lagrange’s theorem:

Corollary 1.11. Let $k \subseteq F$ be a finite extension, and let $E$ be an intermediate
field (that is, $k \subseteq E \subseteq F$). Then both $[E : k]$ and $[F : E]$ divide $[F : k]$.

As with Lagrange’s theorem, judicious use of this result will make otherwise 
mysterious statements melt into trivialities. 
Example 1.12. Let $k \subseteq F$ be a field extension, and let $\alpha \in F$ be an algebraic
element over $k$, of odd degree. Then we claim that $\alpha$ may be written as a polynomial
in $\alpha^2$, with coefficients in $k$.

Indeed, $k(\alpha^2)$ is intermediate between $k$ and $k(\alpha)$:

$$k \subseteq k(\alpha^2) \subseteq k(\alpha);$$

what can we say about the degree $d$ of $k(\alpha)$ over $k(\alpha^2)$? Since $\alpha$ satisfies the
polynomial $t^2 - \alpha^2 \in k(\alpha^2)[t]$, $d \le 2$. On the other hand, $d$ divides $[k(\alpha) : k]$
by Corollary 1.11, and $[k(\alpha) : k]$ is odd, so $d \neq 2$. Therefore $d = 1$, proving
$k(\alpha) = k(\alpha^2)$, and in particular $\alpha \in k(\alpha^2)$, which is the claim.
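In a specific case the polynomial promised by Example 1.12 can be written down by hand. The instance below is our own illustration, with $\alpha = \sqrt[3]{2}$ of degree 3 over $\mathbb{Q}$; it uses only the relation $\alpha^3 = 2$.

```latex
\alpha = \sqrt[3]{2}: \qquad
(\alpha^2)^2 = \alpha^4 = \alpha^3 \cdot \alpha = 2\alpha
\quad\Longrightarrow\quad
\alpha = \tfrac{1}{2}\,(\alpha^2)^2 .
```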

Here is something else that should evoke fond memories for the reader: back
in §III.6.5 we had encountered an important distinction between finite algebras
and algebras of finite type. An algebra is finite over the base ring if it is finitely
generated as a module, that is, if it admits an onto homomorphism (of modules)
from a finitely generated free module; a commutative algebra is of finite type if
it admits an onto homomorphism (of algebras) from a polynomial ring in finitely
many variables.

Something along the same lines is going to occur here. An extension $k \subseteq F$ is
finite if and only if $\dim_k F$ is finite, that is, if and only if $F$ is a finite $k$-algebra.
The other finiteness condition takes the following form.

Definition 1.13. A field extension $k \subseteq F$ is finitely generated if there exist
$\alpha_1, \dots, \alpha_n \in F$ such that

$$F = k(\alpha_1)(\alpha_2) \dots (\alpha_n).$$

Remark 1.14. This is not literally the same as saying that $F$ is a finite-type
$k$-algebra: even in the case of simple extensions, if $\alpha$ is transcendental, then the
natural evaluation map $\epsilon : k[t] \to k(\alpha)$ of Proposition 1.3 is not onto. However,
this is an epimorphism of rings in the categorical sense, as the reader will check
(Exercise 1.17); thus, the ‘categorical spirit’ behind the finite-type condition is
preserved in this context.

This subtlety raises an interesting question: what if a field extension $k \subseteq F$ is
really a finite-type $k$-algebra? This turns out to be an important issue, and we will
come back to it in §2.2.

We will use the notation

$$k(\alpha_1, \alpha_2, \dots, \alpha_n)$$

for the field $k(\alpha_1)(\alpha_2) \dots (\alpha_n)$. For $k \subseteq F$ and $\alpha_1, \dots, \alpha_n \in F$, the elements
of the field $k(\alpha_1, \dots, \alpha_n)$ are all the elements of $F$ which may be written
as rational functions in the $\alpha_i$’s, with coefficients in $k$ (cf. Exercise 1.3). Put
otherwise, $k(\alpha_1, \dots, \alpha_n)$ is the smallest subfield of $F$ containing $k(\alpha_1), \dots, k(\alpha_n)$
(it is the composite of these subfields). It is clear that the order of the elements $\alpha_i$
is irrelevant.

Coming back to the issue of finite vs. algebraic, these two notions do coincide
for finitely generated extensions.
Proposition 1.15. Let $k \subseteq F = k(\alpha_1, \dots, \alpha_n)$ be a finitely generated field
extension. Then the following are equivalent:

(i) $k \subseteq F$ is a finite extension.

(ii) $k \subseteq F$ is an algebraic extension.

(iii) Each $\alpha_i$ is algebraic over $k$.

If these conditions are satisfied, then $[F : k] \le$ the product of the degrees of $\alpha_i$
over $k$.

Proof. Lemma 1.9 shows that (i) $\Rightarrow$ (ii); (ii) $\Rightarrow$ (iii) trivially. Thus, we only
need to prove that (iii) $\Rightarrow$ (i), and to bound the degree of $F$ over $k$ in the process.

Assume that each $\alpha_i$ is algebraic over $k$, and let $d_i$ be the degree of $\alpha_i$ over $k$.
By definition, $k \subseteq k(\alpha_i)$ is finite, of degree $d_i$. By Exercise 1.16, each extension

$$k(\alpha_1, \dots, \alpha_{i-1}) \subseteq k(\alpha_1, \dots, \alpha_i)$$

is finite, of degree $\le d_i$. Applying Proposition 1.10 to the composition of extensions

$$k \subseteq k(\alpha_1) \subseteq k(\alpha_1, \alpha_2) \subseteq \cdots \subseteq k(\alpha_1, \dots, \alpha_n) = F$$

proves that $k \subseteq F$ is finite and $[F : k] \le d_1 \cdots d_n$, as needed. □

While rather straightforward, Proposition 1.15 has always seemed remarkable
to us: it says (in particular) that if $\alpha$ and $\beta$ are algebraic over a field $k$, then so are
$\alpha + \beta$, $\alpha\beta$, $\alpha\beta^{-1}$ (for $\beta \neq 0$), and any other rational function of $\alpha$ and $\beta$. If all
we knew were the simple-minded definition of ‘algebraic’ (that is, ‘root of a
polynomial’), this would seem rather mysterious: given a polynomial $f(x)$ of which
$\alpha$ is a root and a polynomial $g(x)$ of which $\beta$ is a root, how do we construct a
polynomial $h(x)$ such that (for example) $h(\alpha + \beta) = 0$? The answer is that we do not
need to perform any such construction, by virtue of Proposition 1.15.

One immediate consequence of this observation is that the set of algebraic 
elements of any extension forms a field: 

Corollary 1.16. Let $k \subseteq F$ be a field extension. Let

$$E = \{\alpha \in F \mid \alpha \text{ is algebraic over } k\}.$$

Then $E$ is a field.

Example 1.17. Let $\overline{\mathbb{Q}} \subseteq \mathbb{C}$ be the set of complex numbers that are algebraic over
$\mathbb{Q}$; then $\overline{\mathbb{Q}}$ is a field, by Corollary 1.16, and the extension $\mathbb{Q} \subseteq \overline{\mathbb{Q}}$ is (tautologically)
algebraic. Note that $\mathbb{Q} \subseteq \overline{\mathbb{Q}}$ is not a finite extension, because in it there are elements
of arbitrarily high degree over $\mathbb{Q}$: indeed, there exist irreducible polynomials in $\mathbb{Q}[x]$
of arbitrarily high degree, as we observed in Corollary V.5.16.

Elements of $\overline{\mathbb{Q}}$ are called algebraic numbers; the complex numbers in the
complement of $\overline{\mathbb{Q}}$ are transcendental over $\mathbb{Q}$ (by definition); they are simply called
transcendental numbers. Elementary cardinality considerations show that $\overline{\mathbb{Q}}$ is
countable; thus, the set of transcendental numbers is uncountable. Surprisingly,
it may be substantially difficult to prove that a given number is transcendental:
for example, the fact that $e$ is transcendental was only proved around 1870, by
Charles Hermite. The number $\pi$ is transcendental (Lindemann, ca. 1880); $e^\pi$ is
transcendental (Gelfond-Schneider, 1934). No one knows whether $\pi^e$, $\pi + e$, or $\pi e$
are transcendental.

Another consequence of Proposition 1.15 is the important fact that composi- 
tions of algebraic extensions are algebraic (whether finitely generated or not): 

Corollary 1.18. Let $k \subseteq E \subseteq F$ be field extensions. Then $k \subseteq F$ is algebraic if
and only if both $k \subseteq E$ and $E \subseteq F$ are algebraic.

Proof. If $k \subseteq F$ is algebraic, then every element of $F$ is algebraic over $k$, hence
over $E$, and every element of $E$ is algebraic over $k$; thus $E \subseteq F$ and $k \subseteq E$ are
algebraic.

Conversely, assume $k \subseteq E$ and $E \subseteq F$ are both algebraic, and let $\alpha \in F$. Since
$\alpha$ is algebraic over $E$, there exists a polynomial

$$f(x) = x^n + e_{n-1} x^{n-1} + \cdots + e_0 \in E[x]$$

such that $f(\alpha) = 0$. This implies that in fact $\alpha$ is ‘already’ algebraic over the
subfield $k(e_0, \dots, e_{n-1}) \subseteq E$; therefore,

$$k(e_0, \dots, e_{n-1}) \subseteq k(e_0, \dots, e_{n-1}, \alpha)$$

is a finite extension. On the other hand,

$$k \subseteq k(e_0, \dots, e_{n-1})$$

is a finite extension by Proposition 1.15, since each $e_i$ is in $E$ and therefore algebraic
over $k$. By Proposition 1.10

$$k \subseteq k(e_0, \dots, e_{n-1}, \alpha)$$

is finite. This implies that $\alpha$ is algebraic over $k$, as needed, by Lemma 1.9. □

We will essentially deal only with finitely generated extensions, and Proposi- 
tion 1.15 will simplify our work considerably. Also, since finitely generated exten- 
sions are compositions of simple extensions, the reader should expect that we will 
give a careful look at automorphisms of such extensions. 

It may in fact come as a surprise that, in many cases, finitely generated ex- 
tensions turn out to be simple to begin with; we will prove a precise statement to 
this effect in due time (Proposition 5.19). The reader already has enough tools to
contemplate easy (but interesting) examples, such as the following. This should 
serve as an encouragement to look at many more. 

Example 1.19. Consider the extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2}, \sqrt{3})$.

• By Proposition 1.15 we know that this is a finite (hence algebraic) extension,
of degree at most 4.

• Thus any five elements in $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ must be linearly dependent over $\mathbb{Q}$. We
consider powers of $\sqrt{2} + \sqrt{3}$:

$$1,\ (\sqrt{2} + \sqrt{3}),\ (\sqrt{2} + \sqrt{3})^2,\ (\sqrt{2} + \sqrt{3})^3,\ (\sqrt{2} + \sqrt{3})^4$$

must be linearly dependent. Thus, there must be rational numbers $q_0, \dots, q_3$ such
that

$$(\sqrt{2} + \sqrt{3})^4 + q_3 (\sqrt{2} + \sqrt{3})^3 + q_2 (\sqrt{2} + \sqrt{3})^2 + q_1 (\sqrt{2} + \sqrt{3}) + q_0 = 0.$$

• Elementary arithmetic shows that this relation is satisfied for $q_0 = 1$, $q_1 = 0$,
$q_2 = -10$, $q_3 = 0$: that is, $\sqrt{2} + \sqrt{3}$ is a root of the polynomial

$$f(t) = t^4 - 10 t^2 + 1.$$

In fact, it is easily checked that $f(t)$ vanishes at all four combinations

$$\pm\sqrt{2} \pm \sqrt{3}.$$

• It follows that

$$f(t) = (t - (-\sqrt{2} - \sqrt{3}))(t - (-\sqrt{2} + \sqrt{3}))(t - (\sqrt{2} - \sqrt{3}))(t - (\sqrt{2} + \sqrt{3}))$$

and that $f(t)$ is irreducible over $\mathbb{Q}$ (no proper factor of $f(t)$ has rational coefficients).
Thus $\sqrt{2} + \sqrt{3}$ has degree 4 over $\mathbb{Q}$.

• Consider the composition of extensions

$$\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2} + \sqrt{3}) \subseteq \mathbb{Q}(\sqrt{2}, \sqrt{3}).$$

By Corollary 1.11,

$$4 = [\mathbb{Q}(\sqrt{2} + \sqrt{3}) : \mathbb{Q}] \le [\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}].$$

On the other hand we know that $[\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}] \le 4$; thus we can conclude that
this degree is exactly 4, and it follows that

$$\mathbb{Q}(\sqrt{2}, \sqrt{3}) = \mathbb{Q}(\sqrt{2} + \sqrt{3}):$$

the extension was simple to begin with. (In due time we will prove that this is a
typical example, not a contrived one.)

• Staring now at the composition

$$\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2}) \subseteq \mathbb{Q}(\sqrt{2}, \sqrt{3}),$$

Proposition 1.10 tells us that $[\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}(\sqrt{2})] = 2$. That is, a side-effect of the
computation carried out above is that the polynomial $t^2 - 3$ must be irreducible
over $\mathbb{Q}(\sqrt{2})$; this is not surprising, but note that this method is very different from
anything we encountered back in §V.5.

• The fact that the other roots of the minimal polynomial of $\sqrt{2} + \sqrt{3}$ ‘looked
so much like’ $\sqrt{2} + \sqrt{3}$ is not surprising. Indeed, since $\mathbb{Q}(\sqrt{2}) \subseteq \mathbb{Q}(\sqrt{2}, \sqrt{3})$ is
a simple extension, we know (cf. Proposition 1.5 and following) that there is an
automorphism of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ fixing $\mathbb{Q}(\sqrt{2})$ (and hence $\mathbb{Q}$) and swapping $\sqrt{3}$ and
$-\sqrt{3}$. Similarly, there is an automorphism fixing $\mathbb{Q}(\sqrt{3})$ and swapping $\sqrt{2}$ and
$-\sqrt{2}$. These automorphisms must act on the set of roots of $f(t)$ (Exercise 1.6):
applying them and their composition to $\sqrt{2} + \sqrt{3}$ produces the other three roots of
$f(t)$.

• In fact, at this point we know that $G = \mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt{2}, \sqrt{3}))$ must consist
of 4 elements (by Corollary 1.7) and has at least two elements of order 2 (both
automorphisms found above have order 2). This is enough to conclude that $G$ is
not a cyclic group, and hence it must be isomorphic to the group $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$.
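The ‘elementary arithmetic’ of Example 1.19 can be checked exactly by machine. The Python sketch below is our own illustration: it assumes that elements of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ are stored as 4-tuples of coordinates with respect to $(1, \sqrt{2}, \sqrt{3}, \sqrt{6})$, and the names `mul`, `f`, `sigma`, and `tau` are hypothetical helpers. It verifies that $f(t) = t^4 - 10t^2 + 1$ vanishes at the four elements $\pm\sqrt{2} \pm \sqrt{3}$ and that the two sign-change automorphisms move $\sqrt{2} + \sqrt{3}$ through all four roots.

```python
from itertools import product

def mul(x, y):
    """Product of a0 + a1*sqrt2 + a2*sqrt3 + a3*sqrt6 and the analogous b."""
    a0, a1, a2, a3 = x
    b0, b1, b2, b3 = y
    return (
        a0*b0 + 2*a1*b1 + 3*a2*b2 + 6*a3*b3,     # coefficient of 1
        a0*b1 + a1*b0 + 3*(a2*b3 + a3*b2),       # coefficient of sqrt2
        a0*b2 + a2*b0 + 2*(a1*b3 + a3*b1),       # coefficient of sqrt3
        a0*b3 + a3*b0 + a1*b2 + a2*b1,           # coefficient of sqrt6
    )

def f(x):
    """Evaluate t^4 - 10 t^2 + 1 at x."""
    x2 = mul(x, x)
    x4 = mul(x2, x2)
    one = (1, 0, 0, 0)
    return tuple(c4 - 10*c2 + c0 for c4, c2, c0 in zip(x4, x2, one))

def sigma(x):
    """Automorphism sending sqrt2 to -sqrt2 (and so sqrt6 to -sqrt6)."""
    return (x[0], -x[1], x[2], -x[3])

def tau(x):
    """Automorphism sending sqrt3 to -sqrt3 (and so sqrt6 to -sqrt6)."""
    return (x[0], x[1], -x[2], -x[3])

roots = [(0, s, t, 0) for s, t in product((1, -1), repeat=2)]
alpha = (0, 1, 1, 0)                               # sqrt2 + sqrt3
orbit = {alpha, sigma(alpha), tau(alpha), sigma(tau(alpha))}

print([f(r) for r in roots])    # four zero tuples
print(orbit == set(roots))      # True: the orbit of alpha is the full set of roots
```

Since `sigma` and `tau` both square to the identity and are distinct, the four maps id, `sigma`, `tau`, and their composite realize the non-cyclic group of order 4 identified at the end of the example.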




Exercises 




1.1. ▷ Prove that if $k \subseteq K$ is a field extension, then char $k$ = char $K$. Prove that
the category $\mathsf{Fld}$ has no initial object. [§1.1]

1.2. Define carefully the category $\mathsf{Fld}_k$ of extensions of $k$.

1.3. ▷ Let $k \subseteq F$ be a field extension, and let $\alpha \in F$. Prove that the field $k(\alpha)$
consists of all the elements of $F$ which may be written as a rational function in $\alpha$,
with coefficients in $k$. Why does this not give (in general) an onto homomorphism
$k(t) \to k(\alpha)$? [§1.2, §1.3]

1.4. Let $k \subseteq k(\alpha)$ be a simple extension, with $\alpha$ transcendental over $k$. Let $E$ be a
subfield of $k(\alpha)$ properly containing $k$. Prove that $k(\alpha)$ is a finite extension of $E$.

1.5. ▷ (Cf. Example 1.4.)

• Prove that there is exactly one subfield of $\mathbb{R}$ isomorphic to $\mathbb{Q}[t]/(t^2 - 2)$.

• Prove that there are exactly three subfields of $\mathbb{C}$ isomorphic to $\mathbb{Q}[t]/(t^3 - 2)$.

From a ‘topological’ point of view, one of these copies of $\mathbb{Q}[t]/(t^3 - 2)$ looks very
different from the other two: it is not dense in $\mathbb{C}$, but the others are. [§1.2]

1.6. ▷ Let $k \subseteq F$ be a field extension, and let $f(x) \in k[x]$ be a polynomial. Prove
that $\mathrm{Aut}_k(F)$ acts on the set of roots of $f(x)$. Provide examples showing that this
action need not be transitive or faithful. [§1.2, §1.3]

1.7. Let $k \subseteq F$ be a field extension, and let $\alpha \in F$ be algebraic over $k$.

• Suppose $p(x) \in k[x]$ is an irreducible monic polynomial such that $p(\alpha) = 0$; prove
that $p(x)$ is the minimal polynomial of $\alpha$ over $k$, in the sense of Proposition 1.3.

• Let $f(x) \in k[x]$. Prove that $f(\alpha) = 0$ if and only if $p(x) \mid f(x)$.

• Show that the minimal polynomial of $\alpha$ is the minimal polynomial of a certain
$k$-linear transformation of $F$, in the sense of Definition VI.6.12.

1.8. ¬ Let $f(x) \in k[x]$ be a polynomial over a field $k$ of degree $d$, and let $\alpha_1, \dots, \alpha_d$
be the roots of $f(x)$. For a subset $I \subseteq \{1, \dots, d\}$, denote by $\alpha_I$ the sum $\sum_{i \in I} \alpha_i$.
Assume that $\alpha_I \in k$ only for $I = \emptyset$ and $I = \{1, \dots, d\}$. Prove that $f(x)$ is irreducible
over $k$. [7.14]

1.9. Let $k$ be a finite field. Prove that the order $|k|$ is a power of a prime integer.

1.10. ¬ Let $k$ be a field. Prove that the ring of square $n \times n$ matrices $M_n(k)$
contains an isomorphic copy of every extension of $k$ of degree $\le n$. (Hint: If $k \subseteq F$
is an extension of degree $n$ and $\alpha \in F$, then ‘multiplication by $\alpha$’ is a $k$-linear
transformation of $F$.) [5.20]

1.11. ¬ Let $k \subseteq F$ be a finite field extension, and let $p(x)$ be the characteristic
polynomial of the $k$-linear transformation of $F$ given by multiplication by an
element $\alpha \in F$. Prove that $p(\alpha) = 0$.
This gives an effective way to find a polynomial satisfied by an element of an
extension. Use it to find a polynomial satisfied by $\sqrt{2} + \sqrt{3}$ over $\mathbb{Q}$, and compare
this method with the one used in Example 1.19. [1.12]

1.12. ¬ Let $k \subseteq F$ be a finite field extension, and let $\alpha \in F$. The norm of $\alpha$,
$N_{k \subseteq F}(\alpha)$, is the determinant of the linear transformation of $F$ given by
multiplication by $\alpha$ (cf. Exercise 1.11, Definition VI.6.4).

Prove that the norm is multiplicative: for $\alpha, \beta \in F$,

$$N_{k \subseteq F}(\alpha\beta) = N_{k \subseteq F}(\alpha)\, N_{k \subseteq F}(\beta).$$

Compute the norm of a complex number viewed as an element of the extension
$\mathbb{R} \subseteq \mathbb{C}$ (and marvel at the excellent choice of terminology). Do the same for elements
of an extension $\mathbb{Q}(\sqrt{d})$ of $\mathbb{Q}$, where $d$ is an integer that is not a square, and compare
the result with Exercise III.4.10. [1.13, 1.14, 1.15, 4.19, 6.18, VIII.1.5]

1.13. ¬ Define the trace $\mathrm{tr}_{k \subseteq F}(\alpha)$ of an element $\alpha$ of a finite extension $F$ of a
field $k$ by following the lead of Exercise 1.12. Prove that the trace is additive:

$$\mathrm{tr}_{k \subseteq F}(\alpha + \beta) = \mathrm{tr}_{k \subseteq F}(\alpha) + \mathrm{tr}_{k \subseteq F}(\beta)$$

for $\alpha, \beta \in F$. Compute the trace of an element of an extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{d})$, for $d$
an integer that is not a square. [1.14, 1.15, 4.19, VIII.1.5]

1.14. ¬ Let $k \subseteq k(\alpha)$ be a simple algebraic extension, and let
$x^d + a_{d-1} x^{d-1} + \cdots + a_0$ be the minimal polynomial of $\alpha$ over $k$. Prove that

$$\mathrm{tr}_{k \subseteq k(\alpha)}(\alpha) = -a_{d-1} \quad\text{and}\quad N_{k \subseteq k(\alpha)}(\alpha) = (-1)^d a_0.$$

(Cf. Exercises 1.12 and 1.13.) [4.19]

1.15. Let $k \subseteq F$ be a finite extension, and let $\alpha \in F$. Assume $[F : k(\alpha)] = r$.
Prove that

$$\mathrm{tr}_{k \subseteq F}(\alpha) = r\, \mathrm{tr}_{k \subseteq k(\alpha)}(\alpha) \quad\text{and}\quad N_{k \subseteq F}(\alpha) = N_{k \subseteq k(\alpha)}(\alpha)^r.$$

(Cf. Exercises 1.12 and 1.13.) (Hint: If $f_1, \dots, f_r$ is a basis of $F$ over $k(\alpha)$ and
$\alpha$ has degree $d$ over $k$, then $(f_i \alpha^j)_{i = 1, \dots, r;\ j = 0, \dots, d-1}$ is a basis of $F$ over $k$. The matrix
corresponding to multiplication by $\alpha$ with respect to this basis consists of $r$ identical
square blocks.) [4.19, 4.21]

1.16. ▷ Let $k \subseteq L \subseteq F$ be fields, and let $\alpha \in F$. If $k \subseteq k(\alpha)$ is a finite extension,
then $L \subseteq L(\alpha)$ is finite and $[L(\alpha) : L] \le [k(\alpha) : k]$. [§1.3]

1.17. ▷ Let $k \subseteq F = k(\alpha_1, \dots, \alpha_n)$ be a finitely generated extension. Prove that
the evaluation map

$$k[t_1, \dots, t_n] \to F, \qquad t_i \mapsto \alpha_i,$$

is an epimorphism of rings (although it need not be onto). [§1.3]

1.18. ¬ Let $R$ be a ring sandwiched between a field $k$ and an algebraic extension $F$
of $k$. Prove that $R$ is a field.

Is it necessary to assume that the extension is algebraic? [1.19]

1.19. Let $k \subseteq F$ be a field extension of degree $p$, a prime integer. Prove that
there are no subrings of $F$ properly containing $k$ and properly contained in $F$. (Use
Exercise 1.18.)

1.20. Let $p$ be a prime integer, and let $\alpha = \sqrt[p]{2} \in \mathbb{R}$. Let $g(x) \in \mathbb{Q}[x]$ be any
nonconstant polynomial of degree $< p$. Prove that $\alpha$ may be expressed as a polynomial
in $g(\alpha)$ with rational coefficients.

Prove that an analogous statement for $\sqrt[4]{2}$ is false.

1.21. Let $k \subseteq F$ be a field extension, and let $E$ be the intermediate field consisting
of the elements of $F$ which are algebraic over $k$. For $\alpha \in F$, prove that $\alpha$ is algebraic
over $E$ if and only if $\alpha \in E$. Deduce that $\overline{\mathbb{Q}}$ is algebraically closed.

1.22. Let $k \subseteq F$ be a field extension, and let $\alpha \in F$, $\beta \in F$ be algebraic, of degree
$d$, $e$, resp. Assume $d$, $e$ are relatively prime, and let $p(x)$ be the minimal polynomial
of $\beta$ over $k$. Prove $p(x)$ is irreducible over $k(\alpha)$.

1.23. Express \/S explicitly as a polynomial function in y/2 + V3 with rational 
coefficients. 

1.24. Generalize the situation examined in Example 1.19: let $k$ be a field of
characteristic $\neq 2$, and let $a, b \in k$ be elements that are not squares in $k$; prove that
$k(\sqrt{a}, \sqrt{b}) = k(\sqrt{a} + \sqrt{b})$.

Prove that $k(\sqrt{a}, \sqrt{b})$ has degree 2, resp., 4, over $k$ according to whether $ab$ is,
resp., is not, a square in $k$.

1.25. ¬ Let $\xi := \sqrt{2 + \sqrt{2}}$.

• Find the minimal polynomial of $\xi$ over $\mathbb{Q}$, and show that $\mathbb{Q}(\xi)$ has degree 4 over
$\mathbb{Q}$.

• Prove that $\sqrt{2 - \sqrt{2}}$ is another root of the minimal polynomial of $\xi$.

• Prove that $\sqrt{2 - \sqrt{2}} \in \mathbb{Q}(\xi)$. (Hint: $(a + b)(a - b) = a^2 - b^2$.)

• By Proposition 1.5, sending $\xi$ to $\sqrt{2 - \sqrt{2}}$ defines an automorphism of $\mathbb{Q}(\xi)$ over
$\mathbb{Q}$. Find the matrix of this automorphism w.r.t. the basis $1, \xi, \xi^2, \xi^3$.

• Prove that $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\xi))$ is cyclic of order 4.

(M 

1.26. ¬ Let $k \subseteq F$ be a field extension, let $I$ be an indexing set, and let
$\{\alpha_i\}_{i \in I}$
be a choice of elements of $F$. This choice determines a homomorphism $\varphi$ of
$k$-algebras from the polynomial ring $k[I]$ on the set $I$ to $F$ (the polynomial ring
is a free commutative $k$-algebra; cf. Proposition III.6.4). We say that $\{\alpha_i\}_{i \in I}$ is
algebraically independent over $k$ if $\varphi$ is injective. For example, distinct elements
$\alpha_1, \dots, \alpha_n$ of $F$ are algebraically independent over $k$ if there is no nonzero
polynomial $f(x_1, \dots, x_n) \in k[x_1, \dots, x_n]$ such that $f(\alpha_1, \dots, \alpha_n) = 0$.

Prove that $\alpha_1, \dots, \alpha_n$ are algebraically independent if and only if the
assignment $t_1 \mapsto \alpha_1, \dots, t_n \mapsto \alpha_n$ defines a homomorphism of $k$-algebras (and hence
an isomorphism) from the field of rational functions $k(t_1, \dots, t_n)$ to $k(\alpha_1, \dots, \alpha_n)$.
[1.27]

1.27. ¬ With notation and terminology as in Exercise 1.26, the indexed set $\{\alpha_i\}_{i \in I}$
is a transcendence basis for $F$ over $k$ if it is a maximal algebraically independent
set in $F$.

• Prove that $\{\alpha_i\}_{i \in I}$ is a transcendence basis for $F$ over $k$ if and only if it is
algebraically independent and $F$ is algebraic over $k(\{\alpha_i\}_{i \in I})$.

• Prove that transcendence bases exist. (Zorn)

• Prove that any two transcendence bases for $F$ over $k$ have the same cardinality.
(Mimic the proof of Proposition VI.1.9. Don’t feel too bad if you prefer to deal
only with the case of finite transcendence bases.)

The cardinality of a transcendence basis is called the transcendence degree of $F$
over $k$, denoted $\mathrm{tr.deg}_{k \subseteq F}$. [1.28, 1.29, 2.19]

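1.28. -> Let $k \subseteq E \subseteq F$ be field extensions. Prove that $\mathrm{tr.deg}_{k \subseteq F}$ is finite if and
only if both $\mathrm{tr.deg}_{k \subseteq E}$ and $\mathrm{tr.deg}_{E \subseteq F}$ (see Exercise 1.27) are finite and in this case

$$\mathrm{tr.deg}_{k \subseteq F} = \mathrm{tr.deg}_{k \subseteq E} + \mathrm{tr.deg}_{E \subseteq F}.$$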

[7.3] 

1.29. -> An extension $k \subseteq F$ is purely transcendental if it admits a transcendence
basis $\{\alpha_i\}_{i \in I}$ (see Exercise 1.27) such that $F = k(\{\alpha_i\}_{i \in I})$.

Prove that any field extension $k \subseteq F$ may be decomposed as a purely
transcendental extension followed by an algebraic extension. (Not all field extensions
may be decomposed as an algebraic extension followed by a purely transcendental
extension.) [1.30]

1.30. Let $k \subseteq k(\alpha)$ be a simple extension, with $\alpha$ transcendental over k. Let E be
a subfield of $k(\alpha)$ properly containing k. Prove that $\mathrm{tr.deg}_{k \subseteq E} = 1$.

Lüroth’s theorem asserts that in this situation $k \subseteq E$ is itself a simple
transcendental extension of k; that is, it is purely transcendental (Exercise 1.29).


2. Algebraic closure, Nullstellensatz, and a little 
algebraic geometry 

One of the most important extensions of a field k is its algebraic closure $k \subseteq \overline{k}$; this
was mentioned already in §V.5.2, and we can now prove that $\overline{k}$ exists and is unique
(up to isomorphism). Once this circle of ideas is approached, the temptation to say
a few words about the important result known as Hilbert’s Nullstellensatz, at the
cost of a small digression into algebraic geometry, will simply be irresistible.

2.1. Algebraic closure. Recall (Definition V.5.9) that a field K is algebraically
closed if all irreducible polynomials in $K[x]$ have degree 1, that is, if every
polynomial in $K[x]$ factors completely as a product of linear terms. Equivalently
(Exercise III.4.21) every maximal ideal in $K[x]$ is of the form $(x - c)$, for $c \in K$.

Now that we have a little more vocabulary, we can recast this definition yet 
again, as follows. 
Lemma 2.1. For a field K, the following are equivalent:

• K is algebraically closed.

• K has no nontrivial algebraic extensions.

• If $K \subseteq L$ is any extension and $\alpha \in L$ is algebraic over K, then $\alpha \in K$.

The proof is a straightforward application of the definitions and a good exercise 
(Exercise 2.1). 

Definition 2.2. An algebraic closure of a field k is an algebraic extension $k \subseteq \overline{k}$
such that $\overline{k}$ is algebraically closed.

Of course, every polynomial $f(x) \in k[x]$ will split into linear factors over $\overline{k}$: so
does every polynomial in the much larger $\overline{k}[x]$, as $\overline{k}$ is algebraically closed. The
requirement that $k \subseteq \overline{k}$ be algebraic ensures that no other intermediate field L,

$$k \subseteq L \subsetneq \overline{k},$$

may be algebraically closed: indeed, $L \subseteq \overline{k}$ would be a nontrivial algebraic
extension, contradicting Lemma 2.1. In fact, the only subfield of $\overline{k}$ containing all roots
of all nonconstant polynomials in $k[x]$ is $\overline{k}$ itself (Exercise 2.2); thus, the algebraic
closure of a field is ‘as small as possible’ subject to this requirement. Not
surprisingly, $\overline{k}$ turns out to be unique up to isomorphism, so that we can speak of the
algebraic closure of k.

Theorem 2.3. Every field k admits an algebraic closure $k \subseteq \overline{k}$; this extension is
unique up to isomorphism.

Concerning existence, the idea is to construct ‘by hand’ a huge extension K of
k where every polynomial $f(x) \in k[x]$ factors completely. The elements of K which
are algebraic over k will form an algebraic closure of k.

The construction is done in steps, each step including ‘one more root’ of all
nonconstant polynomials in $k[x]$. Here is a formalization of this step.

Lemma 2.4. Let k be a field. Then there exists an extension $k \subseteq K$ such that
every nonconstant polynomial $f(x) \in k[x]$ has at least one root in K.

Proof. (This construction is apparently due to Emil Artin.) Consider a set $\mathscr{T} = \{t_f\}$
in bijection with the set of nonconstant monic polynomials $f(x) \in k[x]$, and
let $k[\mathscr{T}]$ be the corresponding polynomial ring⁵ in all the indeterminates $t_f$. Let
$I \subseteq k[\mathscr{T}]$ be the ideal generated by all polynomials $f(t_f)$.

Then I is a proper ideal. Indeed, otherwise we could write

(*) $$1 = \sum_{i=1}^{n} a_i \cdot f_i(t_{f_i}),$$

where $a_i \in k[\mathscr{T}]$. We claim that this cannot be done: indeed, we can construct
an extension $k \subseteq F$ where the polynomials $f_1(x), \dots, f_n(x)$ have roots $\alpha_1, \dots, \alpha_n$,

5 Here is another rare occasion where we need to consider polynomial rings with possibly
infinitely many indeterminates. An element of $k[\mathscr{T}]$ is simply an ordinary polynomial with
coefficients in k, involving a finite number of indeterminates from $\mathscr{T}$.
respectively (apply Proposition V.5.7 n times); view (*) as an identity in $F[\mathscr{T}]$,
and plug in $t_{f_i} = \alpha_i$, obtaining

$$1 = \sum_{i=1}^{n} a_i f_i(\alpha_i) = \sum_{i=1}^{n} a_i \cdot 0 = 0,$$

which is nonsense.

Since I is proper, it is contained in a maximal ideal $\mathfrak{m}$ (Proposition V.3.5).
Thus, we obtain a field extension

$$k \subseteq K := \frac{k[\mathscr{T}]}{\mathfrak{m}};$$

by construction every nonconstant monic (and hence every nonconstant) polynomial
f(x) has a root in K, namely the coset of $t_f$. □

The field constructed in Lemma 2.4 contains at least one root of each nonconstant
polynomial $f(x) \in k[x]$, but this is not good enough: we are seeking a field
which contains all roots of such polynomials. Equivalently, not only should f(x)
have (at least) one linear factor $x - \alpha$ in $K[x]$, but we need the quotient polynomial
$f(x)/(x - \alpha) \in K[x]$ to have a linear factor and the quotient by that new factor to
have one, etc. This prompts us to consider a whole chain of extensions

$$k \subseteq K_1 \subseteq K_2 \subseteq K_3 \subseteq \cdots$$

where $K_1$ is obtained from k by applying Lemma 2.4, $K_2$ is similarly obtained
from $K_1$, etc.

Now consider the union L of this chain⁶. For every two $a, b \in L$, there is an i
such that $a, b \in K_i$; we can define $a + b$, $a \cdot b$ in L by adopting their definition in
$K_i$, and the result does not depend on the choice of i. It follows easily that L is a
field.

Claim 2.5. The field L is algebraically closed.

Proof. If $f(x) \in L[x]$ is a nonconstant polynomial, then $f(x) \in K_i[x]$ for some i
(it has only finitely many coefficients); hence f(x) has a root in $K_{i+1} \subseteq L$. That is,
every nonconstant polynomial in $L[x]$ has a root in L, as needed. □

Proof of the existence of algebraic closures. The existence of algebraic clo- 
sures is now completely transparent, because of the following simple observation: 

Lemma 2.6. Let $k \subseteq L$ be a field extension, with L algebraically closed. Let

$$\overline{k} := \{\alpha \in L \mid \alpha \text{ is algebraic over } k\}.$$

Then $\overline{k}$ is an algebraic closure of k.

The construction reviewed above provides us with an algebraically closed field 
L containing any given field k, so the lemma is all we need to prove. 


6 This is an example of direct limit, a notion which we will introduce more formally in 
§VIIU4. 





By Corollary 1.16, $\overline{k}$ is a field, and the extension $k \subseteq \overline{k}$ is tautologically
algebraic. To verify that $\overline{k}$ is algebraically closed, let $\alpha$ be algebraic over $\overline{k}$; then

$$k \subseteq \overline{k} \subseteq \overline{k}(\alpha)$$

is a composition of algebraic extensions, so $k \subseteq \overline{k}(\alpha)$ is algebraic (Corollary 1.18),
and in particular $\alpha$ is algebraic over k. But then $\alpha \in \overline{k}$, by definition of the latter.
It follows that $\overline{k}$ is algebraically closed, by Lemma 2.1. □
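
As a concrete instance of Lemma 2.6, here is a sketch for $k = \mathbb{Q}$. It assumes the
fundamental theorem of algebra (that $\mathbb{C}$ is algebraically closed), which is not proved
in this section, and it takes the transcendence of $\pi$ as a known fact.

```latex
% Assumption: \mathbb{C} is algebraically closed (fundamental theorem of algebra).
% Lemma 2.6 with k = \mathbb{Q} and L = \mathbb{C} then gives an algebraic closure of \mathbb{Q}:
\[
  \overline{\mathbb{Q}} := \{\alpha \in \mathbb{C} \mid \alpha \text{ is algebraic over } \mathbb{Q}\}.
\]
% Membership is decided by exhibiting (or ruling out) a nonzero polynomial over \mathbb{Q}:
\[
  \sqrt{2} \in \overline{\mathbb{Q}} \ (\text{root of } x^2 - 2), \qquad
  i \in \overline{\mathbb{Q}} \ (\text{root of } x^2 + 1), \qquad
  \pi \notin \overline{\mathbb{Q}}.
\]
```

In particular $\overline{\mathbb{Q}} \subsetneq \mathbb{C}$: an algebraically closed field containing k may be
much larger than the algebraic closure of k.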

Remark 2.7. Zorn’s lemma was sneakily used in this proof (we used the existence
of maximal ideals, which relies upon Zorn’s lemma), and it is natural to wonder
whether the existence of algebraic closures may be another statement equivalent to
the axiom of choice. Apparently, this is not the case⁷.

Next, we deal with uniqueness. It may be tempting to try to set things up 
in such a way that algebraic closures would end up being solutions to a universal
problem; this would guarantee uniqueness by abstract nonsense. However, we run 
into the same obstacle encountered in Proposition V.5.7: morphisms between ex- 
tensions depend on choices of roots of polynomials, so that algebraic closures are 
not ‘universal’ in the most straightforward sense of the term. 

Therefore, a bit of independent work is needed. 

Lemma 2.8. Let $k \subseteq L$ be a field extension, with L algebraically closed. Let $k \subseteq F$
be any algebraic extension. Then there exists a morphism of extensions $i : F \to L$.

As pointed out above, i is by no means unique!

Proof. This argument also relies on Zorn’s lemma. Consider the set Z of
homomorphisms

$$i_K : K \to L$$

where K is an intermediate field, $k \subseteq K \subseteq F$, and $i_K$ restricts to the identity on
k; Z is nonempty, since the extension $i_k : k \subseteq L$ defines an element of Z. We give
a poset structure to Z by defining

$$i_K \preceq i_{K'}$$

if $K \subseteq K' \subseteq F$ and $i_{K'}$ restricts to $i_K$ on K. To verify that every chain C in Z has
an upper bound in Z, let $K_C$ be the union of the sources of all $i_K \in C$ ($K_C$ is clearly
a field); if $\alpha \in K_C$, define $i_{K_C}(\alpha)$ to be $i_K(\alpha)$, where $i_K$ is any element of C such
that $\alpha \in K$. This prescription is clearly independent of the chosen K and defines
a homomorphism $K_C \to L$ restricting to the identity on k. This homomorphism is
an upper bound for C.

By Zorn’s lemma, Z admits a maximal element $i_G$, corresponding to an
intermediate field $k \subseteq G \subseteq F$. Let $H = i_G(G)$ be the image of G in L.

We claim that $G = F$: this will prove the statement, because it will imply that
there is a homomorphism $i_F : F \to L$ extending the identity on k.

Arguing by contradiction, assume that there exists an $\alpha \in F \smallsetminus G$, and consider
the extension $G \subseteq G(\alpha)$. Since $\alpha \in F$ is algebraic over k, it is algebraic over G;

7 Allegedly, the existence of algebraic closures is a consequence of the compactness theorem
for first-order logic, which is known to be weaker than the axiom of choice.
thus, it is a root of an irreducible polynomial $g(x) \in G[x]$. Consider the induced
homomorphism

$$i_G : G[x] \to H[x],$$

and let $h(x) = i_G(g(x))$. Then h(x) is an irreducible polynomial over H, and it has
a root $\beta$ in L (this is where we use the hypothesis that L is algebraically closed!).
We are in the situation considered in Proposition 1.5: we have an isomorphism of
fields $i_G : G \to H$; we have simple extensions $G(\alpha)$, $H(\beta)$; and $\alpha$, $\beta$ are roots of
irreducible polynomials $g(x)$, $h(x) = i_G(g(x))$. By Proposition 1.5, $i_G$ lifts to an
isomorphism

$$i_{G(\alpha)} : G(\alpha) \xrightarrow{\ \sim\ } H(\beta) \subseteq L$$

sending $\alpha$ to $\beta$. This contradicts the maximality of $i_G$, hence $G = F$, concluding
the argument. □

Proof of uniqueness of algebraic closures. Let $k \subseteq \overline{k}$, $k \subseteq \overline{k}_1$ be two algebraic
closures of k; we have to prove that there exists an isomorphism $\overline{k}_1 \to \overline{k}$ extending
the identity on k.

Since $k \subseteq \overline{k}_1$ is algebraic and $\overline{k}$ is algebraically closed, by Lemma 2.8 there
exists a homomorphism $i : \overline{k}_1 \to \overline{k}$ extending the identity on k; this homomorphism
is trivially injective, since $\overline{k}_1$ is a field. It is also surjective, by Lemma 2.1:
otherwise $\overline{k}_1 \subseteq \overline{k}$ is a nontrivial algebraic extension, contradicting the fact that $\overline{k}_1$ is
algebraically closed. Therefore i is an isomorphism, as needed. □

2.2. The Nullstellensatz. If K is an algebraically closed field, then every maximal
ideal in $K[x]$ is of the form $(x - c)$, for $c \in K$ (Exercise III.4.21). This statement
has a straightforward-looking generalization to polynomial rings in more
indeterminates: if K is algebraically closed, then every maximal ideal in $K[x_1, \dots, x_n]$ is
of the form $(x_1 - c_1, \dots, x_n - c_n)$.

Proving this statement is more challenging than it may seem from the looks of 
it; it is one facet of the famous theorem known as Hilbert’s Nullstellensatz (“theorem
on the position of zeros”). The Nullstellensatz has made a few cameo appearances
earlier (see Example III.4.15 and §III.6.5 in particular), and we will spend a little
time on it now. We will not prove the Nullstellensatz in its natural generality, that 
is, for all fields; this would take us too far. Actually, there are reasonably short 
proofs of the theorem in this generality, but the short arguments we have run into 
end up replacing a wider (and useful) context with ingenious cleverness; we do not 
find these arguments particularly insightful or memorable. 

By contrast, the theorem has a very simple and memorable proof if we make 
the further assumption that the field is uncountable. Therefore, the reader will have 
a complete proof of the theorem (in the form given below, which does not assume 
the field to be algebraically closed) for fields such as R and C but will have to trust 
on faith regarding other extremely important fields such as Q. 

The most vivid applications of the Nullstellensatz involve basic definitions in 
algebraic geometry, and we will get a small taste of this in the next section; but the 
theorem itself is best understood in the context of the considerations on ‘finiteness’ 
mentioned in §1.3; see especially Remark 1.14. We pointed out that if k C F 
is finitely generated as a field extension, then F need not be finitely generated
(that is, ‘finite-type’) as a k-algebra, and we raised the question of understanding
extensions F that are finite-type algebras. The following result answers this
question. It is our favorite version of the Nullstellensatz; therefore we will name
it so:

Theorem 2.9 (Nullstellensatz). Let $k \subseteq F$ be a field extension, and assume that F
is a finite-type k-algebra. Then $k \subseteq F$ is a finite (hence algebraic) extension.

The reader should pause and savor this statement for a moment. As pointed
out in §III.6.5, an algebra $k \to S$ may very well be of finite type (as an algebra),
without being finite (i.e., finitely generated as a module): $S = k[t]$ is an obvious
example. The content of Theorem 2.9 is that the distinction between finite-type
and finite disappears if S is a field.

As mentioned above, we will only prove this statement under an additional 
(and somewhat unnatural) assumption, which reduces the argument to elementary 
linear algebra. 

Proof for uncountable fields. Assume that k is uncountable. 

Let $k \subseteq F$ be a field extension, and assume that F is finitely generated as an
algebra over k; in particular, it is finitely generated as a field extension. We have
to prove that $k \subseteq F$ is a finite extension, or equivalently (by Proposition 1.15) that
it is algebraic, that is, that every $\alpha \in F \smallsetminus k$ is the root of a nonzero $f(x) \in k[x]$.

Now, by hypothesis F is the quotient of a polynomial ring over k:

$$k[x_1, \dots, x_n] \twoheadrightarrow F;$$

this implies that there is a countable basis of F as a vector space over k, because
this is the case for $k[x_1, \dots, x_n]$. Consider then the set

$$\left\{ \frac{1}{\alpha - c} \right\}_{c \in k}:$$


this is an uncountable subset of F; therefore it is linearly dependent over k
(Proposition VI.1.9). That is, there exist distinct $c_1, \dots, c_m \in k$ and nonzero coefficients
$\lambda_1, \dots, \lambda_m \in k$, such that

$$\frac{\lambda_1}{\alpha - c_1} + \cdots + \frac{\lambda_m}{\alpha - c_m} = 0.$$

Expressing the left-hand side with a common denominator gives

$$\frac{f(\alpha)}{g(\alpha)} = 0,$$

for $f(x) \neq 0$ in $k[x]$ and $g(\alpha) \neq 0$ (Exercise 2.4). This implies $f(\alpha) = 0$ for a
nonzero $f(x) \in k[x]$, and we are done. □
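
The common-denominator step can be made explicit. The following is one possible way
to write it out, offered as a sketch rather than as the intended solution of
Exercise 2.4; the formulas for f and g are our own choice of notation.

```latex
% Notation as in the proof: c_1, ..., c_m \in k distinct, \lambda_1, ..., \lambda_m \in k nonzero,
% and \alpha \in F \smallsetminus k.
\[
  \sum_{i=1}^{m} \frac{\lambda_i}{\alpha - c_i} = \frac{f(\alpha)}{g(\alpha)},
  \qquad
  f(x) := \sum_{i=1}^{m} \lambda_i \prod_{j \neq i} (x - c_j),
  \qquad
  g(x) := \prod_{j=1}^{m} (x - c_j).
\]
% g(\alpha) \neq 0 because \alpha \notin k, so \alpha \neq c_j for every j.
% f \neq 0 because evaluating at c_1 kills every summand except the first:
\[
  f(c_1) = \lambda_1 \prod_{j \neq 1} (c_1 - c_j) \neq 0 .
\]
```

The last line uses both hypotheses on the linear dependence: $\lambda_1 \neq 0$ and the $c_j$
are distinct.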

Regardless of its proof, the form of the Nullstellensatz given in Theorem 2.9 
does not look much like the other results we have mentioned as associated with it. 
For one thing, it is not too clear why Theorem 2.9 should be called a theorem on 
the ‘position of zeros’. The result deserves a little more attention. 
The form stated at the beginning of this section is obtained as follows:

Corollary 2.10. Let K be an algebraically closed field, and let I be an ideal of
$K[x_1, \dots, x_n]$. Then I is maximal if and only if

$$I = (x_1 - c_1, \dots, x_n - c_n),$$

for $c_1, \dots, c_n \in K$.

Proof. For $c_1, \dots, c_n \in K$,

$$\frac{K[x_1, \dots, x_n]}{(x_1 - c_1, \dots, x_n - c_n)} \cong K$$

(Exercise III.4.12) is a field; therefore $(x_1 - c_1, \dots, x_n - c_n)$ is maximal. Conversely,
let $\mathfrak{m}$ be a maximal ideal in $K[x_1, \dots, x_n]$, so that the quotient is a field and the
natural map

$$K \to L := \frac{K[x_1, \dots, x_n]}{\mathfrak{m}}$$

is a field extension. The field L is finitely generated as a K-algebra; therefore
$K \subseteq L$ is an algebraic extension by Theorem 2.9. Since K is algebraically closed,
this implies that $L = K$ (Lemma 2.1); therefore, there is a surjective homomorphism

$$\varphi : K[x_1, \dots, x_n] \to K$$

such that $\mathfrak{m} = \ker \varphi$. Let $c_i := \varphi(x_i)$. Then

$$(x_1 - c_1, \dots, x_n - c_n) \subseteq \ker \varphi = \mathfrak{m};$$

but $(x_1 - c_1, \dots, x_n - c_n)$ is maximal, so this implies $\mathfrak{m} = (x_1 - c_1, \dots, x_n - c_n)$, as
needed. □

Note how very (trivially) false the statement of Corollary 2.10 is over fields
which are not algebraically closed: $(x^2 + 1)$ is maximal in $\mathbb{R}[x]$. No such silliness
may occur over $\mathbb{C}$, for any number of variables.
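
To spell out the counterexample, here is a short worked comparison of the same ideal
over $\mathbb{R}$ and over $\mathbb{C}$. It relies only on the standard identification of
$\mathbb{R}[x]/(x^2+1)$ with $\mathbb{C}$.

```latex
% Over \mathbb{R}: the quotient is a field, so (x^2 + 1) is maximal,
% yet it is not of the form (x - c) with c \in \mathbb{R}.
\[
  \mathbb{R}[x]/(x^2 + 1) \cong \mathbb{C}, \qquad x \mapsto i .
\]
% Over \mathbb{C}: the same polynomial factors, so the ideal it generates is not maximal.
\[
  x^2 + 1 = (x - i)(x + i), \qquad (x^2 + 1) \subsetneq (x - i) \subsetneq \mathbb{C}[x] .
\]
```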

2.3. A little affine algebraic geometry. Corollary 2.10 is one of the main
reasons why ‘classical’ algebraic geometry developed over $\mathbb{C}$ rather than fields such
as $\mathbb{R}$ or $\mathbb{Q}$: it may be interpreted as saying that if K is algebraically closed, then
there is a natural bijection between the points of the product space $K^n$ and the
maximal ideals in the ring $K[x_1, \dots, x_n]$. This is the beginning of a fruitful dictionary
translating geometry into algebra and conversely. It is worthwhile formalizing this
statement and exploring other ‘geometric’ consequences of the Nullstellensatz.

For K a field, $\mathbb{A}^n_K$ denotes the affine space of dimension n over K, that is, the
set of n-tuples of elements of K:

$$\mathbb{A}^n_K = \{(c_1, \dots, c_n) \mid c_i \in K\}.$$

Elements of $\mathbb{A}^n_K$ are called points.

Objection: Why don’t we just use ‘$K^n$’ for this object? Because the latter
already stands for the standard n-dimensional vector space over K; so it carries
with it a certain baggage: the tuple (0, ..., 0) is special, so are the vector subspaces
of $K^n$, etc. By contrast, no point of $\mathbb{A}^n_K$ is ‘special’; we can carry any point to
any other point by a simple translation. Similarly, although it is useful to consider
affine subspaces, these (unlike vector subspaces) are not required to go through the
origin. Affine geometry and linear algebra are different in many respects.

We have already established that, by the Nullstellensatz, if K is algebraically
closed, then there is a natural bijection between the points of $\mathbb{A}^n_K$ and the maximal
ideals of the polynomial ring $K[x_1, \dots, x_n]$. Concretely, the bijection works like
this: the point $p = (c_1, \dots, c_n)$ corresponds to the set of polynomials

$$\mathscr{I}(p) := \{f(x) \in K[x_1, \dots, x_n] \mid f(p) = 0\},$$

where f(p) is the result of applying the evaluation map:

$$f(x_1, \dots, x_n) \mapsto f(c_1, \dots, c_n) \in K.$$

This of course is nothing but the homomorphism

$$\varphi : K[x_1, \dots, x_n] \to K$$

defined by prescribing $x_i \mapsto c_i$; the set $\mathscr{I}(p)$ is simply $\ker \varphi$, which is the maximal
ideal

$$(x_1 - c_1, \dots, x_n - c_n).$$

This correspondence $p \mapsto \mathscr{I}(p)$ may be defined over every field; the content of
Corollary 2.10 is that every maximal ideal of $K[x_1, \dots, x_n]$ corresponds to a point
of $\mathbb{A}^n_K$ in this way, if K is algebraically closed.

It is natural to upgrade the correspondence as follows (again, over arbitrary
fields): for every subset $S \subseteq \mathbb{A}^n_K$, consider the ideal

$$\mathscr{I}(S) := \{f(x) \in K[x_1, \dots, x_n] \mid \forall p \in S,\ f(p) = 0\},$$

that is, the set of polynomials ‘vanishing along S’; it is immediately checked that
this is indeed an ideal of $K[x_1, \dots, x_n]$. We can also consider a correspondence
in the reverse direction, from ideals of $K[x_1, \dots, x_n]$ to subsets of $\mathbb{A}^n_K$, defined by
setting for every ideal I,

$$\mathscr{V}(I) := \{p = (c_1, \dots, c_n) \in \mathbb{A}^n_K \mid \forall f \in I,\ f(c_1, \dots, c_n) = 0\}.$$

Thus, $\mathscr{V}(I)$ is the set of common solutions of all the polynomial equations $f = 0$
as $f \in I$: the set of points ‘cut out in $\mathbb{A}^n_K$’ by the polynomials in I.

In its most elementary manifestation, the dictionary mentioned in the beginning
of the section consists precisely of the pair of correspondences

$$\{\text{subsets of } \mathbb{A}^n_K\} \underset{\mathscr{V}}{\overset{\mathscr{I}}{\rightleftarrows}} \{\text{ideals in } K[x_1, \dots, x_n]\}.$$

The set $\mathscr{V}(I)$ is often (somewhat improperly) called the variety of I, while $\mathscr{I}(S)$
is (also improperly) the ideal of S.
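
A one-variable illustration may help fix the two directions. The specific ideal and
point sets below are our own choice of example, valid over any field K.

```latex
% Illustration in \mathbb{A}^1_K, for any field K (so 0 \neq 1 in K).
% A polynomial vanishes at 0 and at 1 exactly when x and x - 1 both divide it.
\[
  \mathscr{V}\big((x^2 - x)\big) = \{0, 1\}, \qquad
  \mathscr{I}\big(\{0, 1\}\big) = \big(x(x - 1)\big) = (x^2 - x).
\]
% Enlarging the set of points shrinks the ideal:
\[
  \{0\} \subseteq \{0, 1\} \implies
  \mathscr{I}(\{0\}) = (x) \supseteq (x^2 - x) = \mathscr{I}(\{0, 1\}).
\]
```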

The function $\mathscr{V}$ can be defined (using the same prescription) for any set
$A \subseteq K[x_1, \dots, x_n]$, whether A is an ideal or not: it is clear that $\mathscr{V}(A) = \mathscr{V}(I)$ where I is
the ideal generated by A (Exercise 2.5). Hilbert’s basis theorem (Theorem V.1.2)
tells us something rather interesting: $K[x_1, \dots, x_n]$ is Noetherian when K is a field;
hence every ideal I is generated by a finite number of elements. Thus, for every
ideal I there exist polynomials $f_1, \dots, f_r$ such that (abusing notation a little)

$$\mathscr{V}(I) = \mathscr{V}(f_1, \dots, f_r).$$

In other words, if a set may be defined by any set of polynomial equations, it may
in fact be defined by a finite set of polynomial equations⁸.

We will pass in silence many simple-minded properties of the functions $\mathscr{V}$, $\mathscr{I}$
(they reverse inclusions, they behave reasonably well with respect to unions and
intersections, etc.); the reader should feel free to research the topic at will. But
we want to focus the reader’s attention on the fact that (of course) $\mathscr{V}$, $\mathscr{I}$ are not
bijections; this is a problem, if we really want to construct a dictionary between
‘geometry’ and ‘algebra’.

For example, there are (in general) lots of subsets of $\mathbb{A}^n_K$ which are not cut out
by a set of polynomial equations, and there are lots of ideals in $K[x_1, \dots, x_n]$ which
are not obtained as $\mathscr{I}(S)$ for any subset $S \subseteq \mathbb{A}^n_K$.

We will leave to the reader the task of finding examples of the first phenomenon
(Exercise 2.6). The way around it is to restrict our attention to subsets of $\mathbb{A}^n_K$ which
are of the form $\mathscr{V}(I)$ for some ideal I.

Definition 2.11. An (affine) algebraic set is a subset $S \subseteq \mathbb{A}^n_K$ such that there
exists an ideal $I \subseteq K[x_1, \dots, x_n]$ for which $S = \mathscr{V}(I)$.

This definition may seem like a cheap patch: we do not really know what the
image of $\mathscr{V}$ is, so we label it in some way and proceed. This is not entirely true: the
family of algebraic subsets of a given $\mathbb{A}^n_K$ has a solid, recognizable structure: it is
the family of closed subsets in a topology on $\mathbb{A}^n_K$ (Exercise 2.7); this topology is
called the Zariski topology.

Studying affine algebraic sets is the business of affine algebraic geometry. The 
aim is to understand ‘geometric’ properties of these sets in terms of ‘algebraic’ 
properties of corresponding ideals or other algebraic entities associated with them. 

Example 2.12. Here are pictures of two algebraic subsets of $\mathbb{A}^2_{\mathbb{R}}$:

[Figure: two plane curves, a parabola and a curve with a cusp at the origin.]

The first is $\mathscr{V}((y - x^2))$; the second is $\mathscr{V}((y^2 - x^3))$. What feature of the ideal
$(y^2 - x^3)$ is responsible for the ‘cusp’ at (0, 0) in the second picture? The reader
will find out in any course in elementary algebraic geometry.

The fact that $\mathscr{I}$ is not surjective leads to interesting considerations. The
standard example of an ideal that is not the ideal of any set is $(x^2)$ in $K[x]$: because

8 We distinctly remember being surprised by this fact the first time we ran into it. 
wherever $x^2$ vanishes, so does x; hence if $x^2 \in \mathscr{I}(S)$ for a set S, then $x \in \mathscr{I}(S)$ as
well. This motivates the following definitions.

Definition 2.13. Let I be an ideal in a commutative ring R. The radical of I is
the ideal

$$\sqrt{I} := \{r \in R \mid \exists k \geq 0,\ r^k \in I\}.$$

An ideal I is a radical ideal if $I = \sqrt{I}$.

The reader should check that $\sqrt{I}$ is indeed an ideal; it may be characterized as
the intersection of all prime ideals containing I (Exercise 2.8).
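
Two small computations, chosen by us for illustration, show the definition and the
prime-intersection characterization at work.

```latex
% In \mathbb{Z}: r^k \in (12) for some k exactly when both 2 and 3 divide r,
% and (2), (3) are the prime ideals containing (12).
\[
  \sqrt{(12)} = (6) = (2) \cap (3).
\]
% In K[x], for K a field: taking the radical forgets multiplicities of factors.
\[
  \sqrt{(x^2)} = (x), \qquad \sqrt{\big(x^2 (x - 1)^3\big)} = \big(x (x - 1)\big).
\]
```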

Example 2.14. An element of a commutative ring R is nilpotent if and only if it is
in the radical of the ideal (0) (cf. Exercise V.4.19). The reader has encountered this
ideal, called the nilradical of R, in the exercises, beginning with Exercise III.3.12.

Prime ideals $\mathfrak{p}$ are clearly radical: because $f^k \in \mathfrak{p} \implies f \in \mathfrak{p}$, and therefore
$\sqrt{\mathfrak{p}} \subseteq \mathfrak{p}$ (the other inclusion holds trivially for all ideals).

Lemma 2.15. Let K be a field, and let S be a subset of $\mathbb{A}^n_K$. Then the ideal $\mathscr{I}(S)$
is a radical ideal of $K[x_1, \dots, x_n]$.

Proof. The inclusion $\mathscr{I}(S) \subseteq \sqrt{\mathscr{I}(S)}$ holds for every ideal, so it is trivially
satisfied. To verify the inclusion $\sqrt{\mathscr{I}(S)} \subseteq \mathscr{I}(S)$, let $f \in \sqrt{\mathscr{I}(S)}$. Then there is an
integer $k > 0$ such that $f^k \in \mathscr{I}(S)$; that is,

$$(\forall p \in S), \quad f(p)^k = 0.$$

But then

$$(\forall p \in S), \quad f(p) = 0,$$

proving $f \in \mathscr{I}(S)$, as needed. □

In view of these considerations, for any field we can refine the correspondence
between subsets of $\mathbb{A}^n_K$ and ideals of $K[x_1, \dots, x_n]$ as follows:

$$\{\text{algebraic subsets of } \mathbb{A}^n_K\} \underset{\mathscr{V}}{\overset{\mathscr{I}}{\rightleftarrows}} \{\text{radical ideals in } K[x_1, \dots, x_n]\}:$$

as we have argued, it is necessary to do so if we want to have a good dictionary. In
the rest of the section, $\mathscr{I}$ and $\mathscr{V}$ will be taken as acting between these two sets.

While we have tightened the correspondence substantially, the situation is still
less than idyllic. Over arbitrary fields, the function $\mathscr{V}$ is now surjective by the very
definition of an affine algebraic subset (cf. Exercise 2.9); but it is not necessarily
injective. An immediate counterexample is offered over $K = \mathbb{R}$ by the ideal $(x^2 + 1)$:
this is maximal, hence prime, hence radical, and

$$\mathscr{V}(x^2 + 1) = \emptyset = \mathscr{V}(1).$$

Here is where the Nullstellensatz comes to our aid, and here is where we see what
the Nullstellensatz has to do with the ‘zeros’ of an ideal (= $\mathscr{V}$ of that ideal).

Proposition 2.16 (Weak Nullstellensatz). Let K be an algebraically closed field,
and let $I \subseteq K[x_1, \dots, x_n]$ be an ideal. Then $\mathscr{V}(I) = \emptyset$ if and only if $I = (1)$.
Proof. If $I = (1)$, then $\mathscr{V}(I) = \emptyset$ by definition.

Conversely, assume that $I \neq (1)$. By Proposition V.3.5, I is then contained in
a maximal ideal $\mathfrak{m}$. Since K is algebraically closed, by Corollary 2.10 we have

$$\mathfrak{m} = (x_1 - c_1, \dots, x_n - c_n)$$

for some $c_1, \dots, c_n \in K$; so $\forall f(x_1, \dots, x_n) \in I$ there exist $g_1, \dots, g_n \in K[x_1, \dots, x_n]$
such that

$$f(x_1, \dots, x_n) = \sum_{i=1}^{n} g_i(x_1, \dots, x_n)(x_i - c_i).$$

In particular,

$$f(c_1, \dots, c_n) = \sum_{i=1}^{n} g_i(c_1, \dots, c_n)(c_i - c_i) = 0:$$

that is, $(c_1, \dots, c_n) \in \mathscr{V}(I)$. This proves $\mathscr{V}(I) \neq \emptyset$ if $I \neq (1)$, and we are done. □

This is encouraging, but it does not seem to really prove that $\mathscr{V}$ is injective.
As it happens, however, it does: we can derive from the ‘weak’ Nullstellensatz the
following stronger result, which does imply injectivity:

Proposition 2.17 (Strong Nullstellensatz). Let K be an algebraically closed field,
and let $I \subseteq K[x_1, \dots, x_n]$ be an ideal. Then

$$\mathscr{I}(\mathscr{V}(I)) = \sqrt{I}.$$

This implies that the composition $\mathscr{I} \circ \mathscr{V}$ is the identity on the set of radical
ideals; that is, $\mathscr{V}$ has a left-inverse, and therefore it is injective (cf. Proposition
1.2.11).

Proof. Note that $\mathscr{I}(\mathscr{V}(I))$ is a radical ideal (by Lemma 2.15). It follows
immediately from the definitions that $I \subseteq \mathscr{I}(\mathscr{V}(I))$; therefore

$$\sqrt{I} \subseteq \sqrt{\mathscr{I}(\mathscr{V}(I))} = \mathscr{I}(\mathscr{V}(I)).$$

We have to verify the reverse inclusion: assume that $f \in \mathscr{I}(\mathscr{V}(I))$; that is, assume
that

$$f(p) = 0$$

for all $p \in \mathbb{A}^n_K$ such that $\forall g \in I,\ g(p) = 0$; we have to show that there exists an m
for which $f^m \in I$.

The argument is not difficult after the fact, but it is fiendishly clever⁹. Let y
be an extra variable, and consider the ideal J generated by I and $1 - fy$ in
$K[x_1, \dots, x_n, y]$. More explicitly, assume

$$I = (g_1, \dots, g_r)$$

(since $K[x_1, \dots, x_n]$ is Noetherian, finitely many generators suffice); we can view
$g_i(x_1, \dots, x_n)$, resp., $f(x_1, \dots, x_n) \in K[x_1, \dots, x_n]$ as polynomials $G_i(x_1, \dots, x_n, y)$,
resp., $F(x_1, \dots, x_n, y) \in K[x_1, \dots, x_n, y]$, and let

$$J = (G_1, \dots, G_r, 1 - Fy).$$

9 This is called the Rabinowitsch trick and dates back to 1929. It was published in a one-page,
eighteen-line article in the Mathematische Annalen.
As $K[x_1, \dots, x_n, y]$ has $n + 1$ variables, the ideal J defines a subset $\mathscr{V}(J)$ in $\mathbb{A}^{n+1}_K$.
We claim $\mathscr{V}(J) = \emptyset$.

Indeed, assume on the contrary that $\mathscr{V}(J) \neq \emptyset$, and let $p = (a_1, \dots, a_n, b)$ in
$\mathbb{A}^{n+1}_K$ be a point of $\mathscr{V}(J)$. Then for $i = 1, \dots, r$

$$G_i(a_1, \dots, a_n, b) = 0;$$

but this means

$$g_i(a_1, \dots, a_n) = 0$$

for $i = 1, \dots, r$; that is, $(a_1, \dots, a_n) \in \mathscr{V}(I)$. Since f vanishes at all points of $\mathscr{V}(I)$,
this implies

$$f(a_1, \dots, a_n) = 0,$$

and then

$$(1 - Fy)(a_1, \dots, a_n, b) = 1 - f(a_1, \dots, a_n)\, b = 1 - 0 = 1.$$

But this contradicts the assumption that $p \in \mathscr{V}(J)$. Therefore, $\mathscr{V}(J) = \emptyset$.

Now use the weak Nullstellensatz: since K is algebraically closed, we can
conclude $J = (1)$. Therefore, there exist polynomials $H_i(x_1, \dots, x_n, y)$, $i = 1, \dots, r$,
and $L(x_1, \dots, x_n, y)$, such that

$$\sum_{i=1}^{r} H_i(x_1, \dots, x_n, y)\, G_i(x_1, \dots, x_n, y) + L(x_1, \dots, x_n, y)\big(1 - F(x_1, \dots, x_n, y)\, y\big) = 1.$$


We are not done being clever: the next step is to view this equality in the field
of rational functions over K rather than in the polynomial ring, which we can do,
since the latter is a subring of the former. We can then plug in $y = 1/F$ and still
get an equality, with the advantage of killing the last summand on the left-hand
side:

$$\sum_{i=1}^{r} H_i\!\left(x_1, \dots, x_n, \frac{1}{F}\right) G_i\!\left(x_1, \dots, x_n, \frac{1}{F}\right) = 1.$$

Further,

$$G_i\!\left(x_1, \dots, x_n, \frac{1}{F}\right) = g_i(x_1, \dots, x_n)$$

($G_i$ is just the name of $g_i$ in the larger polynomial ring and only depends on the
first n variables), while

$$H_i\!\left(x_1, \dots, x_n, \frac{1}{F}\right) = \frac{h_i(x_1, \dots, x_n)}{f(x_1, \dots, x_n)^m},$$

where $h_i \in K[x_1, \dots, x_n]$ and m is an integer large enough to work for all $i =
1, \dots, r$. Expressing in terms of a common denominator, we can rewrite the identity
as

$$\frac{h_1 g_1 + \cdots + h_r g_r}{f^m} = 1,$$

or

$$f^m = h_1 g_1 + \cdots + h_r g_r.$$

We have proved this identity in the field of rational functions; as it only involves
polynomials, it holds in $K[x_1, \dots, x_n]$. The right-hand side is an element of I, so
this proves $f \in \sqrt{I}$, and we are done. □
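
To see the mechanism in the smallest possible case, here is a toy run of the argument.
The choice $I = (x^2)$, $f = x$ in one variable, and the explicit polynomials $H_1$ and $L$,
are ours and serve only as an illustration.

```latex
% Toy case: n = 1, I = (x^2), f = x, so g_1 = x^2 and J = (x^2, 1 - xy).
% Here \mathscr{V}(I) = \{0\} and f vanishes there, so f \in \mathscr{I}(\mathscr{V}(I)).
% An explicit witness for J = (1), with H_1 = y^2 and L = 1 + xy:
\[
  y^2 \cdot x^2 + (1 + xy)(1 - xy) = x^2 y^2 + 1 - x^2 y^2 = 1 .
\]
% Plugging in y = 1/x kills the second summand and leaves
\[
  H_1\!\left(x, \tfrac{1}{x}\right) = \frac{1}{x^2} = \frac{h_1}{f^m}
  \quad (h_1 = 1,\ m = 2),
  \qquad\text{so}\qquad
  f^2 = x^2 = h_1 g_1 \in I .
\]
```

The exponent m that comes out is the power of y needed in the certificate for $J = (1)$.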
The addition of the variable y in the proof looks more reasonable (and less like
a trick) if one views it geometrically. The variety $\mathscr{V}(1 - xy)$ in $\mathbb{A}^2_K$ is an ordinary
hyperbola:

[Figure: the hyperbola $xy = 1$.]

The projection $(x, y) \mapsto x$ may be used to identify $\mathscr{V}(1 - xy)$ with the complement
of the origin (that is, $\mathscr{V}(x)$) in the “x-axis” (viewed as $\mathbb{A}^1_K$). Similarly, $\mathscr{V}(1 - fy)$
is a way to realize the Zariski open set $\mathbb{A}^n_K \smallsetminus \mathscr{V}(f)$ as a Zariski closed subset in a
space $\mathbb{A}^{n+1}_K$ of dimension one higher.

The proof of Proposition 2.17 hinges on this fact, and this fact is also an 
important building block in the basic set-up of algebraic geometry, since it can be 
used to show that the Zariski topology has a basis consisting of (sets which are 
isomorphic in a suitable sense to) affine algebraic sets. 

We will officially record the conclusion we were able to establish concerning the
functions $\mathscr{V}$, $\mathscr{I}$:

Corollary 2.18. Let K be an algebraically closed field. Then for any $n \geq 0$ the
functions

$$\{\text{algebraic subsets of } \mathbb{A}^n_K\} \underset{\mathscr{V}}{\overset{\mathscr{I}}{\rightleftarrows}} \{\text{radical ideals in } K[x_1, \dots, x_n]\}$$

are inverses of each other.

Proof. Proposition 2.17 shows that $\mathscr{I} \circ \mathscr{V}$ is the identity on radical ideals, so $\mathscr{V}$ is
injective, and $\mathscr{V}$ is surjective by definition of affine algebraic set. It follows that
$\mathscr{V}$ is a bijection and $\mathscr{I}$ is its inverse. □

Summarizing: If K is algebraically closed, then studying sets defined by
polynomial equations in a space $K^n$ is ‘the same thing as’ studying radical ideals in a
polynomial ring $K[x_1, \dots, x_n]$.

This correspondence is actually realized even more effectively at the level of
K-algebras. We say that a ring is reduced if it has no nonzero nilpotents; then R/I
is reduced if and only if I is a radical ideal (Exercise 2.8).
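
The equivalence between ‘radical’ and ‘reduced quotient’ can be checked by hand in
small cases. The examples below are our own and are meant only as a sanity check of
the statement.

```latex
% I = (x^2) is not radical, and the quotient has a nonzero nilpotent:
\[
  \bar{x} \neq 0, \quad \bar{x}^2 = 0 \quad \text{in } K[x]/(x^2).
\]
% I = (x) = \sqrt{(x^2)} is radical, and the quotient is a field, hence reduced:
\[
  K[x]/(x) \cong K .
\]
% The same phenomenon in \mathbb{Z}: (12) is not radical, while (6) is.
\[
  \bar{6}^2 = \overline{36} = 0 \ \text{in } \mathbb{Z}/12\mathbb{Z}, \qquad
  \mathbb{Z}/6\mathbb{Z} \cong \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/3\mathbb{Z} \ \text{is reduced}.
\]
```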
Definition 2.19. Let $S \subseteq \mathbb{A}^n_K$ be an algebraic set. The coordinate ring of S is the
quotient¹⁰

$$K[S] := \frac{K[x_1, \dots, x_n]}{\mathscr{I}(S)}.$$



Thus, the coordinate ring of an algebraic set is a reduced, commutative K- 
algebra of finite type. The reader will establish a ‘concrete’ interpretation of this 
ring, as the ring of ‘polynomial functions’ on S, in Exercise 2.12. One way to think 
about the Nullstellensatz (in its manifestation as Corollary 2.18) is that, if K is 
algebraically closed, then every reduced commutative K-algebra of finite type R
may be realized as the coordinate ring of an affine algebraic set S: the points of S 
correspond in a natural way to the maximal ideals of R (Exercise 2.14). 

In fact, in basic algebraic geometry one defines the category of affine algebraic 
sets over a field K in terms of this correspondence: morphisms of algebraic sets 
are defined so that they match homomorphisms of the corresponding (reduced, 
finite-type) K-algebras. (The reader will encounter a more precise definition in 
Example VIII. 1.9.) 

This dictionary allows us to translate every ‘geometric’ feature of an algebraic 
set (such as dimension, smoothness, etc.) into corresponding ‘algebraic’ features of 
the corresponding coordinate rings. Once these key algebraic features have been 
identified, then one can throw away the restriction of working on reduced finite- 
type algebras over an algebraically closed field and try to ‘do geometry’ on any (say 
commutative, Noetherian) ring. For example, it turns out that PIDs correspond 
to (certain) smooth curves in the affine geometric world; and then one can try to 
think of $\mathbb{Z}$ as a ‘smooth curve’ (although $\mathbb{Z}$ is not a reduced finite-type K-algebra 
over any field K!) and try to use theorems inspired by the geometry of curves to 
understand features of $\mathbb{Z}$, that is, use geometry to do number theory¹¹. 

Another direction in which these simple considerations may be generalized is 
by ‘gluing’ affine algebraic sets into manifold-like objects: sets which may not 
be affine algebraic sets ‘globally’ but may be covered by affine algebraic sets. A 
concrete way to do this is by working in projective space rather than affine space: 
the globalization process can then be carried out in a straightforward way at the 
algebraic level, where it translates into the study of ‘graded’ rings and modules — 
more notions for which, regretfully, we will find no room in this book (except for a 
glance, in §VIII.4.3). In the second half of the twentieth century a more abstract 
viewpoint, championed by Alexander Grothendieck and others, has proven to be 
extremely effective; it has led to the intense study of schemes, the current language 
of choice in algebraic geometry. We have encountered the simplest kind of schemes 
when we introduced the spectrum of a ring R, Spec R, back in §III.4.3. 


¹⁰ The notation ‘K[S]’ is fairly standard, although it may lead to confusion with the usual 
polynomial rings; hopefully the context takes care of this. It makes sense to incorporate the name 
of the field in the notation, since one could in principle change the base field while keeping the 
‘same’ equations encoded in the ideal $\mathscr{I}(S)$. 

¹¹ This is of course a drastic oversimplification. 

Exercises 

2.1. ▷ Prove Lemma 2.1. [§2.1] 

2.2. ▷ Let $k \subseteq \overline{k}$ be an algebraic closure, and let L be an intermediate field. Assume 
that every polynomial $f(x) \in k[x] \subseteq L[x]$ factors as a product of linear terms in 
L[x]. Prove that $L = \overline{k}$. [§2.1] 

2.3. Prove that if k is a countable field, then so is $\overline{k}$. 

2.4. ▷ Let k be a field, let $c_1, \dots, c_m \in k$ be distinct elements, and let $\lambda_1, \dots, \lambda_m$ 
be nonzero elements of k. Prove that 

$$\frac{\lambda_1}{x - c_1} + \cdots + \frac{\lambda_m}{x - c_m} \ne 0.$$

(This fact is used in the proof of Theorem 2.9.) [§2.2] 

2.5. ▷ Let K be a field, let A be a subset of $K[x_1, \dots, x_n]$, and let I be the ideal 
generated by A. Prove that $\mathscr{V}(A) = \mathscr{V}(I)$ in $\mathbb{A}^n_K$. [§2.3] 

2.6. ▷ Let K be your favorite infinite field. Find examples of subsets $S \subseteq \mathbb{A}^n_K$ which 
cannot be realized as $\mathscr{V}(I)$ for any ideal $I \subseteq K[x_1, \dots, x_n]$. Prove that if K is a 
finite field, then every subset $S \subseteq \mathbb{A}^n_K$ equals $\mathscr{V}(I)$ for some ideal $I \subseteq K[x_1, \dots, x_n]$. 
[§2.3] 

2.7. ▷ Let K be a field and n a nonnegative integer. Prove that the set of algebraic 
subsets of $\mathbb{A}^n_K$ is the family of closed sets of a topology on $\mathbb{A}^n_K$. [§2.3] 

2.8. ▷ With notation as in Definition 2.13: 

• Prove that the set $\sqrt{I}$ is an ideal of R. 

• Prove that $\sqrt{I}$ corresponds to the nilradical of R/I via the correspondence 
between ideals of R/I and ideals of R containing I. 

• Prove that $\sqrt{I}$ is in fact the intersection of all prime ideals of R containing I. 
(Cf. Exercise V.3.13.) 

• Prove that I is radical if and only if R/I is reduced. (Cf. Exercise III.3.13.) 
[§2.3] 

2.9. ▷ Prove that every affine algebraic set equals $\mathscr{V}(I)$ for a radical ideal I. [§2.3] 

2.10. Prove that every ideal in a Noetherian ring contains a power of its radical. 

2.11. Assume a field K is not algebraically closed. Find a reduced finite-type 
K-algebra which is not the coordinate ring of any affine algebraic set. 

2.12. ▷ Let K be an infinite field. A polynomial function on an affine algebraic 
set $S \subseteq \mathbb{A}^n_K$ is the restriction to S of (the evaluation function of) a polynomial 
$f(x_1, \dots, x_n) \in K[x_1, \dots, x_n]$. Polynomial functions on an algebraic set S manifestly 
form a ring and in fact a K-algebra. Prove that this K-algebra is isomorphic to 
the coordinate ring of S. [§2.3, §VIII.1.3, §VIII.2.3] 

2.13. Let K be an algebraically closed field. Prove that every reduced commutative 
K-algebra of finite type is the coordinate ring of an algebraic set S in some affine 
space $\mathbb{A}^n_K$. 

2.14. ▷ Prove that, over an algebraically closed field K, the points of an algebraic 
set S correspond to the maximal ideals of the coordinate ring K[S] of S, in such a 
way that if p corresponds to the maximal ideal $\mathfrak{m}_p$, then the value of the function 
$f \in K[S]$ at p equals the coset of f in $K[S]/\mathfrak{m}_p \cong K$. [§2.3, VIII.1.8] 

2.15. ¬ Let K be an algebraically closed field. An algebraic subset S of $\mathbb{A}^n_K$ is 
irreducible if it cannot be written as the union of two algebraic subsets properly 
contained in it. Prove that S is irreducible if and only if its ideal $\mathscr{I}(S)$ is prime, if 
and only if its coordinate ring K[S] is an integral domain. 

An irreducible algebraic set is ‘all in one piece’, like $\mathbb{A}^n_K$ itself, and unlike (for 
example) $\mathscr{V}(xy)$ in the affine plane $\mathbb{A}^2_K$ with coordinates x, y. Irreducible affine 
algebraic sets are called (affine algebraic) varieties. [2.18] 

2.16. ▷ Let K be an algebraically closed field. The field of rational functions 
$K(x_1, \dots, x_n)$ is the field of fractions of $K[\mathbb{A}^n_K] = K[x_1, \dots, x_n]$; every rational 
function $\alpha = F/G$ (with $G \ne 0$ and F, G relatively prime) may be viewed as defining 
a function on the open set $\mathbb{A}^n_K \smallsetminus \mathscr{V}(G)$; we say that $\alpha$ is ‘defined’ for all points in 
the complement of $\mathscr{V}(G)$. 

Let $G \in K[x_1, \dots, x_n]$ be irreducible. The set of rational functions that are 
defined in the complement of $\mathscr{V}(G)$ is a subring of $K(x_1, \dots, x_n)$. Prove that this 
subring may be identified with the localization (Exercise V.4.7) of $K[\mathbb{A}^n_K]$ at the 
multiplicative set $\{1, G, G^2, G^3, \dots\}$. (Use the Nullstellensatz.) 

The same considerations may be carried out for any irreducible algebraic set S, 
adopting as field of ‘rational functions’ K(S) the field of fractions of the integral 
domain K[S]. [2.17, 2.19, §6.3] 

2.17. ¬ Let K be an algebraically closed field, and let $\mathfrak{m}$ be a maximal ideal of 
$K[x_1, \dots, x_n]$, corresponding to a point p of $\mathbb{A}^n_K$. A germ of a function at p is 
determined by an open set containing p and a function defined on that open set; in 
our context (dealing with rational functions and where the open set may be taken 
to be the complement of a function that does not vanish at p) this is the same 
information as a rational function defined at p, in the sense of Exercise 2.16. 

Show how to identify the ring of germs with the localization $K[\mathbb{A}^n_K]_{\mathfrak{m}}$ (defined 
in Exercise V.4.11). 

As in Exercise 2.16, the same discussion can be carried out for any algebraic 
set. This is the origin of the name ‘localization’: localizing the coordinate ring of a 
variety V at the maximal ideal corresponding to a point p amounts to considering 
only functions defined in a neighborhood of p, thus studying V ‘locally’, ‘near p’. 
[V.4.7] 


2.18. ¬ Let K be an algebraically closed field. Consider the two ‘curves’ $C_1 : y = x^2$, 
$C_2 : y^2 = x^3$ in $\mathbb{A}^2_K$ (pictures of the real points of these algebraic sets are shown 
in Example 2.12). 

• Prove that $K[C_1] \cong K[t] = K[\mathbb{A}^1_K]$, while $K[C_2]$ may be identified with the 
subring $K[t^2, t^3]$ of K[t] consisting of polynomials $a_0 + a_2 t^2 + \cdots + a_d t^d$ with 
zero t-coefficient. (Note that every polynomial in K[x, y] may be written as 
$f(x) + g(x)y + h(x, y)(y^2 - x^3)$ for uniquely determined polynomials f(x), g(x), 
h(x, y).) 

• Show that $C_1$, $C_2$ are both irreducible (cf. Exercise 2.15). 

• Prove that $K[C_1]$ is a UFD, while $K[C_2]$ is not. 

• Show that the Krull dimension of both $K[C_1]$ and $K[C_2]$ is 1. (This is why these 
sets would be called ‘curves’. You may use the fact that maximal chains of prime 
ideals in K[x, y] have length 2.) 

• The origin (0, 0) is in both $C_1$, $C_2$ and corresponds to the maximal ideals $\mathfrak{m}_1$, 
resp., $\mathfrak{m}_2$, in $K[C_1]$, resp., $K[C_2]$, generated by the classes of x and y. 

• Prove that the localization $K[C_1]_{\mathfrak{m}_1}$ is a DVR (Exercise V.4.13). Prove that the 
localization $K[C_2]_{\mathfrak{m}_2}$ is not a DVR. (Note that the relation $y^2 = x^3$ still holds 
in this ring; prove that $K[C_2]_{\mathfrak{m}_2}$ is not a UFD.) 

The fact that a DVR admits a local parameter, that is, a single generator for its 
maximal ideal (cf. Exercise V.2.20), is a good algebraic translation of the fact that 
a curve such as $C_1$ has a single, smooth branch through (0, 0). The maximal ideal of 
$K[C_2]_{\mathfrak{m}_2}$ cannot be generated by just one element, as the reader may verify. [2.19] 

2.19. Prove that the fields of rational functions (Exercise 2.16) of the curves $C_1$ 
and $C_2$ of Exercise 2.18 are isomorphic and both have transcendence degree 1 over K 
(cf. Exercise 1.27). 

This is another reason why we should think of $C_1$ and $C_2$ as ‘curves’. In fact, 
it can be proven that the Krull dimension of the coordinate ring of a variety equals 
the transcendence degree of its field of rational functions. This is a consequence of 
Noether’s normalization theorem, a cornerstone of commutative algebra. 

2.20. ¬ Recall from Exercise VI.2.13 that $\mathbb{P}^n_K$ denotes the ‘projective space’ 
parametrizing lines in the vector space $K^{n+1}$. Every such line consists of multiples of 
a nonzero vector $(c_0, \dots, c_n) \in K^{n+1}$, so that $\mathbb{P}^n_K$ may be identified with the 
quotient (in the set-theoretic sense of §I.1.5) of $K^{n+1} \smallsetminus \{(0, \dots, 0)\}$ by the equivalence 
relation $\sim$ defined by 

$$(c_0, \dots, c_n) \sim (c_0', \dots, c_n') \iff (\exists \lambda \in K^*),\ (c_0', \dots, c_n') = (\lambda c_0, \dots, \lambda c_n).$$

The ‘point’ in $\mathbb{P}^n_K$ determined by the vector $(c_0, \dots, c_n)$ is denoted $(c_0 : \dots : c_n)$; 
these are the ‘projective coordinates’¹² of the point. Note that there is no ‘point’ 
(0 : ... : 0). 

Prove that the function $\mathbb{A}^n_K \to \mathbb{P}^n_K$ defined by 

$$(c_1, \dots, c_n) \mapsto (1 : c_1 : \dots : c_n)$$

is a bijection onto the set of points $(c_0 : \dots : c_n)$ with $c_0 \ne 0$. This function is used 
to realize $\mathbb{A}^n_K$ as a subset of $\mathbb{P}^n_K$. By using 
similar functions, prove that $\mathbb{P}^n_K$ can be covered with n + 1 copies of $\mathbb{A}^n_K$, and relate 
this fact to the cell decomposition obtained in Exercise VI.2.13. (Suggestion: Work 
out carefully the case n = 2.) [2.21, VIII.4.8] 

¹² This is a convenient abuse of language. Keep in mind that the $c_i$’s are not determined by 
the point, so they are not ‘coordinates’ in any strict sense. Their ratios are, however, determined 
by the point. 

2.21. ¬ Let $F(x_0, \dots, x_n) \in K[x_0, \dots, x_n]$ be a homogeneous polynomial. With 
notation as in Exercise 2.20, prove that the condition ‘$F(c_0, \dots, c_n) = 0$’ for a 
point $(c_0 : \dots : c_n) \in \mathbb{P}^n_K$ is well-defined: it does not depend on the representative 
$(c_0, \dots, c_n)$ chosen for the point $(c_0 : \dots : c_n)$. We can then define the following 
subset of $\mathbb{P}^n_K$: 

$$\mathscr{V}(F) := \{(c_0 : \dots : c_n) \in \mathbb{P}^n_K \mid F(c_0, \dots, c_n) = 0\}.$$

Prove that this ‘projective algebraic set’ can be covered with n + 1 affine algebraic 
sets. 

The basic definitions in ‘projective algebraic geometry’ can be developed along 
essentially the same path taken in this section for affine algebraic geometry, 
using ‘homogeneous ideals’ (that is, ideals generated by homogeneous polynomials; 
see §VIII.4.3) rather than ordinary ideals. This problem shows one way to relate 
projective and affine algebraic sets, in one template example. [VIII.4.8, VIII.4.11] 


3. Geometric impossibilities 

Very simple considerations in field theory dispose easily of a whole class of geometric 
problems which had seemed unapproachable for a very long time, problems which 
preoccupied the Greeks and were only solved in the relatively recent past. 

These problems have to do with the construction of certain geometric 
figures with ‘straightedge and compass’, that is, subject to certain very strict rules. 
The practical utility of these constructions would appear to be absolute zero, but 
the intellectual satisfaction of being able to thoroughly understand matters which 
stumped extremely intelligent people for a very long time is well worth the little 
side trip. 

3.1. Constructions by straightedge and compass. We begin with two points 
O, P in the ordinary, real plane. You are allowed to mark (‘construct’) more points 
and other geometric figures in the plane, but only according to the following rules: 

• If you have constructed two points A, B, then you can draw the line joining 
them (using your straightedge). 

• If you have constructed two points A, B, then you can draw the circle with 
center at A and containing B (using your compass). 

• You can mark any number of points of intersection of any two distinct lines, 
line and circle, or circles that you have drawn already. 

Performing these actions leads to complex (and beautiful) collections of lines and 
circles; we say that we have ‘constructed’ a geometric figure if that figure appears 
as a subset of the whole picture. 

For example, here is a recipe to construct an equilateral triangle with O, P as 
two of its vertices: 

(1) draw the circle with center O, through P; 

(2) draw the circle with center P, through O; 

(3) let Q be either of the two points of intersection of the two circles; 

(4) draw the lines through O, P; O, Q; and P, Q. 



Of course O, P, Q are vertices of an equilateral triangle. 
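The recipe above can be checked numerically. The following is a minimal sketch, assuming coordinates O = (0, 0) and P = (1, 0) (a normalization the text adopts only later) and floating-point arithmetic; the helper `circle_intersections` is an illustrative name, and it assumes the two circles really meet in two points.

```python
import math


def circle_intersections(c1, r1, c2, r2):
    """Intersection points of two circles, assumed to meet in two points."""
    (x1, y1), (x2, y2) = c1, c2
    d = math.hypot(x2 - x1, y2 - y1)           # distance between the centers
    a = (r1**2 - r2**2 + d**2) / (2 * d)       # distance from c1 to the common chord
    h = math.sqrt(r1**2 - a**2)                # half-length of the common chord
    mx, my = x1 + a * (x2 - x1) / d, y1 + a * (y2 - y1) / d
    ux, uy = -(y2 - y1) / d, (x2 - x1) / d     # unit vector along the chord
    return (mx + h * ux, my + h * uy), (mx - h * ux, my - h * uy)


O, P = (0.0, 0.0), (1.0, 0.0)
radius = math.dist(O, P)
# Steps (1)-(3): the two circles, and one of their intersection points.
Q, Q_prime = circle_intersections(O, radius, P, radius)
# Step (4): the three sides of the triangle O, P, Q.
sides = [math.dist(O, P), math.dist(P, Q), math.dist(Q, O)]
print(Q, sides)
```

With this normalization the point Q has coordinates (1/2, √3/2) up to rounding, and the three side lengths agree.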

It is a pleasant exercise to try to come up with a specific sequence of basic moves 
constructing a given figure or geometric configuration. The enterprising reader 
could try to produce the vertices of a pentagon by a straightedge-and-compass 
construction, without looking it up and before we learn enough to thwart pure 
geometric intuition. 

The general question is to decide whether a figure can or cannot be constructed 
this way. Three problems of this kind became famous in antiquity: 

• trisecting angles; 

• squaring circles; 

• doubling cubes. 

For example, one assumes one has already constructed two lines forming an angle θ 
and asks whether one can construct two lines forming θ/3. This is trivially possible 
for some constructible θ: we just constructed an angle of π/3, so evidently we were 
able to trisect the angle π; but can this be done for all constructible angles? 

The answer is no. Similarly, it is not possible to construct a square whose area 
equals the area of a (constructible) circle, and it is not possible to construct the 
side of a cube whose volume is twice as large as a cube with given (constructible) 
side. 

Field theory will allow us to establish all this¹³. Later we will return to these 
constructions and study the question of the constructibility of regular polygons 
with a given side: it is very easy to construct equilateral triangles (we just did), 
squares, hexagons; it is not too hard to construct pentagons; but the question 
of which polygons are constructible and which are not connects beautifully with 
deeper questions in field theory. 

¹³ However, for the impossibility of squaring circles we will use the transcendence of π, which 
we are taking on faith. 

The three problems listed above have an illustrious history; they apparently 
date back to at least 414 B.C., if one is to take this fragment from Aristophanes’ 
“The Birds” as an indication: 

METON: By measuring with a straightedge there 
the circle becomes a square with a court within.¹⁴ 

The problems depend on the precise restrictions imposed on the constructions. 
For example, one can trisect angles if one is allowed to mark points on the 
straightedge (thus making it a ruler); in fact, this was already known to Archimedes 
(Exercise 3.13). 

Translating these geometric problems into algebra requires establishing the fea- 
sibility of a few basic constructions. 

First of all, one can construct a line containing a given point A and 
perpendicular to a given line $\ell$ (thus, $\ell$ contains at least another constructible point B). For 
this, draw the circle with center A and containing B; 

– if this circle only intersects $\ell$ at B, then the line through A and B is 
perpendicular to $\ell$; 

– otherwise, the circle intersects $\ell$ at a second point C (whether A is on $\ell$ or 
not), 

and once C is determined, the line through A and perpendicular to $\ell$ may be found 
by joining the two points of intersection D, E of the two circles with centers at B, 
resp., C, and containing C, resp., B. 

¹⁴ We thank Matilde Marcolli for the translation. For those who are less ignorant than we are: 

|ierp^o<i> xavtSvt npo<7 , afletc *(va 
6 xuxXoc y«vy|Tfl(£ aoi Tsrpdnr<*>v<x xc tv otYOp<& 






Applying this construction twice gives the line through a point A and parallel 
to a given line, which is often handy. 

Judicious application of these operations allows us to bring a Cartesian 
reference system into the picture. Starting with the initial two points O, P, we can 
construct perpendicular Cartesian axes centered at O, with P marking (say) the 
point (1, 0) on the ‘x-axis’, 
and constructing a point A = (x, y) is equivalent to constructing its projections 
X = (x, 0), Y = (0, y) onto the axes, and in fact it is equivalent to constructing the 
two points X = (x, 0) and Y' = (y, 0). 

It follows that determining which figures are constructible by straightedge and 
compass is equivalent to determining which numbers may be realized as coordinates 
of a constructible point. 

Definition 3.1. A real number r is constructible if the point (r, 0) is constructible 
with straightedge and compass (assuming O = (0, 0) and P = (1, 0), as above). We 
will denote by $\mathscr{C}_{\mathbb{R}} \subseteq \mathbb{R}$ the set of constructible real numbers. 

Also, we can identify the real plane with $\mathbb{C}$, placing O at 0 and P at 1, and we 
say that z = x + iy is constructible if the point (x, y) is constructible by straightedge 
and compass. We will denote by $\mathscr{C}_{\mathbb{C}} \subseteq \mathbb{C}$ the set of constructible complex numbers. 
Summarizing the foregoing discussion, we have proved: 

Lemma 3.2. A point (x, y) is constructible by straightedge and compass if and only 
if $x + iy \in \mathscr{C}_{\mathbb{C}}$, if and only if $x, y \in \mathscr{C}_{\mathbb{R}}$. 

The next obvious question is what kind of structure these sets of constructible 
numbers carry, and this prompts us to look at a few more basic straightedge-and- 
compass constructions. 

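Lemma 3.3. The subset $\mathscr{C}_{\mathbb{R}} \subseteq \mathbb{R}$ of constructible numbers is a subfield of $\mathbb{R}$. 
Likewise, $\mathscr{C}_{\mathbb{C}}$ is a subfield of $\mathbb{C}$, and in fact $\mathscr{C}_{\mathbb{C}} = \mathscr{C}_{\mathbb{R}}(i)$. 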


Proof. The set $\mathscr{C}_{\mathbb{R}} \subseteq \mathbb{R}$ is nonempty, so in order to show it is a field, we only 
need to show that it is closed with respect to subtraction and division by a nonzero 
constructible number (cf. Exercise II.6.2). 

The reader will check that $\mathscr{C}_{\mathbb{R}}$ is closed under subtraction (Exercise 3.2). To 
see that $\mathscr{C}_{\mathbb{R}}$ is closed under division, let $a, b \in \mathscr{C}_{\mathbb{R}}$, with $b \ne 0$. Since A = (0, a) and 
B = (b, 0) are constructible by hypothesis, we can construct C as the y-intercept 
of the line through P = (1, 0) and parallel to the line through A and B¹⁵. 
Since the triangles AOB and COP are similar, we see that C = (0, a/b); it follows 
that a/b is constructible. 

The fact that $\mathscr{C}_{\mathbb{C}}$ is a subfield of $\mathbb{C}$ is an immediate consequence of the fact that 
$\mathscr{C}_{\mathbb{R}}$ is a field, and the proof of this is left to the enjoyment of the reader (Exercise 3.7). 
The fact that $\mathscr{C}_{\mathbb{C}} = \mathscr{C}_{\mathbb{R}}(i)$ is a restatement of the fact that $x + iy \in \mathscr{C}_{\mathbb{C}}$ if and only 
if x and y are in $\mathscr{C}_{\mathbb{R}}$. □ 

¹⁵ In the picture we are assuming a > 0 and b > 1; the reader will check that other possibilities 
lead to the same conclusion. 
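The division step in the proof of Lemma 3.3 amounts to a one-line computation with slopes. The following is a minimal sketch using exact rational arithmetic; the function name `quotient_by_parallel` is an illustrative choice, and rational inputs are an assumption made only so that the check is exact.

```python
from fractions import Fraction


def quotient_by_parallel(a, b):
    """y-intercept of the line through P = (1, 0) parallel to the line AB,
    where A = (0, a) and B = (b, 0) with b != 0."""
    slope = Fraction(-a) / Fraction(b)     # slope of the line through A and B
    # The parallel line through P is y = slope * (x - 1); evaluate it at x = 0.
    return slope * (0 - 1)


print(quotient_by_parallel(Fraction(3), Fraction(4)))   # the point C is (0, 3/4)
```

The returned value is a/b for every admissible input, with no assumption on the signs of a and b, in line with the remark that the other configurations lead to the same conclusion.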

We may view $\mathscr{C}_{\mathbb{R}}$ and $\mathscr{C}_{\mathbb{C}}$ as extensions of $\mathbb{Q}$, drawing a bridge between 
constructibility by straightedge and compass and field theory: we will be able to 
understand constructibility of geometric figures if we can understand the field extensions 

$$\mathbb{Q} \subseteq \mathscr{C}_{\mathbb{R}} \subseteq \mathscr{C}_{\mathbb{C}}.$$

Happily, we can understand these extensions! 

3.2. Constructible numbers and quadratic extensions. Our goal is to prove 
the following amazingly explicit description of $\mathscr{C}_{\mathbb{R}}$ (immediately implying one for 
$\mathscr{C}_{\mathbb{C}}$). 

Theorem 3.4. Let $\gamma \in \mathbb{R}$. Then $\gamma \in \mathscr{C}_{\mathbb{R}}$ if and only if there exist real numbers 
$\delta_1, \dots, \delta_k$ such that for all $j = 1, \dots, k$ 

$$[\mathbb{Q}(\delta_1, \dots, \delta_j) : \mathbb{Q}(\delta_1, \dots, \delta_{j-1})] = 2$$

and $\gamma \in \mathbb{Q}(\delta_1, \dots, \delta_k)$. 

In other words, $\gamma \in \mathbb{R}$ is constructible if and only if it can be placed in the top 
field of a sequence of (real) quadratic extensions over $\mathbb{Q}$. Since $\mathscr{C}_{\mathbb{C}} = \mathscr{C}_{\mathbb{R}}(i)$, the 
same statement holds for constructible complex numbers, including i in the list of 
$\delta_j$ (or simply allowing the $\delta_j$ to be complex numbers; cf. Exercise 3.9). 

The proof of this theorem is as explicit as one can possibly hope. From a given 
straightedge-and-compass construction of a point (x, y) one can obtain an explicit 
sequence $\delta_1, \dots, \delta_k$ as in the statement, such that x and y belong to the extension 
$\mathbb{Q}(\delta_1, \dots, \delta_k)$; conversely, from any element $\gamma$ of any such extension one can obtain 
an explicit construction of a point ($\gamma$, 0) by straightedge and compass. 

Proof. Let’s first argue in the ‘geometry to algebra’ direction. A configuration 
of points, lines, and circles obtained by a straightedge-and-compass construction 
may be described by the coordinates of the points and the equations of the lines 
and circles. Suppose that at one stage in a given construction all coordinates of all 
points and all coefficients in the equations of lines and circles belong to a field F; 
we will say that the configuration is defined over F. 

Then we claim that for every object constructed at the next stage, there exists 
a number $\delta \in \mathbb{R}$, of degree at most 2 over F, such that the new configuration is 
defined over F($\delta$). The ‘only if’ part of the theorem follows by induction on the 
number of steps in the construction, since at the beginning the configuration (that 
is, the pair of points O = (0, 0), P = (1, 0)) is defined over $\mathbb{Q}$. 

Verifying our claim amounts to verifying it for the basic operations defining 
straightedge-and-compass constructions. The reader will check (Exercise 3.5) that 
the point of intersection of two lines defined over F has coordinates in F and that 
lines and circles determined by points with coordinates in F are defined over F. So 
$\delta = 1$ works in all these cases. 


For the intersection of a line $\ell$ and a circle C, assume that $\ell$ is not parallel to 
the y-axis (the argument is entirely analogous otherwise) and that it does meet C; 
let 

$$y = mx + r$$

be the equation of $\ell$ and let 

$$x^2 + y^2 + ax + by + c = 0$$

be the equation of C. We are assuming that $a, b, c, m, r \in F$. Then the 
x-coordinates of the points of intersection of $\ell$ and C are the solutions of the equation 

$$x^2 + (mx + r)^2 + ax + b(mx + r) + c = 0.$$

The ‘quadratic formula’ shows that these coordinates belong to the field $F(\sqrt{D})$, 
where D is the discriminant of this polynomial: explicitly, 

$$D = (2mr + bm + a)^2 - 4(m^2 + 1)(r^2 + br + c),$$

but this is unimportant. What is important is that $D \in F$; hence $\delta = \sqrt{D}$ satisfies 
our requirement. 

For the intersection of two (distinct) circles defined over F, nothing new is 
needed: if 

$$\begin{cases} x^2 + y^2 + a_1 x + b_1 y + c_1 = 0, \\ x^2 + y^2 + a_2 x + b_2 y + c_2 = 0 \end{cases}$$

are two circles, subtracting the two equations shows that their points of intersection 
coincide with the points of intersection of a circle and a line: 

$$\begin{cases} x^2 + y^2 + a_1 x + b_1 y + c_1 = 0, \\ (a_1 - a_2)x + (b_1 - b_2)y + (c_1 - c_2) = 0, \end{cases}$$

with the same conclusion as in the previous case. 

This completes the verification of the ‘only if’ part of the theorem. 
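The line-and-circle step can be made concrete. The following is a minimal sketch, assuming F is the field of rational numbers and taking as sample data the line y = x and the circle x² + y² − 1 = 0; the function names are illustrative. It computes the quadratic obtained by substituting y = mx + r into the equation of the circle and its discriminant D, an element of F.

```python
from fractions import Fraction
import math


def intersection_quadratic(m, r, a, b, c):
    """Coefficients (A, B, C) of A x^2 + B x + C = 0, obtained by substituting
    y = m x + r into x^2 + y^2 + a x + b y + c = 0."""
    A = m * m + 1
    B = 2 * m * r + b * m + a
    C = r * r + b * r + c
    return A, B, C


def discriminant(m, r, a, b, c):
    """D = (2mr + bm + a)^2 - 4(m^2 + 1)(r^2 + br + c)."""
    A, B, C = intersection_quadratic(m, r, a, b, c)
    return B * B - 4 * A * C


# Sample data over Q: the line y = x and the circle x^2 + y^2 - 1 = 0.
data = (Fraction(1), Fraction(0), Fraction(0), Fraction(0), Fraction(-1))
A, B, C = intersection_quadratic(*data)
D = discriminant(*data)
# x-coordinates of the intersection points, via the quadratic formula.
xs = [(-float(B) + sign * math.sqrt(D)) / (2 * float(A)) for sign in (1, -1)]
print(D, xs)
```

In this sample D = 8 is rational although its square root is not, so the intersection points are defined over the quadratic extension obtained by adjoining √D (here, √2) and not over the rationals themselves.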

To prove that every element of an extension as stated is constructible, again 
argue by induction: it suffices to show that if (i) $\delta \in \mathbb{R}$, (ii) all elements of F are 
constructible, and (iii) $r = \delta^2 \in F$, then $\delta$ is constructible (note that in order to 
construct an element of degree 2 over F, it suffices to construct the square root of 
the discriminant of its minimal polynomial). Therefore, all we have to show is that 
we can ‘take square roots’ by a straightedge-and-compass construction, as follows 
(the picture one should have in mind has r > 1). 

If A = (r, 0) is constructible, so is B = (−r, 0); C is the midpoint of the segment BP 
(midpoints are constructible, Exercise 3.1); the circle with center C and containing 
P intersects the positive y-axis at a point Q = (0, $\delta$), and elementary geometry 
shows that $\delta^2 = r$. Therefore $\delta$ is constructible, concluding the proof of the theorem. 

□ 
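The ‘elementary geometry’ behind the square-root construction can be spelled out in coordinates. The following is a minimal sketch under the conventions of the proof (B = (−r, 0), P = (1, 0), C the midpoint of BP), with r > 0 rational so that the arithmetic is exact; the function name `delta_squared` is an illustrative choice.

```python
from fractions import Fraction


def delta_squared(r):
    """Square of the height at which the circle centered at C through P
    meets the y-axis, where B = (-r, 0), P = (1, 0), C = midpoint of BP."""
    center_x = (Fraction(1) - r) / 2      # C = ((1 - r)/2, 0)
    radius = (Fraction(1) + r) / 2        # |CP| = (1 + r)/2
    # A point (0, d) on the circle satisfies center_x^2 + d^2 = radius^2.
    return radius**2 - center_x**2


print(delta_squared(Fraction(5)))   # equals r itself
```

Expanding gives ((1 + r)² − (1 − r)²)/4 = r, so the point Q = (0, δ) produced by the construction has δ² = r.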


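For example, one can construct an angle of 3°: allegedly 

$$\cos 3^\circ = \frac{1}{8}(\sqrt{3} + 1)\sqrt{5 + \sqrt{5}} + \frac{1}{16}(\sqrt{6} - \sqrt{2})(\sqrt{5} - 1),$$

and this expression shows that 

$$\cos 3^\circ \in \mathbb{Q}(\sqrt{2}, \sqrt{3}, \sqrt{5})\Big(\sqrt{5 + \sqrt{5}}\Big);$$

that is (by Theorem 3.4), cos 3° is constructible. Of course once A = (cos θ, 0) is 
constructible, then so is the angle θ. 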
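The word ‘allegedly’ invites a check. The following is a minimal floating-point sketch, assuming the radical expression for cos 3° is the one that arises from cos(18° − 15°); it is a numerical sanity check only, not a proof of the identity.

```python
import math

sqrt = math.sqrt
# (1/8)(sqrt3 + 1) sqrt(5 + sqrt5) + (1/16)(sqrt6 - sqrt2)(sqrt5 - 1)
candidate = (sqrt(3) + 1) * sqrt(5 + sqrt(5)) / 8 \
    + (sqrt(6) - sqrt(2)) * (sqrt(5) - 1) / 16
target = math.cos(math.radians(3))
print(candidate, target, abs(candidate - target))
```

Every number under a radical sign here is obtained from rationals by repeated square roots, which is exactly the shape of expression that Theorem 3.4 allows.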



Here is another example of a ‘constructive’ application of Theorem 3.4: 

Example 3.5. Regular pentagons are constructible. 
Indeed, it suffices to construct the point A = (cos(2π/5), 0), and it so happens that 
$\gamma = \cos(2\pi/5)$ satisfies 

$$(*) \qquad 4\gamma^2 + 2\gamma - 1 = 0$$

(in fact, $\gamma$ is half of the inverse of the golden ratio: $\gamma = (\sqrt{5} - 1)/4$). It follows that 
$\gamma \in \mathbb{Q}(\sqrt{5})$, and hence it is constructible by Theorem 3.4. In fact, the proof shows 
how to construct $\sqrt{5}$, so the reader should now have no difficulty producing a 
straightedge-and-compass construction of a regular pentagon (Exercise 3.6). 

We will come back to constructions of regular polygons once we have studied 
field theory a little more thoroughly (cf. §7.2). By the way, how does one figure 
out that $\gamma = \cos(2\pi/5)$ should satisfy the identity (*)? This is easy modulo some 
complex number arithmetic; see Exercise 3.11. 
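Relation (*) and the closed form for γ are easy to test numerically before attempting Exercise 3.11. The following is a minimal floating-point sketch; the variable names are illustrative, and the computation checks the identity without deriving it.

```python
import cmath
import math

z = cmath.exp(2j * math.pi / 5)        # the 5th root of 1 with argument 2*pi/5
gamma = z.real                         # cos(2*pi/5)
residual = 4 * gamma**2 + 2 * gamma - 1
closed_form = (math.sqrt(5) - 1) / 4   # half of the inverse of the golden ratio
print(gamma, closed_form, residual)
```

The residual vanishes up to rounding error, and γ agrees with (√5 − 1)/4, the positive root of the quadratic in (*).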

3.3. Famous impossibilities. Theorem 3.4 easily settles the three problems 
mentioned in §3.1, via the following immediate consequence: 

Corollary 3.6. Let $\gamma \in \mathscr{C}_{\mathbb{C}}$ be a constructible number. Then $[\mathbb{Q}(\gamma) : \mathbb{Q}]$ is a power 
of 2. 

Proof. By Lemma 3.3 and Theorem 3.4, there exist $\delta_j \in \mathbb{R}$ such that 

$$\gamma \in \mathbb{Q}(\delta_1, \dots, \delta_k, i),$$

and each $\delta_j$ has degree $\le 2$ over $\mathbb{Q}(\delta_1, \dots, \delta_{j-1})$. Repeated application of 
Proposition 1.10 shows that 

$$[\mathbb{Q}(\delta_1, \dots, \delta_k, i) : \mathbb{Q}]$$

is a power of 2, and since 

$$\mathbb{Q} \subseteq \mathbb{Q}(\gamma) \subseteq \mathbb{Q}(\delta_1, \dots, \delta_k, i),$$

the statement follows from Corollary 1.11. □ 

In particular, constructible real numbers must satisfy the same condition. 

Consider the problem of trisecting an angle. We know we can construct an 
angle of 60° (this is a by-product of the construction of an equilateral triangle). 
Constructing an angle of 20° is equivalent (Exercise 3.10) to constructing the 
complex number $\gamma$ on the unit circle and with argument π/9. 

Now $\gamma^9 = -1$; that is, $\gamma$ satisfies the polynomial 

$$t^9 + 1 = (t^3 + 1)(t^6 - t^3 + 1).$$

It does not satisfy $t^3 + 1$; therefore its minimal polynomial over $\mathbb{Q}$ is a factor of 

$$t^6 - t^3 + 1.$$

However, this polynomial is irreducible over $\mathbb{Q}$ (substitute $t \mapsto t - 1$, and apply 
Eisenstein’s criterion), so 

$$[\mathbb{Q}(\gamma) : \mathbb{Q}] = 6.$$

By Corollary 3.6, $\gamma$ is not constructible. Therefore there exist constructible angles 
which cannot be trisected. 
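The substitution used in the irreducibility argument can be carried out mechanically. The following is a minimal sketch; the helper names `shift` and `eisenstein` are illustrative, polynomials are stored as integer coefficient lists with the constant term first, and the prime 3 is the one that happens to work for this polynomial.

```python
from math import comb


def shift(coeffs, s):
    """Coefficients of f(t + s), given those of f(t) (constant term first)."""
    out = [0] * len(coeffs)
    for k, c in enumerate(coeffs):
        for j in range(k + 1):
            # (t + s)^k contributes comb(k, j) * s^(k - j) to the coefficient of t^j.
            out[j] += c * comb(k, j) * s ** (k - j)
    return out


def eisenstein(coeffs, p):
    """True if Eisenstein's criterion applies to the polynomial at the prime p."""
    *lower, lead = coeffs
    return (lead % p != 0
            and all(c % p == 0 for c in lower)
            and lower[0] % (p * p) != 0)


f = [1, 0, 0, -1, 0, 0, 1]      # t^6 - t^3 + 1
g = shift(f, -1)                # f(t - 1)
print(g, eisenstein(g, 3))
```

The shifted polynomial is t⁶ − 6t⁵ + 15t⁴ − 21t³ + 18t² − 9t + 3: every coefficient below the leading one is divisible by 3, and the constant term is not divisible by 9. Irreducibility of f(t − 1) is equivalent to irreducibility of f(t), since the substitution is invertible.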

Similarly, cubes cannot be doubled because that would imply the 
constructibility of $\sqrt[3]{2}$, which has degree 3 over $\mathbb{Q}$, contradicting Corollary 3.6. 

Squaring circles amounts to constructing π, and π is not even algebraic 
(although, as we have mentioned, the proof of this fact is not elementary), so this is 
also not possible. 

As another example, constructing a regular 7-gon would amount to constructing 
a 7-th (complex) root of 1, $\zeta$. 

By definition, $\zeta$ satisfies 

$$t^7 - 1 = (t - 1)(t^6 + t^5 + \cdots + t + 1);$$

as $\zeta \ne 1$, $\zeta$ must satisfy the cyclotomic polynomial $t^6 + \cdots + 1$. This is irreducible 
(Example V.5.19); hence again we find that $\zeta$ has degree 6 over $\mathbb{Q}$, and Corollary 3.6 
implies that the regular 7-gon cannot be constructed with straightedge and compass. 

Of course ‘7’ is not too special: if p is a positive prime integer, the cyclotomic 
polynomial of degree p − 1 is irreducible (Example V.5.19 again); hence 

$$[\mathbb{Q}(\zeta_p) : \mathbb{Q}] = p - 1,$$

where $\zeta_p$ is the complex p-th root of 1 with argument 2π/p. Therefore, Corollary 3.6 
reveals that if p is prime, then the regular p-gon can be constructed only if p − 1 is 
a power of 2. This is even more restrictive than it looks at first, since if $p = 2^k + 1$ 
is prime, then necessarily k is itself a power of 2 (Exercise 3.15). Primes of the 
form $2^{2^\ell} + 1$ are called Fermat primes: 

3, 5, 17, 257, 65537 

(the ‘next one’, $2^{32} + 1 = 4294967297 = 641 \cdot 6700417$, is not prime; in fact, no one 
knows if there are any other Fermat primes, and yet some people conjecture that 
there are infinitely many). 
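The arithmetic in this paragraph is small enough to verify directly. The following is a minimal sketch using trial division; the helper names are illustrative, and the bound 300 in the last line is an arbitrary choice made only to keep the search short.

```python
def is_prime(n):
    """Primality by trial division (adequate for the small numbers used here)."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def is_power_of_two(n):
    return n >= 1 and (n & (n - 1)) == 0


# The numbers 2^(2^l) + 1 for l = 0, ..., 5.
fermat = [2 ** (2 ** l) + 1 for l in range(6)]
# Odd primes p < 300 satisfying the necessary condition "p - 1 is a power of 2".
candidates = [p for p in range(3, 300) if is_prime(p) and is_power_of_two(p - 1)]
print(fermat[:5], fermat[5] == 641 * 6700417, candidates)
```

The first five numbers in the list are prime, the sixth factors as stated, and the only odd primes below 300 passing the necessary condition of Corollary 3.6 are 3, 5, 17, 257. The condition is necessary only: at this point in the text nothing has been said about whether it suffices.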

These considerations alone do not tell us that if p is a Fermat prime, then the 
regular p-gon can be constructed with straightedge and compass; but they do tell 
us that the next case to consider would be $p = 2^4 + 1 = 17$, and it just so happens 
that 

$$\cos\frac{2\pi}{17} = \frac{\sqrt{17} - 1 + \sqrt{34 - 2\sqrt{17}}}{16} + \frac{1}{8}\sqrt{17 + 3\sqrt{17} - \sqrt{34 - 2\sqrt{17}} - 2\sqrt{34 + 2\sqrt{17}}},$$

as noted by Gauss at age 19. Therefore (by Theorem 3.4) the 17-gon is constructible 
by straightedge and compass. As we have announced already, the situation will be 
further clarified soon, allowing us to bypass heavy-duty trigonometry. 
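A numerical check of the radical expression for cos(2π/17) is straightforward. The following is a minimal floating-point sketch, assuming the expression is written in the commonly quoted form of Gauss’s formula (the grouping of terms used here is an assumption and may differ from other presentations); it confirms the value numerically and does not prove the identity.

```python
import math

s = math.sqrt
root17 = s(17)
# (sqrt17 - 1 + sqrt(34 - 2 sqrt17)) / 16
#   + (1/8) sqrt(17 + 3 sqrt17 - sqrt(34 - 2 sqrt17) - 2 sqrt(34 + 2 sqrt17))
gauss = (-1 + root17 + s(34 - 2 * root17)) / 16 \
    + s(17 + 3 * root17 - s(34 - 2 * root17) - 2 * s(34 + 2 * root17)) / 8
target = math.cos(2 * math.pi / 17)
print(gauss, target, abs(gauss - target))
```

Only rational numbers and nested square roots appear, so the expression lies in the top field of a tower of real quadratic extensions of the rationals, as Theorem 3.4 requires.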


Exercises 


3.1. ▷ Prove that if A, B are constructible, then the midpoint of the segment AB 
is also constructible. Prove that if two lines $\ell_1$, $\ell_2$ are constructible and not parallel, 
then the two lines bisecting the angles formed by $\ell_1$ and $\ell_2$ are also constructible. 
[§3.2] 

3.2. ▷ Prove that if a, b are constructible numbers, then so is a − b. [§3.1] 

3.3. Find an explicit straightedge-and-compass construction for the product of two 
real numbers. 

3.4. Show how to square a triangle by straightedge and compass. 

3.5. ▷ Let F be a subfield of $\mathbb{R}$. 

• Let $A = (x_A, y_A)$, $B = (x_B, y_B)$ be two points in $\mathbb{R}^2$, with $x_A, y_A, x_B, y_B \in F$. 
Prove that the line through A, B is defined over F (that is, it admits an equation 
with coefficients in F). 

• Prove that the circle with center at A and containing B is defined over F. 

• Let $\ell_1$, $\ell_2$ be two distinct, nonparallel lines in $\mathbb{R}^2$, defined over F, and let 
$(x, y) = \ell_1 \cap \ell_2$. Prove that $x, y \in F$. 

[§3.2] 

3.6. ▷ Devise a way to construct a regular pentagon with straightedge and compass. 

3.7. ▷ Identify the (real) plane with $\mathbb{C}$, and place O, P at $0, 1 \in \mathbb{C}$. Let $\mathscr{C}_{\mathbb{C}}$ be the 
set of all constructible points, viewed as a subset of $\mathbb{C}$. Prove that $\mathscr{C}_{\mathbb{C}}$ is a subfield 
of $\mathbb{C}$. [§3.1] 

3.8. For 6 S C, 5 ^ 0, let 8g be the argument of 5 (that is, the angle formed by 
the line through 0 and 5 with the real axis). Prove that ^ b if and only if |5|, 
cos Of, sin Of are all constructible real numbers. 

3.9. t> Let 7 i , . . . , 76 € %'<c be constructible complex numbers, and let K be the 
field Q( 7 i, ... , 7 *) C Let 8 be a complex number such that [K (8) : K] = 2. 
Prove that 5 € ifc- 

Deduce that there are no irreducible polynomials of degree 2 over and that 
is the smallest sublield of C with this property. [§3.2] 

3.10. > Prove that if two lines in any constructible configuration form an angle 
of $\theta$, then the line through O forming an angle of $\theta$ clockwise with respect to the 
line OP is constructible. 

Deduce that an angle $\theta$ is constructible anywhere in the plane if and only if 
$\cos\theta$ is constructible. [§3.3] 

3.11. > Verify that $\gamma = \cos(2\pi/5)$ satisfies the relation (*) given in §3.2. (If 
$\sigma = \sin(2\pi/5)$, note that $z = \gamma + i\sigma$ satisfies $z^5 = 1$.) [§3.2] 

3.12. Prove that the angles of 1° and 2° are not constructible. (Hint: Given what 
we know at this point, you only need to recall that there exist trigonometric formulas 
for the sum of two angles; the exact shape of these formulas is not important.) For 
what integers n is the angle n° constructible? 

3.13. > Prove that a = f in the following picture: 



This says that angles can be trisected if we allow the use of a ‘ruler’, that is, a 
straightedge with markings (here, the construction works if we can mark a fixed 
distance of 1 on the ruler). Apparently, this construction was known to Archimedes. 

[§3.1] 

3.14. Prove that the regular 9-gon is not constructible. 

3.15. > Prove that if $2^k + 1$ is prime, then k is a power of 2. [§3.3] 


4. Field extensions, II 

It is time to continue our survey of different flavors of field extensions. The keywords 
here are splitting fields, normal, separable. 

4.1. Splitting fields and normal extensions. In §2 we have constructed the 
algebraic closure $\overline{k}$ of any given field $k$: every polynomial in $k[x]$ factors as a 
product of linear terms (that is, ‘splits’) in $\overline{k}[x]$, and $\overline{k}$ is the ‘smallest’ extension 
of k satisfying this property. 

Here is an analogous, but more modest, requirement: given a subset $\mathscr{F} \subseteq k[x]$ 
of polynomials, construct an extension $k \subseteq F$ such that every polynomial in $\mathscr{F}$ 
splits as a product of linear terms over F, and require F to be as small as possible 
with this property. We then call F the splitting field for $\mathscr{F}$. In practice we 
will only be interested in the case in which $\mathscr{F}$ is a finite collection of polynomials 
$f_1(x), \dots, f_r(x)$; then requiring that each $f_i(x)$ splits over F is equivalent to 
requiring that the product $f_1(x) \cdots f_r(x)$ splits over F. 

That is, for our purposes it is not restrictive to assume that $\mathscr{F}$ consists of a 
single polynomial $f(x) \in k[x]$. 

Definition 4.1. Let k be a field, and let $f(x) \in k[x]$ be a polynomial of degree d. 
The splitting field for f(x) over k is an extension F of k such that 

\[ f(x) = c \prod_{i=1}^{d} (x - \alpha_i) \]

splits in $F[x]$, and further $F = k(\alpha_1, \dots, \alpha_d)$ is generated over k by the roots of 
f(x) in F. ⌟

Note that it is clear that a splitting field exists: given $f(x) \in k[x]$, the subfield 
$F \subseteq \overline{k}$ generated over k by the roots of f(x) in $\overline{k}$ satisfies the requirements in 
Definition 4.1. But we have written the splitting field, and this is justified by the 
uniqueness part of the following basic observation, which also evaluates ‘how big’ 
this extension can be. 

Lemma 4.2. Let k be a field, and let $f(x) \in k[x]$. Then the splitting field F for 
f(x) over k is unique up to isomorphism, and $[F : k] \le (\deg f)!$. 

In fact, if $\iota : k' \to k$ is any isomorphism of fields and $g(x) \in k'[x]$ is such that 
$f(x) = \iota(g(x))$, then $\iota$ extends to an isomorphism of any splitting field of g(x) over 
$k'$ to any splitting field of f(x) over k. 


Proof. We first construct explicitly a splitting field and obtain the bound on the 
degree mentioned in the statement. Then we prove the second part, which implies 
uniqueness up to isomorphism. 

The construction of a splitting field and the given bound on the degree are an 
easy application of our basic simple extension found in Proposition V.5.7. Arguing 
inductively, assume the splitting field has been constructed and the bound has 
been proved for all fields and all polynomials of degree $(\deg f - 1)$. Let $q(t)$ be any 
irreducible factor of $f(t)$ over k; then 

\[ k \subseteq F' := \frac{k[t]}{(q(t))} \]

is an extension of degree $\deg q \le \deg f$, in which $q(x)$ (and hence $f(x)$) has a root 
(the coset $\alpha$ of $t$) and therefore a linear factor $x - \alpha$. The polynomial $h(x) := 
f(x)/(x - \alpha) \in F'[x]$ has degree $(\deg f - 1)$; therefore a splitting field F exists for 
h(x) over $F'$, and 

\[ [F : F'] \le (\deg f - 1)!. \]

The factors of f(x) over F are $(x - \alpha)$ and the factors of h(x), which are linear; it 
follows that F is a splitting field for f(x) and 

\[ [F : k] = [F : F'][F' : k] \le (\deg f)(\deg f - 1)! = (\deg f)! \]

as stated. 

To prove that isomorphisms $\iota : k' \to k$ extend to isomorphisms of splitting 
fields as stated, let G be a splitting field for g(x) and consider the composition 
$k' \to k \subseteq \overline{k}$. Since the extension $k' \subseteq G$ is algebraic, by Lemma 2.8 there exists 
a morphism of extensions $\tilde\iota : G \to \overline{k}$ over $k'$. Since G is generated over $k'$ by the 
roots of g(x) in G, we have 

\[ \tilde\iota(G) = k(\alpha_1, \dots, \alpha_d) \subseteq \overline{k}, \]

where the $\alpha_i$'s are the roots of $f(x) = \iota(g(x))$ in $\overline{k}$. Therefore $L := \tilde\iota(G)$ is independent 
of the chosen isomorphism $\iota$ and splitting field G. Applying this observation 
to the given $\iota$ and to the identity $k \to k$ proves the statement (as both G and F 
are isomorphic to L). □ 

Example 4.3. By definition, $\mathbb{Q}(i)$ is the splitting field of $x^2 + 1$ over $\mathbb{Q}$, and $\mathbb{C}$ is 
the splitting field for the same polynomial, over $\mathbb{R}$. ⌟

Example 4.4. The splitting field F of $x^8 - 1$ over $\mathbb{Q}$ is generated by $\zeta := e^{2\pi i/8}$: 
indeed, the roots of $x^8 - 1$ are all the 8-th roots of 1, and all of them are powers 
of $\zeta$. 

In fact, $\zeta$ is a root of the polynomial $x^4 + 1$, which is irreducible over $\mathbb{Q}$; therefore 
$F = \mathbb{Q}(\zeta)$ is ‘already’ the splitting field of $x^4 + 1$. The degree of F over $\mathbb{Q}$ is 

\[ [\mathbb{Q}(\zeta) : \mathbb{Q}] = 4, \]

way less than the bounds 8!, 4! obtained in Lemma 4.2. 

To understand this splitting field (even) better, note that $i = \zeta^2$ is in F, and 
so is $\sqrt{2} = \zeta + \zeta^7$; thus F contains $\mathbb{Q}(i, \sqrt{2})$. Conversely, $\zeta = \frac{\sqrt{2}}{2}(1 + i) \in \mathbb{Q}(i, \sqrt{2})$. 
Therefore, the splitting field of $x^4 + 1$ (a.k.a. the splitting field of $x^8 - 1$) is $\mathbb{Q}(i, \sqrt{2})$. 
Analyzing $\mathbb{Q} \subseteq \mathbb{Q}(i, \sqrt{2})$ (as we did for the extension in Example 1.19) shows that 
its group of automorphisms over $\mathbb{Q}$ is $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$. ⌟
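
The identities used in Example 4.4 are easy to sanity-check numerically. The following is only a floating-point sketch (the helper `close` and its tolerance are choices made here, not part of the text), so it illustrates the relations rather than proving them.

```python
import cmath
import math

# zeta = e^{2*pi*i/8}, a primitive 8th root of unity, as in Example 4.4.
zeta = cmath.exp(2j * math.pi / 8)


def close(z, w, tol=1e-12):
    """Floating-point comparison of two complex numbers (tolerance is arbitrary)."""
    return abs(z - w) < tol


# zeta is a root of x^4 + 1 (and therefore of x^8 - 1).
root_of_x4_plus_1 = close(zeta**4 + 1, 0)

# i = zeta^2 and sqrt(2) = zeta + zeta^7, so both lie in Q(zeta).
i_from_zeta = close(zeta**2, 1j)
sqrt2_from_zeta = close(zeta + zeta**7, math.sqrt(2))

# Conversely, zeta = (sqrt(2)/2) * (1 + i) lies in Q(i, sqrt(2)).
zeta_from_i_sqrt2 = close(math.sqrt(2) / 2 * (1 + 1j), zeta)

print(root_of_x4_plus_1, i_from_zeta, sqrt2_from_zeta, zeta_from_i_sqrt2)
```

A numerical check of this kind says nothing about irreducibility or degrees; it only confirms that the two fields contain each other's generators.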

Example 4.5. Variation on the theme: $x^4 - 1$. The situation changes if instead 
of $x^4 + 1$ we consider $x^4 - 1$: this polynomial factors over $\mathbb{Q}$, 

\[ x^4 - 1 = (x - 1)(x + 1)(x^2 + 1), \]

and it follows that the splitting field is the same as for $x^2 + 1$, that is, just $\mathbb{Q}(i)$. ⌟

Example 4.6. Variation on the theme: $x^4 + 2$. The overoptimistic reader may 
now hope that the difference between the splitting fields of $x^4 + 1$ vs. $x^4 - 1$ over 
$\mathbb{Q}$ is just due to the fact that the first polynomial is irreducible over $\mathbb{Q}$ and the 
second is not. This example will nip any such guess in the bud. With notation as 
in Example 4.4, the roots of $x^4 + 2$ are 

\[ \sqrt[4]{2}\,\zeta, \quad \sqrt[4]{2}\,\zeta^3, \quad \sqrt[4]{2}\,\zeta^5, \quad \sqrt[4]{2}\,\zeta^7. \]

Therefore, with $K = \mathbb{Q}(\sqrt[4]{2}\,\zeta, \sqrt[4]{2}\,\zeta^3, \sqrt[4]{2}\,\zeta^5, \sqrt[4]{2}\,\zeta^7)$ the splitting field of $x^4 + 2$, 

\[ K \subseteq \mathbb{Q}(\zeta, \sqrt[4]{2}) = \mathbb{Q}(i, \sqrt{2}, \sqrt[4]{2}) = \mathbb{Q}(i, \sqrt[4]{2}). \]

On the other hand, $\sqrt{2} = (\sqrt[4]{2}\,\zeta)^3 / (\sqrt[4]{2}\,\zeta^3) \in K$; hence $i = (\sqrt[4]{2}\,\zeta)^2 / \sqrt{2} \in K$; hence 
$\zeta = \frac{\sqrt{2}}{2}(1 + i) \in K$; hence $\sqrt[4]{2} = (\sqrt[4]{2}\,\zeta)/\zeta \in K$. Therefore, $\mathbb{Q}(i, \sqrt[4]{2}) \subseteq K$, and the 
conclusion is that the splitting field of $x^4 + 2$ equals $K = \mathbb{Q}(i, \sqrt[4]{2})$. A simple degree 
computation (Exercise 4.3) shows $[K : \mathbb{Q}] = 8$ and in particular the splitting field 
of $x^4 + 2$ is certainly not isomorphic to the splitting field of $x^4 + 1$. ⌟
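
The chain of identities in Example 4.6 can be traced step by step in floating point. This is a hedged numerical sketch only: the variable names are illustrative, the tolerance is arbitrary, and nothing here establishes the degree computation of Exercise 4.3.

```python
import cmath
import math

zeta = cmath.exp(2j * math.pi / 8)   # primitive 8th root of unity
r = 2 ** 0.25                        # real fourth root of 2

# The four roots of x^4 + 2 listed in Example 4.6.
roots = [r * zeta**k for k in (1, 3, 5, 7)]
roots_ok = all(abs(z**4 + 2) < 1e-9 for z in roots)

# Recover sqrt(2), i, zeta and the fourth root of 2 using only the roots,
# following the same sequence of steps as the text.
sqrt2 = roots[0] ** 3 / roots[1]          # (2^{1/4} zeta)^3 / (2^{1/4} zeta^3)
i_val = roots[0] ** 2 / sqrt2             # (2^{1/4} zeta)^2 / sqrt(2)
zeta_val = sqrt2 / 2 * (1 + i_val)        # (sqrt(2)/2)(1 + i)
fourth_root = roots[0] / zeta_val         # (2^{1/4} zeta) / zeta

sqrt2_ok = abs(sqrt2 - math.sqrt(2)) < 1e-9
i_ok = abs(i_val - 1j) < 1e-9
zeta_ok = abs(zeta_val - zeta) < 1e-9
fourth_root_ok = abs(fourth_root - r) < 1e-9

print(roots_ok, sqrt2_ok, i_ok, zeta_ok, fourth_root_ok)
```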

Examples such as these have always left me with the impression that field 
theory must be rather mysterious: innocent-looking variations on the parameters 
of a problem may cause dramatic changes in the corresponding field extensions. On 
the plus side, this hints that field theory can indeed be a very precise tool in (for 
example) the study of polynomials. 

Splitting fields will play an important role in the rest of the story. They are 
even more special than they may appear to be at first: it turns out that not only do 
they split the given polynomial, but they also automatically split any irreducible 
polynomial which dares touch them with a root. This makes splitting fields normal 
extensions: 

Definition 4.7. A field extension $k \subseteq F$ is normal if for every irreducible polynomial 
$f(x) \in k[x]$, f(x) has a root in F if and only if f(x) splits as a product of 
linear factors over F. ⌟

Theorem 4.8. A field extension $k \subseteq F$ is finite and normal if and only if F is the 
splitting field of some polynomial $f(x) \in k[x]$. 

Proof. Assume $k \subseteq F$ is finite and normal. Then F is finitely generated: $F = 
k(\alpha_1, \dots, \alpha_r)$ with $\alpha_i$ algebraic over k. Let $p_i(t)$ be the minimal polynomial of $\alpha_i$ 
over k. As F is normal over k, each $p_i(t)$ splits completely over F, and hence so 
does $f(t) = p_1(t) \cdots p_r(t)$. It follows that F is the splitting field of f(x). 

Conversely, assume that F is a splitting field for a polynomial $f(x) \in k[x]$, and 
let $p(x) \in k[x]$ be an irreducible polynomial, such that F contains a root $\alpha$ of p(x). 
Viewing F as a subfield of the algebraic closure $\overline{k}$, let $\beta \in \overline{k}$ be any other root of 
p(x); we are going to show that $\beta \in F$. This will prove that F contains all roots of 
p(x), implying that $k \subseteq F$ is normal, and $k \subseteq F$ is finite by Lemma 4.2. 

By Proposition 1.5, there exists an isomorphism $\iota : k(\alpha) \to k(\beta)$ extending the 
identity on k and sending $\alpha \mapsto \beta$. We also consider the subfield $F(\beta) \subseteq \overline{k}$, viewed 
as an extension of $k(\beta)$. Putting everything in one diagram: 

\[
\begin{array}{ccccc}
k(\alpha) & \hookrightarrow & F & \hookrightarrow & \overline{k} \\
{\scriptstyle \iota}\downarrow & & & & \\
k(\beta) & \hookrightarrow & F(\beta) & \hookrightarrow & \overline{k}
\end{array}
\]

(Note: If we join the two copies of $\overline{k}$ on the right by the identity map, the corresponding 
diagram does not commute: $\iota$ sends $\alpha \in \overline{k}$ in the top row to $\beta \in \overline{k}$ in the 
bottom row.) Now observe that F may be viewed as the splitting field of f(x) over 
$k(\alpha)$: indeed, it contains all the roots of f(x) and is generated over k (and hence 
over $k(\alpha)$) by these roots. By the same token, $F(\beta)$ is the splitting field for f(x) 
over $k(\beta)$. By Lemma 4.2, $\iota$ extends to an isomorphism $\iota' : F \to F(\beta)$. Since $\iota$ 
restricts to the identity on k, $\iota'$ is an isomorphism of k-vector spaces; in particular, 
$\dim_k F = \dim_k F(\beta)$ (keep in mind that splitting fields are finite extensions). 

Now consider the different k-linear map of k-vector spaces $i : F \to F(\beta)$ simply 
given by the inclusion within $\overline{k}$: 

\[ k \subseteq F \subseteq F(\beta) \subseteq \overline{k}. \]

Since F and $F(\beta)$ are k-vector spaces of the same finite dimension, $i$ must also be 
an isomorphism. In other words, 

\[ [F(\beta) : F] = 1; \]

that is, $\beta \in F$ as needed. □ 


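Remark 4.9. This argument is somewhat delicate. With notation as in the proof, 
the diagram 

\[
\begin{array}{ccc}
k(\alpha) & \hookrightarrow & F \\
{\scriptstyle \iota}\downarrow & & \downarrow{\scriptstyle \iota'} \\
k(\beta) & \hookrightarrow & F(\beta)
\end{array}
\]

is commutative, while the diagram 

\[
\begin{array}{ccc}
k(\alpha) & \hookrightarrow & F \\
{\scriptstyle \iota}\downarrow & & \downarrow{\scriptstyle i} \\
k(\beta) & \hookrightarrow & F(\beta)
\end{array}
\]

is not commutative if $\beta \neq \alpha$. Indeed, $\iota$ sends $\alpha$ to $\beta$, while $i$ sends $\alpha$ to $\alpha$. The 
key point in the argument is the observation that if there exists one isomorphism 
between two finite-dimensional vector spaces V, W, then every injective linear map 
$V \to W$ must be an isomorphism. Finite dimensionality is necessary in order to 
draw this conclusion; cf. Exercise VI.6.5. ⌟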


Example 4.10. If a complex root of an irreducible polynomial $p(x) \in \mathbb{Q}[x]$ may 
be expressed as a polynomial in $i$ and $\sqrt{2}$ with rational coefficients, then all roots 
of p(x) may be expressed likewise in terms of $i$ and $\sqrt{2}$. Indeed, we have checked 
(Example 4.6) that $\mathbb{Q}(i, \sqrt{2})$ is a splitting field over $\mathbb{Q}$; hence it is a normal extension 
of $\mathbb{Q}$. ⌟

4.2. Separable polynomials. Our intuition may lead us to think that if a polynomial 
factors as a product of linear factors and we are not ‘purposely’ repeating 
one of the factors (as in $(x - 1)^2(x - 2)$), then these will be distinct. For example, 
surely irreducible polynomials necessarily split as products of distinct factors in an 
algebraic closure, right? Wrong. 

Example 4.11. Let p be a prime, and consider the field $\mathbb{F}_p(t)$ of rational functions 
over $\mathbb{F}_p$. Then the polynomial 

\[ x^p - t \in \mathbb{F}_p(t)[x] \]

is irreducible: by Eisenstein's criterion it is irreducible in $\mathbb{F}_p[t][x]$ (since $(t)$ is prime 
in $\mathbb{F}_p[t]$), hence in $\mathbb{F}_p(t)[x]$ by Proposition V.4.16. Let $u$ be a root of this polynomial 
in an extension L of $\mathbb{F}_p(t)$ (for example L could be the algebraic closure of $\mathbb{F}_p(t)$, 
or more modestly a splitting field for the polynomial). Then (Exercise 4.8) 

\[ x^p - t = (x - u)^p \]

in $L[x]$; that is, $u$ has multiplicity p as a root of $x^p - t$. 

In other words, the minimal polynomial over $\mathbb{F}_p(t)$ of $u \in L$ vanishes p times 
at $u$, and there is nothing to do about this: no smaller power of $(x - u)$ than 
$(x - u)^p = x^p - t$ has coefficients in $\mathbb{F}_p(t)$ (a smaller power would give a nontrivial 
factor of $x^p - t$, and $x^p - t$ is irreducible). ⌟

We have always found this example difficult to visualize, because of intuition 
developed in characteristic 0, and as we will see, no such pathology can occur in 
characteristic 0. 

Definition 4.12. Let k be a field. A polynomial $f(x) \in k[x]$ is separable if it 
has no multiple factors over its splitting field; f(x) is inseparable if it has multiple 
factors over its splitting field. ⌟

Thus, the polynomial $x^p - t$ in Example 4.11 is inseparable. We suppose the 
terminology reflects the fact that we cannot ‘separate’ its roots: they come clumped 
together like quarks in a proton, and we cannot take them apart. 

Of course Definition 4.12 does not depend on the chosen splitting field, since 
these are unique up to isomorphism (Lemma 4.2). In fact, to detect separability, 
we can use any field in which the polynomial splits as a product of linear factors 
(by Exercise 4.1). The first, somewhat surprising, observation about separability is 
that we can in fact detect it without leaving the field of coefficients of f(x). This 
fact uses a notion borrowed from calculus: for a polynomial 

\[ f(x) = a_0 + a_1 x + a_2 x^2 + \cdots + a_n x^n, \]

we denote by $f'(x)$ the ‘derivative’ 

\[ f'(x) = a_1 + 2a_2 x + \cdots + n a_n x^{n-1}. \]

Of course this is a purely formal operation; no limiting process is at work over 
arbitrary fields. Still, all expected properties of derivatives hold, as the reader 
should check: for example, $(fg)' = f'g + fg'$ as usual. 

Lemma 4.13. Let k be a field, and let $f(x) \in k[x]$. Then f(x) is separable if and 
only if $f(x)$ and $f'(x)$ are relatively prime. 

By definition, $f(x)$ and $f'(x)$ are relatively prime precisely when the greatest 
common divisor of $f(x)$ and $f'(x)$ is 1. Note that if $k \subseteq F$, the gcd of $f(x)$ and 
$f'(x)$ is the same whether it is considered in $k[x]$ or in $F[x]$: for example because it 
can be computed by applying the Euclidean algorithm (§V.2.4), and this proceeds 
in exactly the same way whether it is performed in $k[x]$ or in $F[x]$. 

Proof. First assume that f(x) is not separable. Then f(x) has a multiple root in 
a splitting field F; that is, 

\[ f(x) = (x - \alpha)^m g(x) \]

for some $\alpha \in F$, $g(x) \in F[x]$, and $m \ge 2$. Therefore 

\[ f'(x) = m(x - \alpha)^{m-1} g(x) + (x - \alpha)^m g'(x), \]

and it follows that $(x - \alpha)$ is a common factor of $f(x)$ and $f'(x)$ in $F[x]$. Therefore 
$\gcd(f(x), f'(x)) \neq 1$ in $F[x]$; hence $\gcd(f(x), f'(x)) \neq 1$ in $k[x]$; that is, $f(x)$, $f'(x)$ 
are not relatively prime. 

Conversely, assume $\gcd(f(x), f'(x)) \neq 1$, so $f(x)$ and $f'(x)$ have a common 
irreducible factor $(x - \alpha)$ over the algebraic closure $\overline{k}$ of k. Write $f(x) = (x - \alpha)h(x)$; 
we have 

\[ f'(x) = h(x) + (x - \alpha)h'(x), \]

and it follows that $x - \alpha$ divides $h(x)$ since it divides $f'(x)$. But then 

\[ (x - \alpha)^2 \mid f(x); \]

hence f(x) is inseparable. □ 

For example, the polynomial $x^p - t$ of Example 4.11 may be seen to be inseparable 
without invoking splitting fields: the derivative of $x^p - t$ equals $p x^{p-1} = 0$ in 
characteristic p, and $\gcd(x^p - t, 0) = x^p - t \neq 1$. 
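
The criterion of Lemma 4.13 is entirely mechanical, so it can be tried out on small cases. The sketch below works only over a prime field $\mathbb{F}_p$ (not over $\mathbb{F}_p(t)$ as in Example 4.11), represents a polynomial as a list of coefficients starting from degree 0, and uses function names chosen here for illustration. It assumes Python 3.8 or later for the modular inverse `pow(a, -1, p)`.

```python
def trim(f):
    """Drop trailing zero coefficients (the highest-degree terms)."""
    f = list(f)
    while f and f[-1] == 0:
        f.pop()
    return f


def derivative(f, p):
    """Formal derivative of f over F_p; coefficients are listed from degree 0."""
    return trim([(i * c) % p for i, c in enumerate(f)][1:])


def poly_mod(f, g, p):
    """Remainder of f on division by a nonzero polynomial g over F_p."""
    f, g = trim([c % p for c in f]), trim([c % p for c in g])
    inv_lead = pow(g[-1], -1, p)  # inverse of the leading coefficient of g
    while len(f) >= len(g):
        factor = (f[-1] * inv_lead) % p
        shift = len(f) - len(g)
        for i, c in enumerate(g):
            f[shift + i] = (f[shift + i] - factor * c) % p
        f = trim(f)  # the leading term has been cancelled
    return f


def poly_gcd(f, g, p):
    """Monic gcd of f and g over F_p, computed by the Euclidean algorithm."""
    f, g = trim([c % p for c in f]), trim([c % p for c in g])
    while g:
        f, g = g, poly_mod(f, g, p)
    if not f:
        return []
    inv_lead = pow(f[-1], -1, p)
    return [(c * inv_lead) % p for c in f]


def is_separable(f, p):
    """Criterion of Lemma 4.13: f is separable iff gcd(f, f') = 1."""
    return poly_gcd(f, derivative(f, p), p) == [1]


# (x - 1)^2 (x - 2) = x^3 + 2x^2 + 2x + 1 over F_3: a repeated factor.
print(is_separable([1, 2, 2, 1], 3))
# x^3 - x over F_3: its derivative is the constant -1.
print(is_separable([0, 2, 0, 1], 3))
# x^3 - 2 over F_3: the derivative vanishes identically.
print(is_separable([1, 0, 0, 1], 3))
```

The last case mirrors the computation in the text: the derivative is zero, so the gcd is the polynomial itself. Note that $x^3 - 2$ is reducible over $\mathbb{F}_3$, so this is not a counterexample to anything said about irreducible polynomials over finite fields.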

This example captures one of the key features of inseparability: 

Lemma 4.14. Let k be a field, and let $f(x) \in k[x]$ be an inseparable irreducible 
polynomial. Then $f'(x) = 0$. 

Proof. Since $f(x)$ is inseparable, $f(x)$ and $f'(x)$ have a common irreducible factor 
$q(x)$ by Lemma 4.13; but as $f(x)$ is itself irreducible, $q(x)$ must be an associate 
of $f(x)$, and in particular its degree is larger than the degree of $f'(x)$ if $f'(x) \neq 0$. 
As $q(x) \mid f'(x)$, the only option is that $f'(x) = 0$. □ 

This already tells us that irreducible polynomials are necessarily separable in 
characteristic 0: in characteristic 0, the derivative of a nonconstant polynomial 
clearly cannot vanish. In fact, Lemma 4.14 gives us a precise picture of what 
inseparable, irreducible polynomials must look like. If 

\[ f(x) = \sum_{i=0}^{n} a_i x^i \]

is irreducible and inseparable, then the characteristic of the field must be a positive 
prime p, and by Lemma 4.14 we must have 

\[ f'(x) = \sum_{i=0}^{n} i a_i x^{i-1} = 0; \]

that is, $i a_i = 0$ for all $i$. Now $i a_i = 0$ is automatic if $i$ is a multiple of p, and it 
implies $a_i = 0$ for all the indices $i$ which are not multiples of p. Therefore, the only 
nonvanishing coefficients in f(x) must be those corresponding to indices which are 
multiples of p: 

\[ f(x) = a_0 + a_p x^p + a_{2p} x^{2p} + \cdots; \]

therefore, f(x) must in fact be a polynomial in $x^p$. (The reader will refine this 
picture further by working out Exercise 4.13 and following.) 

This leads us to a precise result linking separability and a flavor of fields that 
we have not yet run into. 

Definition 4.15. Let k be a field of characteristic $p > 0$. The Frobenius homomorphism 
is the map $k \to k$ defined by $x \mapsto x^p$. ⌟

The Frobenius homomorphism may not look like a homomorphism of rings, but 
it is (Exercise 4.8). It must be injective, as is every nontrivial ring homomorphism 
from a field; but it is not necessarily surjective. 
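
One way to see why $x \mapsto x^p$ respects addition is to expand $(a+b)^p$ by the binomial theorem and look at the middle coefficients. The sketch below checks numerically that $\binom{n}{k}$ is divisible by $n$ for all $0 < k < n$ when $n$ is a small prime, and that this can fail for composite $n$. The function name is illustrative, `math.comb` requires Python 3.8 or later, and a finite check is of course not a proof of Exercise 4.8.

```python
from math import comb


def middle_binomials_vanish_mod(n):
    """True when C(n, k) is divisible by n for every 0 < k < n."""
    return all(comb(n, k) % n == 0 for k in range(1, n))


# For a prime p every middle term of (a + b)^p carries a factor p,
# so only a^p + b^p survives in characteristic p.
for n in (2, 3, 5, 7, 11, 13):
    print(n, middle_binomials_vanish_mod(n))

# For composite n the cancellation can fail: C(4, 2) = 6 is not divisible by 4.
print(4, middle_binomials_vanish_mod(4))
```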

Definition 4.16. A field k is perfect if char k = 0 or if char k > 0 and the Frobenius 
homomorphism is surjective. ⌟

Proposition 4.17. Let k be a field. Then k is perfect if and only if all irreducible 
polynomials in $k[x]$ are separable. 

Proof. We will prove that irreducible polynomials over a perfect field are separable, 
leaving the other implication to the reader (Exercise 4.12). 

We have already noted that irreducible polynomials are separable over fields of 
characteristic zero. In positive characteristic p, we have observed that an inseparable 
irreducible polynomial must be of the form 

\[ f(x) = \sum_{i=0}^{n} a_i (x^p)^i. \]

Since Frobenius is surjective, there exist $b_i$ such that $b_i^p = a_i$. Thus 

\[ f(x) = \sum_{i=0}^{n} a_i (x^p)^i = \sum_{i=0}^{n} b_i^p x^{pi} = \left( \sum_{i=0}^{n} b_i x^i \right)^p = g(x)^p, \]

where $g(x) = \sum_{i=0}^{n} b_i x^i$ and keeping in mind that the Frobenius map is a homomorphism. 
But this contradicts the irreducibility of f(x), so there is no such polynomial. □ 

Corollary 4.18. Finite fields are perfect. Therefore, over finite fields, irreducible 
polynomials are separable. 

Proof. The Frobenius map is injective (because it is a homomorphism of fields), so 
it is surjective over finite fields, by the pigeon-hole principle. Therefore finite fields 
are perfect, and the second part of the statement follows from Proposition 4.17. □ 

The reader now sees why we have chosen the somewhat unusual field $\mathbb{F}_p(t)$ 
in Example 4.11: we need a field of positive characteristic, but no example of 
irreducible inseparable polynomial can be found over $\mathbb{F}_p$, by Corollary 4.18; to 
concoct an example, we need to enlarge the field so that it is infinite and to throw 
in an element (that is, $t$) for which there is no p-th root, that is, which is not in 
the image of Frobenius. 

4.3. Separable extensions and embeddings in algebraic closures. The terminology 
examined in the previous section extends to the language of field extensions. 
If $k \subseteq F$ is an extension and $\alpha \in F$, we say that $\alpha$ is separable over k if the 
minimal polynomial of $\alpha$ over k is separable; $\alpha$ is inseparable otherwise. 

Definition 4.19. An algebraic field extension $k \subseteq F$ is separable if every $\alpha \in F$ is 
separable over k. ⌟

With this terminology, Proposition 4.17 may be restated in the following more 
impressive form: 

Proposition 4.20. A field k is perfect if and only if every algebraic extension of 
k is separable. 

In particular, algebraic extensions of Q (or any field of characteristic zero) and 
of every finite field are necessarily separable. 

The separability condition is exceedingly convenient, and we will essentially 
adopt it henceforth: all extensions we will seriously consider will be separable. One 
convenient feature of separable extensions is the following alternative description 
of the separability condition. 

We have seen (Lemma 2.8) that every algebraic extension $k \subseteq F$ may be 
embedded in an algebraic closure $k \subseteq \overline{k}$, and we pointed out that this can in general 
be done in many different ways. If finite, the number of different homomorphisms 
$F \to \overline{k}$ extending the identity on k is denoted by 

\[ [F : k]_s. \]

Definition 4.21. This is the separable degree of F over k. ⌟

It is clear that this number is independent of the chosen algebraic closure $k \subseteq \overline{k}$. 
What does it have to do with separability? 

Lemma 4.22. Let $k \subseteq k(\alpha)$ be a simple algebraic extension. Then $[k(\alpha) : k]_s$ equals 
the number of distinct roots in $\overline{k}$ of the minimal polynomial of $\alpha$. In particular, 
$[k(\alpha) : k]_s \le [k(\alpha) : k]$, with equality if and only if $\alpha$ is separable over k. 

Proof. The proof is essentially (and not by coincidence) a rehash of the proof of 
Corollary 1.7. Associate with each $\iota : k(\alpha) \to \overline{k}$ extending $\mathrm{id}_k$ the image $\iota(\alpha)$, which 
must be a root of the minimal polynomial of $\alpha$. This correspondence is injective, 
since $\iota(\alpha)$ determines $\iota$ (as $\iota$ extends the identity on k). To see it is surjective, let 
$\beta \in \overline{k}$ be any other root, and consider the extension $k(\beta) \subseteq \overline{k}$; by Proposition 1.5 
there is an isomorphism $k(\alpha) \to k(\beta)$ sending $\alpha$ to $\beta$, and composing with the 
embedding $k(\beta) \subseteq \overline{k}$ defines the corresponding $\iota$, as needed. □ 

Thus, the ‘separable degree’ does detect separability, for simple extensions. 
Further, it is multiplicative over successive extensions: 

Lemma 4.23. Let $k \subseteq E \subseteq F$ be algebraic extensions. Then $[F : k]_s$ is finite if 
and only if both $[F : E]_s$, $[E : k]_s$ are finite, and in this case 

\[ [F : k]_s = [F : E]_s [E : k]_s. \]

Proof. Different embeddings of E into $\overline{k}$ extend to different embeddings of F into $\overline{k}$, 
by Lemma 2.8; and embeddings of F into $\overline{E} = \overline{k}$ extending the identity on E extend 
a fortiori the identity on k. Therefore, if any of $[F : E]_s$, $[E : k]_s$ is infinite, then 
so is $[F : k]_s$. 

For the converse implication, it suffices to prove the stated degree formula. But 
every embedding $F \subseteq \overline{k}$ extending the identity on k may be obtained in two steps: 
first extend the identity to an embedding $E \subseteq \overline{k}$, which can be done in $[E : k]_s$ ways; 
and then extend the chosen embedding $E \subseteq \overline{k} = \overline{E}$ to an embedding $F \subseteq \overline{E} = \overline{k}$, 
which can be done in $[F : E]_s$ ways. There are precisely $[F : E]_s [E : k]_s$ ways to 
do this, as stated. □ 

Lemmas 4.22 and 4.23 allow us to recast separability of finite extensions entirely 
in the light of counting embeddings in an algebraic closure: 

Proposition 4.24. Let $k \subseteq F$ be a finite extension. Then $[F : k]_s \le [F : k]$, and 
the following are equivalent: 

(i) $F = k(\alpha_1, \dots, \alpha_r)$, where each $\alpha_i$ is separable over k; 

(ii) $k \subseteq F$ is separable; 

(iii) $[F : k]_s = [F : k]$. 

Proof. Since F is finite over k, it is finitely generated. Let $F = k(\alpha_1, \dots, \alpha_r)$. 
Then using Lemma 4.23, Lemma 4.22, and Proposition 1.10, 

\[
\begin{aligned}
[F : k]_s &= [k(\alpha_1, \dots, \alpha_{r-1})(\alpha_r) : k(\alpha_1, \dots, \alpha_{r-1})]_s \cdots [k(\alpha_1) : k]_s \\
&\le [k(\alpha_1, \dots, \alpha_{r-1})(\alpha_r) : k(\alpha_1, \dots, \alpha_{r-1})] \cdots [k(\alpha_1) : k] \\
&= [F : k].
\end{aligned}
\]

This proves the stated inequality. 

(i) ⟹ (iii): If each $\alpha_i$ is separable over k, then it is separable over the field 
$k(\alpha_1, \dots, \alpha_{i-1})$ (Exercise 4.15), so the inequality is an equality by Lemma 4.22. 

(iii) ⟹ (ii): Assume $[F : k]_s = [F : k]$, and let $\alpha \in F$. We have $k \subseteq k(\alpha) \subseteq F$; 
hence by Lemma 4.23 

\[ [F : k(\alpha)]_s [k(\alpha) : k]_s = [F : k]_s = [F : k] = [F : k(\alpha)][k(\alpha) : k]. \]

Since both separable degrees are less than or equal to their plain counterparts, the 
equality implies 

\[ [k(\alpha) : k]_s = [k(\alpha) : k], \]

proving that $\alpha$ is separable, by Lemma 4.22. Therefore the extension $k \subseteq F$ is 
separable according to Definition 4.19. 

(ii) ⟹ (i) is immediate from Definition 4.19, since finite extensions are finitely 
generated. □ 

For example, if $\alpha$ is separable over k, then every $\beta \in k(\alpha)$ is separable. This 
would seem rather mysterious from the definition alone: why should the fact that 
the minimal polynomial of $\alpha$ has distinct roots in $\overline{k}$ imply the same for the minimal 
polynomial of $\beta$? It does, by virtue of Proposition 4.24. 

Remark 4.25. While we have only proved $[F : k]_s \le [F : k]$, one can in fact prove 
that $[F : k]_s$ divides $[F : k]$: in fact, $[F : k]_s$ is the degree of a certain intermediate 
field $F_{\mathrm{sep}}$ over k. The diligent reader will prove this in Exercise 4.18. ⌟


Exercises 


4.1. > Let k be a field, $f(x) \in k[x]$, and let F be the splitting field for f(x) over 
k. Let $k \subseteq K$ be an extension such that f(x) splits as a product of linear factors 
over K. Prove that there is a homomorphism $F \to K$ extending the identity on k. 
[§4.2] 

4.2. Describe the splitting field of $x^6 + x^3 + 1$ over $\mathbb{Q}$. Do the same for $x^4 + 4$. 

4.3. > Find the order of the automorphism group of the splitting field of $x^4 + 2$ 
over $\mathbb{Q}$ (cf. Example 4.6). [§4.1] 

4.4. Prove that the field $\mathbb{Q}(\sqrt[4]{2})$ is not the splitting field of any polynomial over $\mathbb{Q}$. 

4.5. > Let F be a splitting field for a polynomial $f(x) \in k[x]$, and let $g(x) \in k[x]$ 
be a factor of f(x). Prove that F contains a unique copy of the splitting field of 
g(x). [§5.1] 

4.6. Let $k \subseteq F_1$, $k \subseteq F_2$ be two finite extensions, viewed as embedded in the 
algebraic closure $\overline{k}$ of k. Assume that $F_1$ and $F_2$ are splitting fields of polynomials 
in $k[x]$. Prove that the intersection $F_1 \cap F_2$ and the composite $F_1 F_2$ (the smallest 
subfield of $\overline{k}$ containing both $F_1$ and $F_2$) are both also splitting fields over k. 
(Theorem 4.8 is likely going to be helpful.) 

4.7. > Let $k \subseteq F = k(\alpha)$ be a simple algebraic extension. Prove that F is normal 
over k if and only if for every algebraic extension $F \subseteq K$ and every $\sigma \in \mathrm{Aut}_k(K)$, 
$\sigma(F) = F$. [§6.1] 

4.8. > Let p be a prime, and let k be a field of characteristic p. For $a, b \in k$, prove 
that$^{16}$ $(a + b)^p = a^p + b^p$. [§4.2, §5.1, §5.2] 

4.9. Using the notion of ‘derivative’ given in §4.2, prove that $(fg)' = f'g + fg'$ for 
all polynomials $f$, $g$. 

4.10. Let $k \subseteq F$ be a finite extension in characteristic $p > 0$. Assume that p does 
not divide $[F : k]$. Prove that $k \subseteq F$ is separable. 

4.11. > Let p be a prime integer. Prove that the Frobenius homomorphism on $\mathbb{F}_p$ 
is the identity. (Hint: Fermat.) [§5.1] 

4.12. > Let k be a field, and assume that k is not perfect. Prove that there are 
inseparable irreducible polynomials in $k[x]$. (If char k = p and $u \in k$, how many 
roots does $x^p - u$ have in $\overline{k}$?) [§4.2] 

4.13. > Let k be a field of positive characteristic p, and let f(x) be an irreducible 
polynomial. Prove that there exist an integer d and a separable irreducible polynomial 
$f_{\mathrm{sep}}(x)$ such that 

\[ f(x) = f_{\mathrm{sep}}(x^{p^d}). \]

The number $p^d$ is called the inseparable degree of f(x). If f(x) is the minimal 
polynomial of an algebraic element $\alpha$, the inseparable degree of $\alpha$ is defined to 
be the inseparable degree of f(x). Prove that $\alpha$ is inseparable if and only if its 
inseparable degree is $\ge p$. 

The picture to keep in mind is as follows: the roots of the minimal polynomial 
f(x) of $\alpha$ are distributed into $\deg f_{\mathrm{sep}}$ ‘clumps’, each collecting a number of 
coincident roots equal to the inseparable degree of $\alpha$. We say that $\alpha$ is ‘purely 
inseparable’ if there is only one clump, that is, if all roots of f(x) coincide (see 
Exercise 4.14). [§4.2, 4.14, 4.18] 

4.14. ¬ Let $k \subseteq F$ be an algebraic extension, in positive characteristic p. An 
element $\alpha \in F$ is purely inseparable over k if $\alpha^{p^d} \in k$ for some$^{17}$ $d \ge 0$. The 
extension is defined to be purely inseparable if every $\alpha \in F$ is purely inseparable 
over k. 

Prove that $\alpha$ is purely inseparable if and only if $[k(\alpha) : k]_s = 1$, if and only if 
its degree equals its inseparable degree (Exercise 4.13). [4.13, 4.17] 

4.15. > Let $k \subseteq F$ be an algebraic extension, and let $\alpha \in F$ be separable over k. 
For every intermediate field $k \subseteq E \subseteq F$, prove that $\alpha$ is separable over E. [§4.3] 

4.16. ¬ Let $k \subseteq E \subseteq F$ be algebraic field extensions, and assume that $k \subseteq E$ is 
separable. Prove that if $\alpha \in F$ is separable over E, then $k \subseteq E(\alpha)$ is a separable 
extension. (Reduce to the case of finite extensions.) 

Deduce that the set of elements of F which are separable over k form an intermediate 
field $F_{\mathrm{sep}}$ such that every element $\alpha \in F$, $\alpha \notin F_{\mathrm{sep}}$ is inseparable over k. 

For $F = \overline{k}$, $\overline{k}_{\mathrm{sep}}$ is called the separable closure of k. [4.17, 4.18] 

16 This is sometimes referred to as the freshman’s dream, for painful reasons that are likely 
all too familiar to the reader. 

17 Note the slightly annoying clash of notation: elements of k are not inseparable, yet they 
are purely inseparable according to this definition. 

4.17. ¬ Let $k \subseteq F$ be an algebraic extension, in positive characteristic. With 
notation as in Exercises 4.14 and 4.16, prove that the extension $F_{\mathrm{sep}} \subseteq F$ is purely 
inseparable. Prove that an extension $k \subseteq F$ is purely inseparable if and only if 
$F_{\mathrm{sep}} = k$. [4.18] 

4.18. > Let $k \subseteq F$ be a finite extension, in positive characteristic. Define the 
inseparable degree $[F : k]_i$ to be the quotient $[F : k]/[F : k]_s$. 

• Prove that $[k(\alpha) : k]_i$ equals the inseparable degree of $\alpha$, as defined in Exercise 4.13. 

• Prove that the inseparable degree is multiplicative: if $k \subseteq E \subseteq F$ are finite 
extensions, then $[F : k]_i = [F : E]_i [E : k]_i$. 

• Prove that a finite extension is purely inseparable if and only if its inseparable 
degree equals its degree. 

• With notation as in Exercise 4.16, prove that $[F : k]_s = [F_{\mathrm{sep}} : k]$ and $[F : k]_i = 
[F : F_{\mathrm{sep}}]$. (Use Exercise 4.17.) 

In particular, $[F : k]_s$ divides $[F : k]$; the inseparable degree $[F : k]_i$ is an integer. 
[§4.3] 

4.19. ¬ Let $k \subseteq F$ be a finite separable extension, and let $\iota_1, \dots, \iota_d$ be the distinct 
embeddings of F in $\overline{k}$ extending $\mathrm{id}_k$. For $\alpha \in F$, prove that the norm $N_{k \subseteq F}(\alpha)$ 
(cf. Exercise 1.12) equals $\prod_{i=1}^{d} \iota_i(\alpha)$ and its trace $\mathrm{tr}_{k \subseteq F}(\alpha)$ (Exercise 1.13) equals 
$\sum_{i=1}^{d} \iota_i(\alpha)$. (Hint: Exercises 1.14 and 1.15.) [4.21, 4.22, 6.15] 

4.20. ¬ Let $k \subseteq F$ be a finite separable extension, and let $\alpha \in F$. Prove that for 
all $\sigma \in \mathrm{Aut}_k(F)$, $N_{k \subseteq F}(\alpha/\sigma(\alpha)) = 1$ and $\mathrm{tr}_{k \subseteq F}(\alpha - \sigma(\alpha)) = 0$. [6.16, 6.19] 

4.21. ¬ Let $k \subseteq E \subseteq F$ be finite separable extensions, and let $\alpha \in F$. Prove that 

\[ N_{k \subseteq F}(\alpha) = N_{k \subseteq E}(N_{E \subseteq F}(\alpha)) \quad \text{and} \quad \mathrm{tr}_{k \subseteq F}(\alpha) = \mathrm{tr}_{k \subseteq E}(\mathrm{tr}_{E \subseteq F}(\alpha)). \]

(Hint: Use Exercise 4.19: if $d = [E : k]$ and $e = [F : E]$, the $de$ embeddings of F 
into $\overline{k}$ lifting $\mathrm{id}_k$ must divide into d groups of e each, according to their restriction 
to E.) 

This ‘transitivity’ of norm and trace extends the result of Exercise 1.15 to 
separable extensions. The separability restriction is actually unnecessary; cf. Exercise 4.22. 
[4.22] 

4.22. Generalize Exercises 4.19-4.21 to all finite extensions $k \subseteq F$. (For the norm, 
raise to power $[F : k]_i$; for the trace, multiply by $[F : k]_i$.) 


5. Field extensions, III 

The material in §4 provides us with the main tools needed to tackle several key 
examples. In this section we study finite and cyclotomic fields, and we return to 
the question of when a finite extension is in fact simple (cf. Example 1.19). 
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5.1. Finite fields. Let F be a finite field, and let p be its characteristic. We know 
(§1.1) that F may be viewed as an extension 

F P CF 

of F p = Z/pZ; let d = [F : F p ]. Since F has dimension d as a vector space over F P , 
it is isomorphic to as a vector space, and in particular |F| = p d is a power of p. 

The general question we pose is whether there exist fields of cardinality equal
to every power of a prime, and we aim to classify all fields of a given cardinality.
The conclusion we will reach is as neat as it can be: for every prime power $q$ there
exists exactly one field $F$ with $q$ elements, up to isomorphism.

Also recall (Remark III.1.16) that, by a theorem of Wedderburn, every finite
division ring is in fact a finite field. Thus, relaxing the hypothesis of commutativity
in this subsection would not lead to a different classification.

Theorem 5.1. Let $q = p^d$ be a power of a prime integer $p$. Then the splitting field
of the polynomial $x^q - x$ over $\mathbb{F}_p$ is a field with precisely $q$ elements.

Conversely, let $F$ be a field with exactly $q$ elements; then $F$ is a splitting field
for $x^q - x$ over $\mathbb{F}_p$. The polynomial $x^q - x$ is separable over $\mathbb{F}_p$.

Proof. Let $F$ be the splitting field of $x^q - x$ over $\mathbb{F}_p$. Let $E$ be the set of roots
of $f(x) = x^q - x$ in $F$. Since $f'(x) = qx^{q-1} - 1 = -1$ (as $q = 0$ in characteristic $p$),
we have $(f(x), f'(x)) = 1$; hence (Lemma 4.13) $f(x)$ is separable, and $E$
consists of precisely $q$ elements. We claim that $E$ is a field, and it follows that
$E = F$. Indeed, $F$ is generated by the roots of $f(x)$; hence the smallest subfield
of $F$ containing $E$ is $F$ itself.

To see that $E$ is a field, let $a, b \in E$. Then $a^q = a$ and $b^q = b$; it follows that

$$(a - b)^q = a^q + (-1)^q b^q = a - b$$

(using Exercise 4.8; note that $(-1)^q = -1$ if $p$ is odd and $(-1)^q = +1 = -1$ if
$p = 2$). If $b \neq 0$,

$$(ab^{-1})^q = a^q (b^q)^{-1} = ab^{-1}.$$

Thus $E$ is closed under subtraction and division by a nonzero element, proving that
$E$ is a field and concluding the proof of the first statement.

To prove the second statement, let $F$ be a field with exactly $q$ elements. The
nonzero elements of $F$ form a group under multiplication, consisting of $q - 1$ elements;
therefore, the (multiplicative) order of every nonzero $a \in F$ divides $q - 1$
(Example II.8.15). Therefore,

$$a \neq 0 \implies a^{q-1} = 1 \implies a^q - a = 0;$$

of course $0^q - 0 = 0$. In other words, the polynomial $x^q - x$ has $q$ roots in $F$ (that
is, all elements of $F$!); it follows that $F$ is a splitting field for $x^q - x$, as stated. □

Corollary 5.2. For every prime power q there exists one and only one finite field 
of order q, up to isomorphism. 


Proof. This follows immediately from Theorem 5.1 and the uniqueness of splitting 
fields (Lemma 4.2). □ 
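
The second half of Theorem 5.1 can be checked by hand on a small case. The sketch below models a field with 9 elements as pairs $(a, b)$ standing for $a + bi$ with $i^2 = -1$ and coefficients mod 3; this particular realization, and the helper names `mul` and `power`, are our own assumptions for illustration and not notation from the text.

```python
# Model F_9 as F_3[i]/(i^2 + 1): the pair (a, b) stands for a + b*i.
# x^2 + 1 has no root mod 3, so it is irreducible and the quotient is a field.
P = 3
Q = P * P

def mul(u, v):
    """Multiply a + b*i by c + d*i, using i^2 = -1 and reducing mod 3."""
    (a, b), (c, d) = u, v
    return ((a * c - b * d) % P, (a * d + b * c) % P)

def power(u, n):
    """Compute u^n by repeated multiplication (n >= 1)."""
    result = u
    for _ in range(n - 1):
        result = mul(result, u)
    return result

elements = [(a, b) for a in range(P) for b in range(P)]

# Every one of the 9 elements should be a root of x^9 - x.
all_roots = all(power(u, Q) == u for u in elements)
print(len(elements), all_roots)
```

Since $x^9 - x$ has at most 9 roots, the check says that the elements of this model are exactly the roots, in agreement with the theorem.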
Since there is exactly one isomorphism class of fields of order $q$ for any given
prime power $q$, we can devise a notation for a field of order $q$: it makes sense to
adopt$^{18}$ $\mathbb{F}_q$. This is called the Galois field of order $q$.

Example 5.3. Let $p$ be a prime integer. Then we claim that the polynomial $x^4 + 1$
is reducible$^{19}$ over $\mathbb{F}_p$ (and therefore over every finite field).

Since $x^4 + 1 = (x + 1)^4$ in $\mathbb{F}_2[x]$, the statement holds for $p = 2$. Thus, we may
assume that $p$ is an odd prime. Then we claim that $x^4 + 1$ divides $x^{p^2} - x$. Indeed,
the square of every odd number is congruent to 1 mod 8 (Exercise II.2.11); hence
$8 \mid (p^2 - 1)$; hence $x^8 - 1$ divides $x^{p^2-1} - 1$ (Exercise V.2.13); hence

$$(x^4 + 1) \mid (x^8 - 1) \mid (x^{p^2-1} - 1) \mid (x^{p^2} - x).$$

It follows that $x^4 + 1$ factors completely in the splitting field of $x^{p^2} - x$, that is,
in $\mathbb{F}_{p^2}$. If $\alpha$ is a root of $x^4 + 1$ in $\mathbb{F}_{p^2}$, we have the extensions

$$\mathbb{F}_p \subseteq \mathbb{F}_p(\alpha) \subseteq \mathbb{F}_{p^2};$$

therefore (Corollary 1.11) $[\mathbb{F}_p(\alpha) : \mathbb{F}_p]$ divides $[\mathbb{F}_{p^2} : \mathbb{F}_p] = 2$. That is, $\alpha$ has degree 1
or 2 over $\mathbb{F}_p$. But then its minimal polynomial is a factor of degree 1 or 2 of $x^4 + 1$,
showing that the latter is reducible. ⌟
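
The reader who wants to see the factorizations promised by Example 5.3 can find them by brute force. The following sketch searches for a splitting of $x^4 + 1$ into two monic quadratics modulo a few small primes; the function name `quadratic_factorization` is hypothetical, and an exhaustive search is of course only sensible for small $p$.

```python
def quadratic_factorization(p):
    """Search for a, b, c, d in F_p with (x^2 + a x + b)(x^2 + c x + d) = x^4 + 1."""
    for a in range(p):
        for b in range(p):
            for c in range(p):
                for d in range(p):
                    # Coefficients of x^3, x^2, x^1, x^0 in the product.
                    coeffs = ((a + c) % p, (b + d + a * c) % p,
                              (a * d + b * c) % p, (b * d) % p)
                    if coeffs == (0, 0, 0, 1):
                        return (a, b), (c, d)
    return None

factorizations = {p: quadratic_factorization(p) for p in (2, 3, 5, 7, 11, 13)}
for p, f in factorizations.items():
    print(p, f)
```

A pair of quadratics always exists: if the minimal polynomial found in the example has degree 1, then $x^4 + 1$ has a root $\alpha$, hence the roots $\pm\alpha, \pm\alpha^{-1}$, and linear factors can be grouped in pairs.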

Theorem 5.1 has many interesting consequences, and we sample a few in the 
rest of this subsection. 

Corollary 5.4. Let $p$ be a prime, and let $d \le e$ be positive integers. Then there is
an extension $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^e}$ if and only if $d \mid e$. Further, if $d \mid e$, then there is exactly
one such extension, in the sense that $\mathbb{F}_{p^e}$ contains a unique copy of $\mathbb{F}_{p^d}$.

All extensions $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^e}$ are simple.

Proof. If there is an extension as stated, then $\mathbb{F}_p \subseteq \mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^e}$; hence $[\mathbb{F}_{p^d} : \mathbb{F}_p]$
divides $[\mathbb{F}_{p^e} : \mathbb{F}_p]$ by Corollary 1.11. This says precisely that $d \mid e$.

Conversely, assume that $d \mid e$. As

$$p^e - 1 = (p^d - 1)\big((p^d)^{e/d - 1} + \cdots + 1\big),$$

we see that $p^d - 1$ divides $p^e - 1$, and consequently $x^{p^d - 1} - 1$ divides $x^{p^e - 1} - 1$
(Exercise V.2.13). Therefore

$$(x^{p^d} - x) \mid (x^{p^e} - x).$$

By Theorem 5.1, $\mathbb{F}_{p^e}$ is a splitting field for the second polynomial. It follows that it
contains a unique copy of the splitting field for the first polynomial (Exercise 4.5),
that is, of $\mathbb{F}_{p^d}$.

For the last statement, recall that the multiplicative group of nonzero elements
of a finite field is necessarily cyclic (Theorem IV.6.10). If $\alpha \in \mathbb{F}_{p^e}$ is a generator of
this group, then $\alpha$ will generate $\mathbb{F}_{p^e}$ over any subfield; if $d \mid e$, this says $\mathbb{F}_{p^e} = \mathbb{F}_{p^d}(\alpha)$,
so $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^e}$ is simple. □
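
The arithmetic at the heart of this proof is easy to test numerically. The sketch below checks, for a few small primes and exponents, that $p^d - 1$ divides $p^e - 1$ exactly when $d$ divides $e$. Only the 'if' direction is used in the proof above; the converse is an elementary fact that we assume here without proof. The helper `divides` is our own name.

```python
def divides(a, b):
    """True if the positive integer a divides b."""
    return b % a == 0

checks = []
for p in (2, 3, 5, 7):
    for e in range(1, 13):
        for d in range(1, e + 1):
            # d | e should hold exactly when (p^d - 1) | (p^e - 1).
            checks.append(divides(d, e) == divides(p**d - 1, p**e - 1))

print(all(checks))
```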

18 Keep in mind that $\mathbb{F}_q$ is not the ring $\mathbb{Z}/q\mathbb{Z}$ unless $q$ is prime: if $q$ is composite, then $\mathbb{Z}/q\mathbb{Z}$
is not an integral domain.

19 Of course $x^4 + 1$ is irreducible over $\mathbb{Z}$. This example shows that Proposition V.5.15 cannot
be turned into an ‘if and only if’ statement.
These results can be translated into rather precise information on the structure
of the polynomial ring over a finite field. For example,

Corollary 5.5. Let $F$ be a finite field. Then for all integers $n \ge 1$ there exist
irreducible polynomials of degree $n$ in $F[x]$.

Proof. We know $F = \mathbb{F}_{p^d}$ for some prime $p$ and some $d \ge 1$. By Corollary 5.4 there
is an extension $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^{dn}}$, generated by an element $\alpha$. Then $[\mathbb{F}_{p^{dn}} : \mathbb{F}_{p^d}] = n$,
and it follows that the minimal polynomial of $\alpha$ over $F = \mathbb{F}_{p^d}$ is an irreducible
polynomial of degree $n$ in $F[x]$. □

In fact, our analysis of extensions of finite fields tells us about explicit
factorizations, leading to an inductive algorithm for the computation of irreducible
polynomials in $\mathbb{F}_q[x]$:

Corollary 5.6. Let $F = \mathbb{F}_q$ be a finite field, and let $n$ be a positive integer. Then
the factorization of $x^{q^n} - x$ in $F[x]$ consists of all irreducible monic polynomials
of degree $d$, as $d$ ranges over the positive divisors of $n$. In particular, all these
polynomials factor completely in $\mathbb{F}_{q^n}$.

Proof. By Theorem 5.1, $\mathbb{F}_{q^n}$ is the splitting field of $x^{q^n} - x$ over $\mathbb{F}_p$, and hence
over $\mathbb{F}_q = F$.

If $f(x)$ is a monic irreducible polynomial of degree $d$, then $F[x]/(f(x)) = F(\alpha)$
is an extension of degree $d$ of $F$, that is, an isomorphic copy of $\mathbb{F}_{q^d}$. By Corollary 5.4,
if $d \mid n$, then there is an embedding of $\mathbb{F}_{q^d}$ in $\mathbb{F}_{q^n}$. But then $\alpha$ must be a root of
$x^{q^n} - x$, and hence $x^{q^n} - x$ is a multiple of $f(x)$, as this is the minimal polynomial
of $\alpha$. This proves that every irreducible polynomial of degree $d \mid n$ is a factor of
$x^{q^n} - x$.

Conversely, if $f(x)$ is an irreducible factor of $x^{q^n} - x$, then $\mathbb{F}_{q^n}$ contains a root
$\alpha$ of $f(x)$; we have the extensions $F = \mathbb{F}_q \subseteq \mathbb{F}_q(\alpha) \subseteq \mathbb{F}_{q^n}$, and $\mathbb{F}_q(\alpha) \cong \mathbb{F}_{q^d}$ for
$d = \deg f(x)$. It follows that $d \mid n$, again by Corollary 5.4. □

The picture we are trying to convey is the following: the $q^n$ roots of $x^{q^n} - x$
clump into disjoint subsets, with each subset collecting the roots of each and every
irreducible polynomial of degree $d \mid n$ in $F[x]$.
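
Comparing degrees in Corollary 5.6 (and recalling that $x^{q^n} - x$ is separable, so no factor repeats) gives $q^n = \sum_{d \mid n} d \cdot N_d$, where $N_d$ denotes the number of monic irreducible polynomials of degree $d$ over $\mathbb{F}_q$. The notation $N_d$ and the function name below are ours, not the text's. This identity can be solved for $N_n$ inductively, as in the following sketch.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def num_irreducible(q, n):
    """Number of monic irreducible polynomials of degree n over a field with q elements."""
    # Total degree contributed by irreducibles of degree d, for proper divisors d of n.
    lower = sum(d * num_irreducible(q, d) for d in range(1, n) if n % d == 0)
    # What is left of q^n is shared among the irreducibles of degree exactly n.
    return (q**n - lower) // n

counts = [num_irreducible(2, n) for n in (1, 2, 3, 6)]
print(counts)  # [2, 1, 2, 9]
```

Note that this only counts the polynomials; it does not produce them.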

Example 5.7. Let’s contemplate the case $q = 2$: $\mathbb{F}_2 = \mathbb{Z}/2\mathbb{Z}$.

• $n = 1$: the polynomial $x^2 - x$ factors as the product of $x$ and $(x - 1)$ (which
we could write as $(x + 1)$ just as well, since we are working over $\mathbb{F}_2$). These are all
the irreducible polynomials of degree 1 over $\mathbb{F}_2$.

• $n = 2$: the polynomial $x^4 - x$ must factor as the product of all irreducible
polynomials of degree 1 and 2; in fact

$$x^4 - x = x(x - 1)(x^2 + x + 1),$$

and the conclusion is that there is exactly one irreducible polynomial of degree 2
over $\mathbb{F}_2$, namely $x^2 + x + 1$.

• $n = 3$: the quotient of $x^8 - x$ by $x(x - 1)$ is a polynomial of degree 6, which
must therefore be the product of the two irreducible polynomials of degree 3 over
$\mathbb{F}_2$. It takes a moment to find them:

$$x^3 + x^2 + 1, \qquad x^3 + x + 1.$$

It also follows that $\mathbb{F}_8$ may be realized in two ways as a quotient of $\mathbb{F}_2[x]$ modulo
an irreducible polynomial:

$$\frac{\mathbb{F}_2[x]}{(x^3 + x^2 + 1)}, \qquad \frac{\mathbb{F}_2[x]}{(x^3 + x + 1)}.$$

We know that these two fields must be isomorphic because of Theorem 5.1; it is
instructive to find an explicit isomorphism between them (Exercise 5.3).

• $n = 4$ and 5: these cases are left to the enjoyment of the reader (Exercise 5.4).

• $n = 6$: the factorization of $x^{64} - x$ must include $x$, $x - 1$, the one irreducible
polynomial of degree 2, and the two irreducible polynomials of degree 3 found above,
and that leaves room for 9 polynomials of degree 6, which must be all and only the
irreducible polynomials of degree 6 over $\mathbb{F}_2$. So the 64 elements of $\mathbb{F}_{64}$ cluster as
follows:

[Figure: the 64 elements of $\mathbb{F}_{64}$ grouped into sectors by minimal polynomial over $\mathbb{F}_2$, with dotted rectangles around the copies of $\mathbb{F}_4$ and $\mathbb{F}_8$.]

The two dotted rectangles delimit the (unique) copies of $\mathbb{F}_4$ and $\mathbb{F}_8$ contained in $\mathbb{F}_{64}$;
these intersect in the (unique) copy of $\mathbb{F}_2$.

Again, $\mathbb{F}_{64}$ may be realized by quotienting $\mathbb{F}_2[x]$ by the ideal generated by any
of the irreducible polynomials of degree 6; this gives 9 ‘different’ realizations of this
field. ⌟
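
The small cases of Example 5.7 can also be confirmed by plain trial division. In the sketch below a polynomial over $\mathbb{F}_2$ is stored as an integer bitmask (bit $i$ is the coefficient of $x^i$); this encoding and the function names are our own choices for illustration, and no attempt is made at efficiency.

```python
def poly_mod(a, b):
    """Remainder of a divided by b in F_2[x]; polynomials are bitmasks."""
    db = b.bit_length() - 1
    while a and a.bit_length() - 1 >= db:
        # Cancel the leading term of a with a shifted copy of b.
        a ^= b << (a.bit_length() - 1 - db)
    return a

def irreducibles(n):
    """All irreducible polynomials of degree n over F_2, found by trial division."""
    found = []
    for f in range(1 << n, 1 << (n + 1)):
        # A reducible f of degree n has a factor of degree between 1 and n // 2.
        if all(poly_mod(f, g) != 0 for g in range(2, 1 << (n // 2 + 1))):
            found.append(f)
    return found

def show(f):
    """Render a bitmask as a polynomial in x."""
    terms = ["1" if i == 0 else "x" if i == 1 else f"x^{i}"
             for i in range(f.bit_length() - 1, -1, -1) if (f >> i) & 1]
    return " + ".join(terms)

cubics = [show(f) for f in irreducibles(3)]
sextics = irreducibles(6)
print(cubics, len(sextics))
```

The two cubics and the count of 9 sextics agree with the discussion above.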

The picture drawn above for $\mathbb{F}_{64}$ means next to nothing, but it may help us focus
on one last element of information we are going to extract regarding finite fields.
Note that the effect of any automorphism of $\mathbb{F}_{64}$ must be to scramble elements in
each sector represented in the picture, without mixing elements in different sectors
and without interchanging sectors. Indeed, every automorphism of an extension
sends roots of an irreducible polynomial to roots of the same polynomial.

Since extensions of finite fields are simple extensions, our previous work allows
us to be much more precise. Restricting our attention to the extensions $\mathbb{F}_p \subseteq \mathbb{F}_{p^d}$,
for a prime $p$, we know these can be realized as simple extensions by an element
with minimal polynomial of degree $d$. This polynomial is necessarily separable
($\mathbb{F}_p$ is perfect), so Corollary 1.7 immediately gives us the size of the automorphism
group:

$$|\mathrm{Aut}_{\mathbb{F}_p}(\mathbb{F}_{p^d})| = d.$$

But what is this group?

Proposition 5.8. $\mathrm{Aut}_{\mathbb{F}_p}(\mathbb{F}_{p^d})$ is cyclic, generated by the Frobenius isomorphism.

Proof. Let $\varphi$ be the Frobenius homomorphism $\mathbb{F}_{p^d} \to \mathbb{F}_{p^d}$: $\varphi(x) = x^p$. The
Frobenius homomorphism is an isomorphism on a finite field (Corollary 4.18) and
restricts to the identity on $\mathbb{F}_p$ (Exercise 4.11), so $\varphi \in \mathrm{Aut}_{\mathbb{F}_p}(\mathbb{F}_{p^d})$. Since we know a
priori the size of this group, all we have to show is that the order of $\varphi$ is $d$.

Let $e = |\varphi|$. Then $\varphi^e = \mathrm{id}$; hence $x^{p^e} = x$ for all $x \in \mathbb{F}_{p^d}$. In other
words, the (nonzero) polynomial $x^{p^e} - x$ has $p^d$ roots in $\mathbb{F}_{p^d}$; this implies $p^d \le p^e$
(Lemma V.5.1). Equivalently, $d \le e$, yielding

$$|\mathrm{Aut}_{\mathbb{F}_p}(\mathbb{F}_{p^d})| \le |\varphi|.$$

But then these two numbers must be equal, since the order of an element always
divides the order of the group (Example II.8.15). □
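
To see Proposition 5.8 at work, one can compute the order of the Frobenius map on a concrete model of $\mathbb{F}_{16}$. The sketch below uses the realization $\mathbb{F}_2[x]/(x^4 + x + 1)$, with elements stored as 4-bit masks; the choice of modulus and all function names are assumptions made for this illustration.

```python
MODULUS = 0b10011  # x^4 + x + 1, irreducible over F_2 (our choice of realization)
D = 4

def mul(a, b):
    """Multiply two elements of F_2[x]/(x^4 + x + 1), stored as 4-bit masks."""
    result = 0
    while b:
        if b & 1:
            result ^= a          # add the current shifted copy of a
        b >>= 1
        a <<= 1                  # multiply a by x ...
        if a & (1 << D):
            a ^= MODULUS         # ... and reduce modulo x^4 + x + 1
    return result

def frobenius(a):
    """The Frobenius map a -> a^2 (here p = 2)."""
    return mul(a, a)

def iterate(f, k, a):
    """Apply f to a, k times."""
    for _ in range(k):
        a = f(a)
    return a

field = range(1 << D)
# Smallest e >= 1 such that the e-th iterate of Frobenius fixes all 16 elements.
order = next(e for e in range(1, D + 1)
             if all(iterate(frobenius, e, a) == a for a in field))
print(order)  # 4
```

The order equals $d = 4$, so the Frobenius map generates the whole automorphism group of this field over $\mathbb{F}_2$.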

Two last comments on finite fields are in order: 

• For $n$ large enough, the stupendously large (but finite) field $\mathbb{F}_{p^{n!}}$ works as an
approximation of the algebraic closure $\overline{\mathbb{F}}_p$: indeed, this field contains roots of all
polynomials of degree $\le n$ in $\mathbb{F}_p[x]$, by Corollary 5.6.

This observation can be turned into the explicit construction of an algebraic
closure of $\mathbb{F}_p$: by Corollary 5.4 there are extensions

$$\mathbb{F}_p \subseteq \mathbb{F}_{p^{2!}} \subseteq \mathbb{F}_{p^{3!}} \subseteq \mathbb{F}_{p^{4!}} \subseteq \cdots$$

and the union of this chain of fields$^{20}$ gives a copy of $\overline{\mathbb{F}}_p$.

• Finite fields form a category, which (by Corollary 5.4) should strongly remind
the reader of the ‘toy’ category encountered in Exercise I.5.6. There is an important
difference, however: in that category every object has exactly one automorphism,
while we have just checked that finite fields may have many automorphisms.
Challenge: The reader has encountered a very similar situation in a different (but
related) context. Where?

5.2. Cyclotomic polynomials and fields. In the last several sections we have 
often encountered roots of 1 in C; the extensions they generate are very important. 

Let $n$ be a positive integer. We will denote by $\zeta_n$ the complex number $e^{2\pi i/n}$;
thus, the $n$ roots of the polynomial $x^n - 1$ in $\mathbb{C}$ are the $n$ distinct powers of $\zeta_n$;
they form a cyclic subgroup of order $n$ of the multiplicative group of $\mathbb{C}$, which we
will denote $\mu_n$.

Pictorially, the roots of 1 are placed at the vertices of a regular n-gon centered 
at 0, with one vertex at 1. 


20 A more formal construction requires the notion of direct limit; cf. §VIII.1.4. 
[Figure: the $n$-th roots of 1 on the unit circle, with $\zeta_n$ marked.]



A primitive $n$-th root of 1 is a generator of $\mu_n$. Thus, $\zeta_n$ is primitive; from our
work on cyclic groups, we know (Corollary II.2.5) that $\zeta_n^m$ is a primitive $n$-th root
of 1 if and only if $m$ is relatively prime to $n$. In particular, there are $\phi(n)$ primitive
$n$-th roots of 1, where $\phi$ is Euler’s totient function (cf. Exercise II.6.14). Thus,

$$\Phi_n(x) := \prod_{\zeta \text{ primitive } n\text{-th root of } 1} (x - \zeta) = \prod_{1 \le m \le n,\ (m,n) = 1} (x - \zeta_n^m)$$

is a polynomial of degree $\phi(n)$.

Definition 5.9. The polynomial $\Phi_n(x)$ is called the $n$-th cyclotomic polynomial. ⌟

It is clear that $\Phi_n(x)$ is a monic polynomial of degree $\phi(n)$. It is perhaps a
little less immediate that $\Phi_n(x)$ has rational (in fact, integer) coefficients and that
it is irreducible over $\mathbb{Q}$. We will prove these facts in short order.

Example 5.10. If $n = p$ is prime, then every nonidentity element of $\mu_p \cong C_p$ is a
generator: every $p$-th root of 1 is primitive except 1 itself. Therefore

$$\Phi_p(x) = \frac{x^p - 1}{x - 1} = x^{p-1} + x^{p-2} + \cdots + x + 1$$

is the particular case encountered in Example V.5.19, where we proved that $\Phi_p(x)$
is indeed irreducible. ⌟

What if n is not prime? 

Lemma 5.11. For all positive integers $n$,

$$x^n - 1 = \prod_{1 \le d \mid n} \Phi_d(x).$$

Proof. If $n = de$, then every $d$-th root $\zeta$ of 1 is an $n$-th root of 1, because
$\zeta^n = \zeta^{de} = (\zeta^d)^e = 1$. In particular, every primitive $d$-th root $\zeta$ of 1 is an $n$-th root of 1.

On the other hand, every $\zeta \in \mu_n$ generates a subgroup $H$ of $\mu_n$, and $H = \mu_d$ for
$d$ equal to the order of $\zeta$, a divisor of $n$ (Proposition II.6.11). Thus, every $\zeta \in \mu_n$
is a primitive $d$-th root of 1 for some $d \mid n$.

Thus the set of $n$-th roots of 1 equals the union of the sets of primitive $d$-th roots
of 1, as $d$ ranges over all positive divisors of $n$. The statement follows immediately:

$$x^n - 1 = \prod_{\zeta \in \mu_n} (x - \zeta) = \prod_{1 \le d \mid n} \Big( \prod_{\zeta \text{ primitive } d\text{-th root of } 1} (x - \zeta) \Big) = \prod_{1 \le d \mid n} \Phi_d(x),$$

as claimed. □

This argument brings us back to simple group-theoretic considerations. The
reader can check that Lemma 5.11 implies immediately the result of Exercise II.6.14,
by comparing degrees of both sides of the stated identity.
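
Comparing degrees in Lemma 5.11 amounts to the identity $n = \sum_{d \mid n} \phi(d)$. A quick numerical sanity check of this identity is sketched below; it is not a proof, and the naive implementation of `phi` is our own.

```python
from math import gcd

def phi(n):
    """Euler's totient: the number of 1 <= m <= n with gcd(m, n) = 1."""
    return sum(1 for m in range(1, n + 1) if gcd(m, n) == 1)

# Degree of x^n - 1 versus the total degree of the product of Phi_d over d | n.
degree_check = all(sum(phi(d) for d in range(1, n + 1) if n % d == 0) == n
                   for n in range(1, 201))
print(degree_check)
```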

Lemma 5.11 yields an inductive computation of cyclotomic polynomials; the
fact that $\Phi_n(x) \in \mathbb{Z}[x]$ follows from this fact. Explicitly,

Corollary 5.12. The cyclotomic polynomials $\Phi_n(x)$ have integer coefficients.


Proof. Use induction on $n$. Note that $\Phi_1(x) = x - 1$, and assume we have
shown that all $\Phi_m(x)$ have integer coefficients for $m < n$. In particular,
$f(x) := \prod_{1 \le d \mid n,\ d < n} \Phi_d(x)$ is a monic polynomial with integer coefficients. Since $f(x)$ is
monic, we can divide it into $x^n - 1$ with remainder, within $\mathbb{Z}[x]$: $\exists\, q(x), r(x) \in \mathbb{Z}[x]$
such that

$$x^n - 1 = f(x)q(x) + r(x),$$

with $r(x) = 0$ or $\deg r(x) < \deg f(x)$. On the other hand, by Lemma 5.11,

$$x^n - 1 = f(x)\Phi_n(x)$$

in $\mathbb{C}[x]$. Therefore

$$f(x)(\Phi_n(x) - q(x)) = r(x)$$

in $\mathbb{C}[x]$. But this forces $r(x) = 0$ (otherwise we would have $\deg r(x) \ge \deg f(x)$).
Therefore $\Phi_n(x) = q(x) \in \mathbb{Z}[x]$. □


Example 5.13. The reader can spend some quality time computing explicitly the
cyclotomic polynomials for several nonprime numbers $n$, working inductively
and capitalizing on the fact that we know explicitly $\Phi_p(x)$ for prime $p$.

For example, $x^4 - 1 = \Phi_1(x)\Phi_2(x)\Phi_4(x)$; therefore

$$\Phi_4(x) = \frac{x^4 - 1}{x^2 - 1} = x^2 + 1.$$

Since $x^6 - 1 = \Phi_1(x)\Phi_2(x)\Phi_3(x)\Phi_6(x)$,

$$\Phi_6(x) = \frac{x^6 - 1}{(x^2 - 1)(x^2 + x + 1)} = x^2 - x + 1.$$

Since $x^{12} - 1 = \Phi_1(x)\Phi_2(x)\Phi_3(x)\Phi_4(x)\Phi_6(x)\Phi_{12}(x)$,

$$\Phi_{12}(x) = \frac{x^{12} - 1}{(x^6 - 1)(x^2 + 1)} = x^4 - x^2 + 1,$$

and so on ad libitum. ⌟


The optimistic reader may now start guessing that the coefficients of $\Phi_n(x)$
are always 0 or $\pm 1$ (so would we, if we didn’t know better). Apparently the first
counterexample is found for $n = 105$ (Exercise 5.9), and the coefficients are known
to get as large as one pleases for $n \gg 0$.
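
The inductive procedure of Corollary 5.12 and Example 5.13 is easy to mechanize. The sketch below stores an integer polynomial as a sequence of coefficients, constant term first, and obtains $\Phi_n(x)$ by dividing $x^n - 1$ by $\Phi_d(x)$ for every proper divisor $d$ of $n$; the representation and the function names are our own conventions, chosen only for illustration.

```python
from functools import lru_cache

def poly_divide(num, den):
    """Exact division of integer polynomials (constant term first); den must be monic."""
    num = list(num)
    quotient = [0] * (len(num) - len(den) + 1)
    for i in range(len(quotient) - 1, -1, -1):
        # den is monic, so the next quotient coefficient is the current leading term.
        quotient[i] = num[i + len(den) - 1]
        for j, c in enumerate(den):
            num[i + j] -= quotient[i] * c
    assert not any(num), "division was not exact"
    return quotient

@lru_cache(maxsize=None)
def cyclotomic(n):
    """Phi_n(x): divide x^n - 1 by Phi_d(x) for each proper divisor d of n."""
    poly = [-1] + [0] * (n - 1) + [1]  # x^n - 1
    for d in range(1, n):
        if n % d == 0:
            poly = poly_divide(poly, cyclotomic(d))
    return tuple(poly)

print(cyclotomic(12))        # (1, 0, -1, 0, 1), that is, x^4 - x^2 + 1
print(min(cyclotomic(105)))  # -2: a coefficient outside {0, 1, -1}
```

All divisions stay within $\mathbb{Z}[x]$ because each divisor is monic, which is exactly the point of the proof of Corollary 5.12.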

The irreducibility of $\Phi_n(x)$ is a bit trickier, in particular because separability
sneaks into the standard argument (which is why we had to wait until now to
present it).

Proposition 5.14. For all positive $n$, $\Phi_n(x) \in \mathbb{Z}[x]$ is irreducible over $\mathbb{Q}$.

(Since $\Phi_n(x)$ is monic, irreducibility in $\mathbb{Q}[x]$ is equivalent to irreducibility in
$\mathbb{Z}[x]$; cf. Corollary V.4.17.)

Proof. Arguing by contradiction, assume $\Phi_n(x)$ is reducible. Then its roots $\zeta_n^m$,
with $(m, n) = 1$, are divided among the factors: we can choose a root $\zeta_n^m$ of one
irreducible monic factor $f(x)$, such that another root $\zeta_n^{mp}$ (for some prime $p$ not
dividing $n$) is not a root of $f(x)$. Write

$$\Phi_n(x) = f(x)g(x):$$

since $\Phi_n(x) \in \mathbb{Z}[x]$ and $\Phi_n(x)$, $f(x)$ are monic, then $f(x)$ and $g(x)$ have integer
coefficients (cf. Exercise V.4.23). By our choice, $f(x)$ is the minimal polynomial
of $\zeta_n^m$ over $\mathbb{Q}$, and $g(\zeta_n^{mp}) = 0$.

It follows that $\zeta_n^m$ is a root of $g(x^p)$, and hence $f(x) \mid g(x^p)$. Therefore we can
write

$$g(x^p) = f(x)h(x)$$

with $h(x) \in \mathbb{Z}[x]$. Reading the last equation modulo $p$, we get (again using Exercise 4.8,
and denoting cosets by underlining)

$$\underline{g}(x)^p = \underline{f}(x)\underline{h}(x) \quad\text{in } \mathbb{F}_p[x];$$

in particular, $\underline{f}(x)$ and $\underline{g}(x)$ must have a nontrivial common factor $\underline{\ell}(x)$ in $\mathbb{F}_p[x]$.
But then

$$\underline{\ell}(x)^2 \mid \underline{f}(x)\underline{g}(x):$$

the reduction of $\Phi_n(x)$ modulo $p$ must have a multiple factor.

This implies that $x^n - 1 \in \mathbb{F}_p[x]$ has a multiple factor; that is, it is inseparable.
However, its derivative $nx^{n-1} \in \mathbb{F}_p[x]$ is nonzero (because $p$ does not divide $n$ by
assumption) and has no root in common with $x^n - 1$, and Lemma 4.13 implies that
$x^n - 1$ is separable in $\mathbb{F}_p[x]$.

This contradiction shows that our assumption that $\Phi_n(x)$ is reducible must be
nonsense, proving the statement. □

Definition 5.15. The splitting field $\mathbb{Q}(\zeta_n)$ for the polynomial $x^n - 1$ over $\mathbb{Q}$ is the
$n$-th cyclotomic field. ⌟

By Proposition 5.14, $\mathbb{Q}(\zeta_n)$ is an extension of $\mathbb{Q}$ of degree $\phi(n)$: $\Phi_n(x)$ is the
minimal polynomial of $\zeta_n$.

As usual, we now preoccupy ourselves with the automorphism group of this 
extension, and as in previous examples the situation is very neat. 

Proposition 5.16. $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ is isomorphic to the group of units in $\mathbb{Z}/n\mathbb{Z}$.

Proof. We know that $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ has cardinality $\phi(n)$ (Corollary 1.7; the roots
are distinct since $\Phi_n(x)$ is separable), so all we need to do is exhibit an injective
homomorphism

$$j : (\mathbb{Z}/n\mathbb{Z})^* \to \mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n)).$$

For $[m]_n \in (\mathbb{Z}/n\mathbb{Z})^*$, that is, for $m$ an integer relatively prime to $n$, let $j([m]_n)$ be
the unique automorphism $\mathbb{Q}(\zeta_n) \to \mathbb{Q}(\zeta_n)$ sending $\zeta_n$ to $\zeta_n^m$ (cf. Proposition 1.5);
this is evidently independent of the representative $m$. Then $j$ is clearly injective,
and

$$j([m_1]_n) \circ j([m_2]_n)(\zeta_n) = j([m_1]_n)(\zeta_n^{m_2}) = \zeta_n^{m_1 m_2}$$

agrees with the action of $j([m_1]_n [m_2]_n)$, showing that $j$ is a homomorphism and
concluding the proof. □

Example 5.17. The reader should consider Example 4.4 again: by what we have
just proved, the automorphism group of the splitting field of $x^8 - 1$ is isomorphic
to the group of units in $\mathbb{Z}/8\mathbb{Z}$; this group is immediately seen to be isomorphic to
$(\mathbb{Z}/2\mathbb{Z}) \times (\mathbb{Z}/2\mathbb{Z})$, confirming the claim made in Example 4.4. ⌟
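
The claim about the units of $\mathbb{Z}/8\mathbb{Z}$ takes one line to verify: a group of order 4 in which every element squares to the identity cannot be cyclic, so it must be $(\mathbb{Z}/2\mathbb{Z}) \times (\mathbb{Z}/2\mathbb{Z})$. The sketch below lists the units and their squares; the helper `units` is our own name.

```python
from math import gcd

def units(n):
    """Representatives of the units of Z/nZ."""
    return [m for m in range(1, n + 1) if gcd(m, n) == 1]

u8 = units(8)
squares = {m: (m * m) % 8 for m in u8}
print(u8, squares)  # [1, 3, 5, 7] {1: 1, 3: 1, 5: 1, 7: 1}
```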

The constructibility of the regular $n$-gon by straightedge and compass is of
course equivalent to the constructibility of the complex number $\zeta_n$. Since we now
know that $[\mathbb{Q}(\zeta_n) : \mathbb{Q}] = \phi(n)$, Corollary 3.6 tells us that the regular $n$-gon is
constructible by straightedge and compass only if $\phi(n)$ is a power of 2. This condition
can be chased a little further (Exercise 5.19). In any case, we shall return to this
theme one more time, after we absorb a little Galois theory.
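
The necessary condition just obtained is easy to tabulate. The sketch below lists the values $3 \le n \le 20$ for which $\phi(n)$ is a power of 2. Keep in mind that at this stage we only know that the other values of $n$ are excluded; nothing proved so far says that the listed polygons are in fact constructible. The function names are our own.

```python
from math import gcd

def phi(n):
    """Euler's totient function, computed naively."""
    return sum(1 for m in range(1, n + 1) if gcd(m, n) == 1)

def is_power_of_two(k):
    """True for 1, 2, 4, 8, ..."""
    return k > 0 and k & (k - 1) == 0

candidates = [n for n in range(3, 21) if is_power_of_two(phi(n))]
print(candidates)  # [3, 4, 5, 6, 8, 10, 12, 15, 16, 17, 20]
```

In particular the regular 7-gon, 9-gon, and 11-gon are not constructible.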

5.3. Separability and simple extensions. All the examples of field extensions 
we have encountered so far were simple extensions, although sometimes this may 
not be completely apparent at first (cf. Example 1.19). There is a good reason for 
this: most of the examples we have seen were separable extensions, and we can now 
prove that a finite separable extension is necessarily simple. 

Here is a nice criterion for simplicity (which does not involve any separability 
condition): 

Proposition 5.18. An algebraic extension $k \subseteq F$ is simple if and only if the
number of distinct intermediate fields $k \subseteq E \subseteq F$ is finite.

Proof. Assume $F = k(\alpha)$ is simple and algebraic, and let $q_k(x)$ be the minimal
polynomial of $\alpha$ over $k$. Embed $F$ in an algebraic closure $\bar{k}$ of $k$. If $E$ is an
intermediate field, then $F = E(\alpha)$ is also a simple, algebraic extension; denote
by $q_E(x)$ the minimal polynomial of $\alpha$ over $E$. Since $q_k(x) \in E[x]$ for all $E$ and
$q_k(\alpha) = 0$, we know that each $q_E(x)$ is a factor of $q_k(x)$.

We claim that $E$ is in fact determined by $q_E(x)$. Since $q_k(x)$ has finitely many
monic factors in $\bar{k}[x]$, this proves that there are only finitely many intermediate fields, that
is, the ‘only if’ part of the statement.

To verify our claim, we will show that $E$ is generated (over $k$) by the coefficients
of $q_E(x)$. To see this, let $E'$ be the subfield of $E$ generated by $k$ and the coefficients
of $q_E(x)$. Then $q_E(x) \in E'[x]$, and since $q_E(x)$ is irreducible over $E$, it must be
irreducible over $E'$. Note that $E'(\alpha) = F = E(\alpha)$. We have $E' \subseteq E \subseteq F$; therefore

$$\deg(q_E(x)) = [F : E'] = [F : E][E : E'] = \deg(q_E(x))[E : E'],$$

from which $[E : E'] = 1$; that is, $E = E'$ as needed. This concludes the proof of
the ‘only if’ part of the statement.

To prove the ‘if’ part, assume that there are only finitely many intermediate
fields $k \subseteq E \subseteq F$. The extension $k \subseteq F$ must be finitely generated (otherwise
we could construct an infinite sequence of subextensions $k \subsetneq k(\alpha_1) \subsetneq k(\alpha_1, \alpha_2) \subsetneq
\cdots \subseteq F$, and we would have infinitely many intermediate fields), hence finite since
it is algebraic. If $k$ is a finite field, then so is $F$, and the extension is automatically
simple by Corollary 5.4; thus we may assume that $k$ is infinite, and we have to show
that every finitely generated algebraic extension $F = k(\alpha_1, \dots, \alpha_r)$ is simple.

Arguing inductively, we may assume without loss of generality that $F = k(\alpha, \beta)$.
For every $c \in k$, we have the intermediate field

$$k \subseteq k(c\alpha + \beta) \subseteq k(\alpha, \beta).$$

But there are only finitely many intermediate fields, while $k$ is infinite, so for some
$c' \neq c$ in $k$ we must have

$$k(c'\alpha + \beta) = k(c\alpha + \beta).$$

It follows that

$$\alpha = \frac{(c'\alpha + \beta) - (c\alpha + \beta)}{c' - c} \in k(c\alpha + \beta) \quad\text{and}\quad \beta = (c\alpha + \beta) - c\alpha \in k(c\alpha + \beta)$$

and therefore $k(\alpha, \beta) \subseteq k(c\alpha + \beta)$. The other inclusion holds trivially, so $k(\alpha, \beta) =
k(c\alpha + \beta)$ is simple. □

Here is the connection with separability: 

Proposition 5.19. Every finite separable extension is simple. 


Proof. Arguing inductively as in the proof of Proposition 5.18, we may assume
$F = k(\alpha, \beta)$, with $\alpha$ and $\beta$ separable (and in particular algebraic) over $k$, and we
may assume $k$ is an infinite field.

Consider the set $I$ of embeddings $\iota : F \hookrightarrow \bar{k}$ of $F$ in an algebraic closure of
$k$, extending the identity of $k$. If $\iota \neq \iota'$ in $I$ and $x$ is an indeterminate, then the
polynomials

$$\iota(\alpha)x + \iota(\beta), \qquad \iota'(\alpha)x + \iota'(\beta)$$

are different: otherwise $\iota'(\alpha) = \iota(\alpha)$ and $\iota'(\beta) = \iota(\beta)$, so $\iota$, $\iota'$ would act in the same
way on the whole of $k(\alpha, \beta)$, forcing $\iota = \iota'$.

Therefore, the polynomial

$$f(x) = \prod_{\iota \neq \iota'} \big((\iota(\alpha)x + \iota(\beta)) - (\iota'(\alpha)x + \iota'(\beta))\big) \in \bar{k}[x]$$

is not identically 0. Since $k$ is infinite, it follows that $\exists\, c \in k$ such that $f(c) \neq 0$;
that is, distinct $\iota \in I$ map

$$\gamma = c\alpha + \beta$$

to distinct elements

$$\iota(\gamma) = \iota(\alpha)c + \iota(\beta)$$

(each $\iota$ extends $\mathrm{id}_k$, so $\iota(c) = c$). Since the cardinality of $I$ is $[F : k]_s$ (Definition 4.21)
and each $\iota(\gamma)$ is a root of the minimal polynomial of $\gamma$ over $k$, we have

$$[F : k]_s \le [k(\gamma) : k] \le [F : k].$$

By assumption the extension is separable, so $[F : k]_s = [F : k]$ by Proposition 4.24,
implying $[k(\gamma) : k] = [F : k]$ and finally

$$F = k(\gamma),$$

concluding the proof. □
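
A classical instance of this statement is $\mathbb{Q}(\sqrt{2}, \sqrt{3}) = \mathbb{Q}(\sqrt{2} + \sqrt{3})$, that is, $\gamma = c\alpha + \beta$ with $c = 1$. The sketch below takes for granted that $1, \sqrt{2}, \sqrt{3}, \sqrt{6}$ form a basis of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ over $\mathbb{Q}$ (an assumption, not proved here), writes elements as coordinate 4-tuples in that basis, and checks that $1, \gamma, \gamma^2, \gamma^3$ are linearly independent. All names in the code are ours.

```python
from fractions import Fraction

def mul(u, v):
    """Product in Q(sqrt2, sqrt3), coordinates in the basis 1, sqrt2, sqrt3, sqrt6."""
    a0, a1, a2, a3 = u
    b0, b1, b2, b3 = v
    return (a0*b0 + 2*a1*b1 + 3*a2*b2 + 6*a3*b3,       # rational part
            a0*b1 + a1*b0 + 3*(a2*b3 + a3*b2),         # sqrt3 * sqrt6 = 3 sqrt2
            a0*b2 + a2*b0 + 2*(a1*b3 + a3*b1),         # sqrt2 * sqrt6 = 2 sqrt3
            a0*b3 + a3*b0 + a1*b2 + a2*b1)             # sqrt2 * sqrt3 = sqrt6

def rank(rows):
    """Rank of a matrix over Q, by Gaussian elimination with exact fractions."""
    rows = [[Fraction(x) for x in r] for r in rows]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

gamma = (0, 1, 1, 0)                 # sqrt2 + sqrt3
powers = [(1, 0, 0, 0)]              # gamma^0
for _ in range(3):
    powers.append(mul(powers[-1], gamma))

degree = rank(powers)
print(powers, degree)
```

Rank 4 means that $\gamma$ has degree at least 4 over $\mathbb{Q}$, so $\mathbb{Q}(\gamma)$ fills up the whole degree-4 extension.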

Proposition 5.19 is a good illustration of why separability is a technically
desirable condition. To stress the point even further, note that we are now in a position
to extend the convenient inequality found in Corollary 1.7 to all finite separable
extensions:

Corollary 5.20. Let $k \subseteq F$ be a finite, separable extension. Then

$$|\mathrm{Aut}_k(F)| \le [F : k],$$

with equality if and only if $k \subseteq F$ is a normal extension.

Proof. Since $k \subseteq F$ is finite and separable, Proposition 5.19 implies it is simple:
$F = k(\alpha)$ for some $\alpha \in F$. The inequality follows immediately from Corollary 1.7,
and equality holds if and only if the minimal polynomial $f(x)$ of $\alpha$ factors into
distinct linear factors in $F$. In this case $F$ is the splitting field of $f(x)$, so it
is normal over $k$ by Theorem 4.8. Conversely, if $F$ is normal over $k$, then $f(x)$
splits completely in $F$ and has distinct roots since $\alpha$ is separable over $k$; therefore
$|\mathrm{Aut}_k(F)| = [F : k]$, again by Corollary 1.7. □

Example 5.21. By Proposition 5.19, if we want to construct a nonsimple finite
extension, we have to use inseparable elements; to produce an example, we jazz up
Example 4.11 a little. Consider the field of rational functions $F = \mathbb{F}_p(u, v)$ in two
variables over $\mathbb{F}_p$ and the subfield $k = \mathbb{F}_p(s, t)$, where $s = u^p$ and $t = v^p$. Then
$k \subseteq F$ is an algebraic extension; the minimal polynomial of $u$ over $k$ is $x^p - s$; the
minimal polynomial of $v$ over $k(u)$ is $y^p - t$; it follows that

$$[F : k] = p^2.$$

As $c$ ranges in $k$, we obtain intermediate fields

$$k \subseteq k(cu + v) \subseteq F.$$

If any two choices $c$, $c'$ led to the same intermediate field, then we would deduce
that $k(u, v) = k(cu + v)$ by arguing as in the proof of Proposition 5.18. In particular,
we would have

$$[k(cu + v) : k] = p^2.$$

But this is not the case, since $(cu + v)^p = c^p s + t \in k(s, t) = k$; hence the minimal
polynomial of $cu + v$ over $k$ has degree at most $p$. Since $k = \mathbb{F}_p(s, t)$ is infinite,
there are infinitely many intermediate fields, and it follows (by Proposition 5.18)
that $F$ is not a simple extension of $k$. ⌟
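
The computation $(cu + v)^p = c^p u^p + v^p$ used in this example rests on the fact that the binomial coefficients $\binom{p}{i}$ with $0 < i < p$ vanish in characteristic $p$ (cf. Exercise 4.8). A minimal numerical check follows; the function name is our own, and two composite values are included for contrast.

```python
from math import comb

def middle_binomials_vanish(p):
    """True if p divides C(p, i) for every 0 < i < p."""
    return all(comb(p, i) % p == 0 for i in range(1, p))

results = {n: middle_binomials_vanish(n) for n in (2, 3, 5, 7, 11, 4, 6)}
print(results)
```

The check succeeds for the primes and fails for 4 and 6, which is why the argument is specific to prime characteristic.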

Exercises


5.1. Let $p > 0$ be a prime integer. Prove that the additive group of a finite field
with $p^d$ elements is isomorphic to $(\mathbb{Z}/p\mathbb{Z})^d$.

5.2. Prove that every element of a finite field F is a sum of two squares in F. (Hint: 
Keep in mind that the multiplicative group of F is cyclic.) 

5.3. ▷ Find an explicit isomorphism

$$\frac{\mathbb{F}_2[x]}{(x^3 + x^2 + 1)} \xrightarrow{\ \sim\ } \frac{\mathbb{F}_2[x]}{(x^3 + x + 1)}.$$

[§5.1]

5.4. ▷ Find all irreducible polynomials of degree 4 over $\mathbb{F}_2$, and count those of
degree 5. [§5.1]

5.5. Find the number of irreducible polynomials of degree 12 over Fg. 

5.6. Write out explicitly the action of the cyclic group $C_4$ on $\mathbb{F}_{16}$, in terms of any
realization of this field as a quotient of $\mathbb{F}_2[x]$.

5.7. Let $p$ be a prime integer. View the Frobenius automorphism $\varphi : \mathbb{F}_{p^d} \to \mathbb{F}_{p^d}$ as
a linear transformation of the $\mathbb{F}_p$-vector space $\mathbb{F}_{p^d}$. Find the rational canonical form
of $\varphi$. (Adapt the proof of Proposition 5.8 to show that the minimal polynomial of
$\varphi$ is $x^d - 1$.)

5.8. For a prime $p$, find the factorization of $\Phi_p(x)$ over $\mathbb{F}_p$.

5.9. ▷ Find all cyclotomic polynomials $\Phi_n(x)$ for $1 \le n \le 15$. Compute $\Phi_{105}(x)$.
[§5.2]

5.10. Find the cyclotomic polynomials $\Phi_{2^m}(x)$ for all $m \ge 0$.

5.11. Prove that if $n$ is odd, then $\Phi_{2n}(x) = \Phi_n(-x)$. (Hint: Draw the primitive
14-th roots of 1 side-by-side to the primitive 7-th roots of 1; then go back to
Exercise II.2.15 to justify the fact you observe.)

5.12. ¬ Let $a$, $n$ be positive integers, with $a > 1$. Prove that if $\Phi_n(a)$ divides $a - 1$,
then $n = 1$. (Remember that $\Phi_n(a)$ is a product of complex numbers $a - \zeta$, where
$\zeta$ is a primitive $n$-th root of 1. What does this tell you about the size of $\Phi_n(a)$?)
[5.14]

5.13. ¬ Let $a$, $d$, $n$ be positive integers, with $d < n$ and $a > 1$. Assume that $a^d - 1$
divides $a^n - 1$. Prove that $\Phi_n(a)$ divides the quotient $(a^n - 1)/(a^d - 1)$. (Hint:
Exercise V.2.13.) [5.14]

5.14. ▷ Let $R$ be a finite division ring.

• Prove that the center of $R$ (Exercise III.2.9) is isomorphic to $\mathbb{F}_q$, for $q$ a prime
power. Prove that $|R| = q^n$ for some $n$.

• For every $r \in R$, prove that the centralizer of $r$ in the multiplicative group $(R^*, \cdot)$
has order $q^d - 1$ for some $d \le n$.

• Prove that there are integers $d_1, \dots, d_r < n$ such that

$$(*) \qquad q^n - 1 = q - 1 + \sum_{i=1}^{r} \frac{q^n - 1}{q^{d_i} - 1}.$$

(Hint: Class equation.)

• Deduce that $\Phi_n(q)$ divides $q - 1$ and hence $n = 1$. (Use Exercises 5.12 and 5.13.)

• Conclude that $R$ equals its center, showing that $R$ is commutative.

Thus, every finite division ring is a field: this is Wedderburn’s little theorem. The
argument given here is due to Ernst Witt. [§III.1.2]

5.15. ▷ Let $a$, $p$, $n$ be integers, with $p$, $n$ positive and $p$ prime, $p \nmid n$.

• Show that $x^n - 1$ has no multiple roots modulo $p$.

• Show that if $p$ divides $\Phi_n(a)$, then $a^n \equiv 1$ modulo $p$. (In particular, $p \nmid a$, so
$[a]_p \in (\mathbb{Z}/p\mathbb{Z})^*$.)

• Show that if $p$ divides $\Phi_n(a)$, then $a^d \not\equiv 1$ modulo $p$ for every $d < n$.

• Deduce that $p \mid \Phi_n(a)$ if and only if the order of $[a]_p$ in $(\mathbb{Z}/p\mathbb{Z})^*$ is $n$.

• Compute $\Phi_{15}(9)$, and show it is divisible by 31. Then look back at the first part
of Exercise II.4.12.

[§II.4.3, 5.16, 5.17]

5.16. If $q$ is a prime that divides $a^n - 1$ and does not divide $a^d - 1$ for any $d < n$,
then $q$ is said to be a primitive prime divisor of $a^n - 1$. By Exercise 5.15, if $q \nmid n$
and $q \mid \Phi_n(a)$, then $q$ is a primitive prime divisor of $a^n - 1$. The Birkhoff-Vandiver
theorem asserts that $a^n - 1$ has primitive prime divisors for all but a very short list
of exceptions: $n = 1$, $a = 2$; $n = 2$, $a + 1$ a power of 2; and $n = 6$, $a = 2$.

Assuming this statement, we can give another proof of Wedderburn’s theorem
(published in 2003 by Nicolas Lichiardopol). Let $R$ be a finite division ring.

• Prove that there is a prime $p$ such that $|R| = p^n$ for some integer $n$.

• If $n = 1$, then $R \cong \mathbb{F}_p$. If $n = 2$ or if $p = 2$ and $n = 6$, then $R$ is commutative:
the reader has proved this in Exercises III.2.11 and IV.2.17.

• Therefore, by Birkhoff-Vandiver we may assume that $p^n - 1$ has a primitive
prime divisor $p'$.

• Prove that there is an element $a$ of order $p'$ in the multiplicative group $(R^*, \cdot)$
of $R$.

• Prove that $R$ is the only sub-division ring of $R$ containing $a$.

• Prove that the centralizer of $a$ in $R$ (Exercise III.2.10) is $R$ itself; deduce that $a$
is in the center of $R$ (Exercise III.2.9).

• Prove that the center of $R$ is $R$: this shows that $R$ is commutative.


Thus, $R$ is a finite field.

Note that the Birkhoff-Vandiver theorem gives another rapid way to conclude
Witt’s proof: a primitive prime divisor of $q^n - 1$ would divide $q - 1$ by (*), showing
that $n = 1$.

5.17. ¬ Let $a$, $p$, $n$ be integers, with $p$, $n$ positive and $p$ prime, $p \nmid n$. Assume that
$p$ divides $\Phi_n(a)$. Prove that $p \equiv 1 \bmod n$. (Use Exercise 5.15.) [5.18]

5.18. ▷ Let $n > 0$ be any integer. Prove that there are infinitely many prime
integers $\equiv 1 \bmod n$. (Use Exercise 5.17 together with Exercise V.2.25.)

The result of this exercise is a particular case of Dirichlet’s theorem on primes
in arithmetic progressions; see §7.6. [§7.6]

5.19. ▷ Prove that the regular $n$-gon can be constructed by straightedge and compass
only if $n = 2^m p_1 \cdots p_r$, where $m \ge 0$ and the factors $p_i$ are distinct Fermat
primes. (Hint: Use Exercise V.6.8.) [§5.2, §7.2]

5.20. Recall from Exercise 1.10 that every field extension of degree $\le n$ over a
field $k$ is a sub-$k$-algebra of the ring of matrices $M_n(k)$. Prove that if $k$ is finite or
has characteristic 0, then every extension of $k$ contained in $M_n(k)$ has degree $\le n$.

5.21. Prove that if $k \subseteq F$ is the splitting field of a separable polynomial, then it is
the splitting field of an irreducible separable polynomial.

5.22. Let $k$ be an infinite field. If $F = k(\alpha_1, \dots, \alpha_r)$, with each $\alpha_i$ separable over $k$,
prove that there exist $c_1, \dots, c_r \in k$ such that $F = k(c_1\alpha_1 + \cdots + c_r\alpha_r)$.

5.23. ▷ Let $k$ be a field, and let $n > 0$ be an integer. Assume that there are
no irreducible polynomials of degree $n$ in $k[x]$. Prove that there are no separable
extensions of $k$ of degree $n$. [§7.1]


6. A little Galois theory 

Galois theory is a beautiful interplay of field theory and group theory, originally 
motivated by the problem of determining ‘symmetries’ among roots of a polynomial. 
Galois was interested in concrete relations which must necessarily hold among the 
roots of a polynomial, the most trivial example of which being the fact that if
$\alpha, \beta$ are roots of $x^2 + px + q$, then $\alpha + \beta = -p$ and $\alpha\beta = q$. Interchanging
the roots $\alpha, \beta$ has no effect on the quantities $\alpha + \beta$, $\alpha\beta$. More generally, for a
higher degree polynomial there may be several quantities which are invariant under 
certain permutations of the roots. These quantities and the corresponding groups of 
permutations may be viewed as invariants determined by the polynomial, yielding 
a sophisticated tool to study the polynomial.

6.1. The Galois correspondence and Galois extensions. In the language
of field theory, subsets of the roots of a polynomial $f(x)$ determine intermediate
fields of the splitting field of $f(x)$, and groups of permutations of the roots give
automorphisms of these intermediate fields. With this in mind, it is not surprising
that splitting fields should come to the fore; the fact that one can characterize such
fields in terms of groups of automorphisms (Theorem 6.9 below) will be key to the
whole discussion.

Before wading into these waters, we should formalize the precise relation be- 
tween groups of automorphisms of an extension and intermediate fields. 

Definition 6.1. Let $k \subseteq F$ be a field extension, and let $G \subseteq \mathrm{Aut}_k(F)$ be a group
of automorphisms of the extension. The fixed field of $G$ is the intermediate field

$$F^G := \{\alpha \in F \mid \forall g \in G,\ g\alpha = \alpha\}.$$ ⌟

The fact that $F^G$ is indeed a subfield of $F$ containing $k$ is immediate. The
notion of fixed field allows us to set up a correspondence

$$\{\text{intermediate fields } E : k \subseteq E \subseteq F\} \longleftrightarrow \{\text{subgroups of } \mathrm{Aut}_k(F)\},$$

sending the intermediate field $E$ to the subgroup $\mathrm{Aut}_E(F)$ of $\mathrm{Aut}_k(F)$ and the
subgroup $G \subseteq \mathrm{Aut}_k(F)$ to the fixed field $F^G$.

Definition 6.2. This is known as the Galois correspondence. ⌟

Lemma 6.3. The Galois correspondence is inclusion-reversing. Further, for all
subgroups $G$ of $\mathrm{Aut}_k(F)$ and all intermediate fields $k \subseteq E \subseteq F$:

• $E \subseteq F^{\mathrm{Aut}_E(F)}$;

• $G \subseteq \mathrm{Aut}_{F^G}(F)$.

Further still, denote by $E_1E_2$ the smallest subfield of $F$ containing two intermediate
fields $E_1$, $E_2$, and denote by $\langle G_1, G_2 \rangle$ the smallest subgroup of $\mathrm{Aut}_k(F)$ containing
two subgroups $G_1$, $G_2$. Then

• $\mathrm{Aut}_{E_1E_2}(F) = \mathrm{Aut}_{E_1}(F) \cap \mathrm{Aut}_{E_2}(F)$;

• $F^{\langle G_1, G_2 \rangle} = F^{G_1} \cap F^{G_2}$.

Proof. Exercise 6.1. □

Of course the reader should start wondering whether there are situations guar- 
anteeing that the inclusions appearing in Lemma 6.3 are equalities, making the 
Galois correspondence a bijection. This is precisely where we are heading. The 
first observation is that this is not always the case: 

Example 6.4. Consider the extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$. Since

$$[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 3$$

is prime, the only intermediate fields are $\mathbb{Q}$ and $\mathbb{Q}(\sqrt[3]{2})$ (by Corollary 1.11).
Concerning $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt[3]{2}))$, since $\sqrt[3]{2} \in \mathbb{R}$, we have an extension $\mathbb{Q}(\sqrt[3]{2}) \subseteq \mathbb{R}$; since $\sqrt[3]{2}$ is
the only cube root of 2 in $\mathbb{R}$, we see that the minimal polynomial $t^3 - 2$ of $\sqrt[3]{2}$ has a
single root in $\mathbb{Q}(\sqrt[3]{2})$. By Corollary 1.7, $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt[3]{2}))$ consists of a single element:
it is trivial.

Thus, in this example the Galois correspondence acts between a set with two
elements and a singleton:

$$\{\mathbb{Q}, \mathbb{Q}(\sqrt[3]{2})\} \longleftrightarrow \{\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt[3]{2}))\} = \{e\}.$$

In particular, the function associating with each intermediate field the corresponding
automorphism group is not injective in general. ⌟
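The key numerical fact in Example 6.4, that $t^3 - 2$ has exactly one real root, can be checked with a few lines of floating-point arithmetic. The following is only an illustrative sketch (the tolerance `1e-9` and the variable names are choices made here, not part of the text):

```python
import cmath

# The three complex roots of t^3 - 2 are 2^(1/3) * exp(2*pi*i*k/3), k = 0, 1, 2.
real_cube_root = 2 ** (1 / 3)
roots = [real_cube_root * cmath.exp(2j * cmath.pi * k / 3) for k in range(3)]

# A root lies in R exactly when its imaginary part vanishes (up to rounding).
real_roots = [z for z in roots if abs(z.imag) < 1e-9]

print(len(roots), "complex roots,", len(real_roots), "of them real")
```

Since $\mathbb{Q}(\sqrt[3]{2}) \subseteq \mathbb{R}$, only the real root can be the image of $\sqrt[3]{2}$ under an automorphism, which is why the automorphism group is trivial.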

This example shows that the inclusion $E \subseteq F^{\mathrm{Aut}_E(F)}$ in Lemma 6.3 may be
proper: $\mathbb{Q}$ is properly contained in $\mathbb{Q}(\sqrt[3]{2})$, which is the fixed field of the (only)
subgroup of $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt[3]{2}))$. Fortunately, the situation with the other inclusion is
more constrained:

Proposition 6.5. Let $k \subseteq F$ be a finite extension, and let $G$ be a subgroup
of $\mathrm{Aut}_k(F)$. Then $|G| = [F : F^G]$, and

$$G = \mathrm{Aut}_{F^G}(F).$$

In particular, the Galois correspondence (from intermediate fields to automorphism
groups) is surjective for a finite extension.

Proving this fact leads us to examining very carefully finite extensions of the
type $F^G \subseteq F$. Surprisingly, these turn out to satisfy just about every property we
have encountered so far:

Lemma 6.6. Let $k \subseteq F$ be a finite extension, and let $G$ be a subgroup of $\mathrm{Aut}_k(F)$.
Then $F^G \subseteq F$ is a finite, simple, normal, separable extension.

Remark 6.7. The key to this lemma is the following observation, which is worth
highlighting.

If $\alpha \in F$ and $g \in G$, note that $g\alpha$ must be a root of the minimal polynomial of
$\alpha$ over $k$; there are only finitely many roots, so the $G$-orbit of $\alpha$ consists of finitely
many elements $\alpha = \alpha_1, \dots, \alpha_n$. The group $G$ acts on the orbit by permuting its
elements; therefore, every element of $G$ leaves the polynomial

$$q_\alpha(t) = (t - \alpha_1) \cdots (t - \alpha_n)$$

fixed. In other words, the coefficients of this polynomial must be in the fixed
field $F^G$. Further, $q_\alpha(t)$ is separable since it has distinct roots. Finally, note
that $\deg q_\alpha(t) \le |G|$. ⌟
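To see the orbit polynomial of Remark 6.7 in a concrete case, one can take $F = \mathbb{Q}(\sqrt{2}, \sqrt{3})$, $G$ the group of the four sign changes $\sqrt{2} \mapsto \pm\sqrt{2}$, $\sqrt{3} \mapsto \pm\sqrt{3}$, and $\alpha = \sqrt{2} + \sqrt{3}$. The sketch below uses exact integer arithmetic; the encoding of elements as 4-tuples and all function names are assumptions made for this illustration.

```python
from itertools import product

# Elements of Q(sqrt2, sqrt3) with integer coordinates, encoded as
# (a, b, c, d) = a + b*sqrt2 + c*sqrt3 + d*sqrt6.
def add(x, y):
    return tuple(p + q for p, q in zip(x, y))

def neg(x):
    return tuple(-p for p in x)

def mul(x, y):
    a1, b1, c1, d1 = x
    a2, b2, c2, d2 = y
    return (
        a1 * a2 + 2 * b1 * b2 + 3 * c1 * c2 + 6 * d1 * d2,   # coefficient of 1
        a1 * b2 + b1 * a2 + 3 * (c1 * d2 + d1 * c2),         # coefficient of sqrt2
        a1 * c2 + c1 * a2 + 2 * (b1 * d2 + d1 * b2),         # coefficient of sqrt3
        a1 * d2 + d1 * a2 + b1 * c2 + c1 * b2,               # coefficient of sqrt6
    )

def act(g, x):
    """Apply the automorphism sqrt2 -> e1*sqrt2, sqrt3 -> e2*sqrt3."""
    e1, e2 = g
    a, b, c, d = x
    return (a, e1 * b, e2 * c, e1 * e2 * d)

G = list(product((1, -1), repeat=2))
alpha = (0, 1, 1, 0)                       # sqrt2 + sqrt3
orbit = sorted({act(g, alpha) for g in G})

ONE, ZERO = (1, 0, 0, 0), (0, 0, 0, 0)
q = [ONE]                                  # coefficients, lowest degree first
for r in orbit:
    shifted = [ZERO] + q                             # t * q(t)
    scaled = [mul(neg(r), c) for c in q] + [ZERO]    # -r * q(t)
    q = [add(s, u) for s, u in zip(shifted, scaled)]

print(q)
```

Every coefficient of the result has vanishing $\sqrt{2}$, $\sqrt{3}$, $\sqrt{6}$ components, so $q_\alpha(t) = t^4 - 10t^2 + 1$ has coefficients in the fixed field, as the remark predicts; here the orbit has $4 = |G|$ elements.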

With this in mind, we are ready to prove the lemma. 

Proof of Lemma 6.6. The extension $F^G \subseteq F$ is finite because $k \subseteq F$ is finite.

Let $\alpha \in F$; by the remark following the statement of the lemma, $\alpha$ is a root of
a separable polynomial $q_\alpha(t)$ with coefficients in $F^G$. It follows that $\alpha$ is separable
over $F^G$; hence the extension is separable according to Definition 4.12.

Since $F^G \subseteq F$ is finite and separable, it is simple by Proposition 5.19; let $\alpha$ be
a generator. The polynomial $q_\alpha(t)$ splits in $F$, and $F$ is generated over $F^G$ by the
roots of $q_\alpha(t)$ (indeed, $\alpha_1 = \alpha$ suffices to generate $F$ over $F^G$); therefore, $F$ is a
splitting field for $q_\alpha(t)$ over $F^G$ according to Definition 4.1. Therefore $F^G \subseteq F$ is
normal by Theorem 4.8, and we are done. □

Proposition 6.5 is a consequence of Lemma 6.6:

Proof of Proposition 6.5. By Lemma 6.3, $G$ is a subgroup of $\mathrm{Aut}_{F^G}(F)$, and in
particular

$$|G| \le |\mathrm{Aut}_{F^G}(F)|.$$

In order to verify that $G = \mathrm{Aut}_{F^G}(F)$, it suffices to prove the converse inequality. By
Lemma 6.6, $F = F^G(\alpha)$ for some $\alpha \in F$; therefore $|\mathrm{Aut}_{F^G}(F)|$ equals the number
of distinct roots in $F$ of the minimal polynomial of $\alpha$ over $F^G$ (by Corollary 1.7).
With notation as in Remark 6.7, $\alpha$ is a root of $q_\alpha(x) \in F^G[x]$; hence the minimal
polynomial of $\alpha$ is a factor of $q_\alpha(x)$. Since by construction the number of roots of
$q_\alpha(x)$ is $\le |G|$, we obtain

$$|\mathrm{Aut}_{F^G}(F)| \le |G|,$$

as needed.

To verify that $[F : F^G] = |G|$, just note that $[F : F^G] = |\mathrm{Aut}_{F^G}(F)|$ by
Corollary 5.20, since $F^G \subseteq F$ is normal and separable by Lemma 6.6. □

Remark 6.8. The hypothesis that the extension is finite in Proposition 6.5 is
necessary: the Galois correspondence is not necessarily surjective in the infinite
case. Not all is lost, though: one can give a suitable topology to the group of
automorphisms and limit the Galois correspondence to closed subgroups of the
automorphism group, recovering results such as Proposition 6.5. The reader is
welcome to explore this further; we will not be able to take this more general
point of view here. ⌟

Theorem 6.9. Let $k \subseteq F$ be a finite field extension. Then the following are
equivalent:

(1) $F$ is the splitting field of a separable polynomial $f(t) \in k[t]$ over $k$;

(2) $k \subseteq F$ is normal and separable;

(3) $|\mathrm{Aut}_k(F)| = [F : k]$;

(4) $k = F^{\mathrm{Aut}_k(F)}$ is the fixed field of $\mathrm{Aut}_k(F)$;

(5) the Galois correspondence for $k \subseteq F$ is a bijection;

(6) $k \subseteq F$ is separable, and if $F \subseteq K$ is an algebraic extension and $\sigma \in \mathrm{Aut}_k(K)$,
then $\sigma(F) = F$.

Proof. Most of the needed implications have been proven along the way.

(1) $\iff$ (2) by Theorem 4.8; (2) $\iff$ (3) by Corollary 5.20. (3) $\iff$ (4) follows
from Proposition 6.5, applied to the extension $F^{\mathrm{Aut}_k(F)} \subseteq F$: by Proposition 6.5,
we have

$$[F : F^{\mathrm{Aut}_k(F)}] = |\mathrm{Aut}_k(F)|;$$

since $k \subseteq F^{\mathrm{Aut}_k(F)} \subseteq F$, it follows that $k = F^{\mathrm{Aut}_k(F)}$ if and only if $|\mathrm{Aut}_k(F)| =
[F : k]$.

(2) $\iff$ (6) holds by Exercise 4.7, since finite separable extensions are simple
(Proposition 5.19).

To prove (5) $\implies$ (4), let $E = F^{\mathrm{Aut}_k(F)}$; by Proposition 6.5, $\mathrm{Aut}_E(F) =
\mathrm{Aut}_k(F)$. Assuming that the Galois correspondence is bijective, it follows that
$k = E = F^{\mathrm{Aut}_k(F)}$, giving (4).

Finally, we prove that (1) $\implies$ (5). Since $k \subseteq F$ is a finite extension, we
already know that the Galois correspondence from intermediate fields to subgroups
of $\mathrm{Aut}_k(F)$ has a right inverse (Proposition 6.5), so it suffices to show it has a
left-inverse. Therefore, it suffices to verify that every intermediate field $E$ equals the
fixed field of the corresponding subgroup $\mathrm{Aut}_E(F)$. Then let $E$ be an intermediate
field. By (1), $F$ is the splitting field of a separable polynomial $f(t) \in k[t] \subseteq E[t]$;
therefore, condition (1) holds for the extension $E \subseteq F$. As we have already proved
that (1) $\implies$ (4), this implies that $E = F^{\mathrm{Aut}_E(F)}$, and we are done. □

Definition 6.10. A finite extension $k \subseteq F$ is Galois if it satisfies any of the
conditions listed in Theorem 6.9. ⌟

Warning: We will not study the Galois condition for infinite extensions (cf.
Remark 6.8). Thus Galois extensions will implicitly be finite in what follows.

Condition (6) is technically useful and helps with visualizing the Galois condition.
If a finite separable extension $k \subseteq F$ is not Galois, then $F$ can be embedded
in some larger extension $k \subseteq K$ (for example, in the algebraic closure $k \subseteq \overline{k}$) in
many possible ways:

[Figure: several distinct images of $F$ inside $K$.]

If $k \subseteq F$ is Galois, all these images must coincide:

[Figure: a single image of $F$ inside $K$.]



Of course there are possibly still many ways²¹ to embed $F$ in $K$, but they all have
the same image $F$.

The mysterious comments at the end of Example 1.4 are now hopefully clear.
Galois extensions of a field $k$ are well-defined subfields of $\overline{k}$, preserved by
automorphisms of $\overline{k}$ over $k$. Contrast this situation with the non-Galois extension
$\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$, which admits three different embeddings in $\overline{\mathbb{Q}}$. For example, it does
not make sense to talk about 'the' composite of $\mathbb{Q}(\sqrt{2})$ and $\mathbb{Q}(\sqrt[3]{2})$, as a subfield
of (for instance) $\mathbb{C}$: according to the chosen embedding of $\mathbb{Q}(\sqrt[3]{2})$, this may or may
not be a subfield of $\mathbb{R}$. But²² it does make sense to talk about the composite of two
Galois extensions $k \subseteq F_1$, $k \subseteq F_2$, since both $F_1$ and $F_2$ are well-defined as subfields
of $\overline{k}$, so their composite $F_1F_2$ is independent of the choice of embeddings $F_1 \subseteq \overline{k}$,
$F_2 \subseteq \overline{k}$.

²¹ In fact, precisely $[F : k]_s = [F : k] = |\mathrm{Aut}_k(F)|$ if $K = \overline{k}$ and hence for all $K \supseteq F$.

Definition 6.11. If $k \subseteq F$ is a Galois extension, the corresponding automorphism
group $\mathrm{Aut}_k(F)$ is called the Galois group of the extension. ⌟

To reiterate, the extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$ is not a Galois extension: as we have
observed in Example 6.4, $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\sqrt[3]{2}))$ is trivial (hence, for example, condition (3)
of Theorem 6.9 does not hold).

On the other hand, we have already run into several examples of Galois exten- 
sions: the examples of splitting fields we have seen in §4.1 are all Galois extensions. 
The ‘Galois fields’ of §5.1 are (surprise, surprise) Galois extensions of their prime 
subfields, by Theorem 5.1. The cyclotomic fields (§5.2) are Galois extensions of Q. 

A Galois extension $k \subseteq F$ is cyclic, abelian, etc., if the corresponding group
of automorphisms $\mathrm{Aut}_k(F)$ is cyclic, abelian, etc. For example, $\mathbb{Q} \subseteq \mathbb{Q}(\zeta_8)$ is an
abelian, but not cyclic, Galois extension (cf. Proposition 5.16 and Example 5.17).
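Granting the identification of $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ with the group of units of $\mathbb{Z}/n\mathbb{Z}$ (Proposition 5.16), the claim about $\mathbb{Q}(\zeta_8)$ reduces to a finite computation. The helper names below (`units`, `order`, `is_cyclic`) are invented for this sketch.

```python
from math import gcd

def units(n):
    """Representatives of the units of Z/nZ."""
    return [a for a in range(n) if gcd(a, n) == 1]

def order(a, n):
    """Multiplicative order of the unit a modulo n."""
    k, x = 1, a % n
    while x != 1 % n:
        x = (x * a) % n
        k += 1
    return k

def is_cyclic(n):
    """The unit group is cyclic iff some unit has order equal to the group size."""
    U = units(n)
    return any(order(a, n) == len(U) for a in U)

print(units(8), [order(a, 8) for a in units(8)], is_cyclic(8))
```

For $n = 8$ every unit squares to 1, so the group has order 4 but no element of order 4: it is abelian and not cyclic.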

6.2. The fundamental theorem of Galois theory, I. The fundamental theo- 
rem of Galois theory amounts to a more complete description of the Galois corre- 
spondence for Galois extensions. 

At this stage, this is what we know: 

• If $k \subseteq F$ is a (finite) Galois extension, then there is an inclusion-reversing
bijection

$$\{\text{intermediate fields } E : k \subseteq E \subseteq F\} \longleftrightarrow \{\text{subgroups of } \mathrm{Aut}_k(F)\}:$$

through this bijection, the intermediate field $E$ corresponds to the subgroup
$\mathrm{Aut}_E(F)$ of $\mathrm{Aut}_k(F)$, and the subgroup $G \subseteq \mathrm{Aut}_k(F)$ corresponds to the fixed
field $F^G$.

• For every intermediate field $E$,

$$[F : E] = |\mathrm{Aut}_E(F)|.$$

The extension $E \subseteq F$ is then also a Galois extension (Exercise 6.3), and

$$[E : k] = [\mathrm{Aut}_k(F) : \mathrm{Aut}_E(F)]:$$

this follows from Lagrange's theorem (Corollary II.8.14) and its field theory
counterpart (Proposition 1.10).

• The extension $k \subseteq E$ is not necessarily Galois. (Once more $\mathbb{Q}(\sqrt[3]{2})$ gives a
counterexample, since it can be viewed as intermediate in the splitting field
of $t^3 - 2$ over $\mathbb{Q}$.)

The fact that the Galois correspondence is a bijection may be upgraded as
follows:

Theorem 6.12. Let $k \subseteq F$ be a Galois extension. The Galois correspondence is
an inclusion-reversing isomorphism of the lattice of intermediate subfields of $k \subseteq F$
with the lattice of subgroups of $\mathrm{Aut}_k(F)$.

That is (with notation as in Lemma 6.3), if $E_1$, $E_2$ are intermediate fields and
$G_1$, $G_2$ are the corresponding subgroups of $\mathrm{Aut}_k(F)$, then $E_1 \cap E_2$ corresponds to
$\langle G_1, G_2 \rangle$ and $E_1E_2$ corresponds to $G_1 \cap G_2$.

Proof. This follows immediately from Theorem 6.9 and Lemma 6.3, which gives

$$\mathrm{Aut}_{E_1E_2}(F) = G_1 \cap G_2, \qquad F^{\langle G_1, G_2 \rangle} = E_1 \cap E_2$$

as needed. □

²² 'Separability' does not play a role in these considerations; thus they best convey 'Galoisness'
in, say, characteristic 0.


In this context, it is common to use the ‘vertical’ depiction of field extensions, 
with a parallel notation for subgroups: 

$$\begin{array}{ccc}
F & \qquad & \{e\} \\
| & & | \\
E & & \mathrm{Aut}_E(F) \\
| & & | \\
k & & \mathrm{Aut}_k(F)
\end{array}$$

The content of Theorem 6.12 is that the lattice of intermediate fields of a Galois
extension $k \subseteq F$ and the lattice of subgroups of the corresponding Galois group are
identical.

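Example 6.13. For finite fields, this coincidence of lattices was essentially proven
'by hand' in Corollary 5.4 and Proposition 5.8. For example, the extension $\mathbb{F}_2 \subseteq \mathbb{F}_{64}$
is Galois, with cyclic Galois group generated by the Frobenius automorphism $\varphi$.
The lattices are

[Figure: the lattice of subfields $\mathbb{F}_2 \subseteq \mathbb{F}_4, \mathbb{F}_8 \subseteq \mathbb{F}_{64}$ alongside the lattice of subgroups $\langle \varphi \rangle \supseteq \langle \varphi^2 \rangle, \langle \varphi^3 \rangle \supseteq \{e\}$.]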
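The field side of Example 6.13 can be verified by brute force. The sketch below models $\mathbb{F}_{64}$ as $\mathbb{F}_2[x]/(x^6 + x + 1)$ (this particular irreducible polynomial, the bit-mask encoding, and the function names are choices made here) and counts the elements fixed by each power $\varphi^d$ of the Frobenius automorphism.

```python
MOD = 0b1000011  # x^6 + x + 1, an irreducible polynomial over F_2

def mul(a, b):
    """Multiply in F_2[x]/(x^6 + x + 1); elements are 6-bit integers."""
    result = 0
    while b:
        if b & 1:
            result ^= a          # add a * (current power of x)
        b >>= 1
        a <<= 1                  # multiply a by x
        if a & 0b1000000:        # reduce when the degree reaches 6
            a ^= MOD
    return result

def frobenius(a):
    """The Frobenius automorphism a -> a^2."""
    return mul(a, a)

def fixed_field_size(d):
    """Number of elements of F_64 fixed by the d-th power of Frobenius."""
    count = 0
    for a in range(64):
        b = a
        for _ in range(d):
            b = frobenius(b)
        if b == a:
            count += 1
    return count

sizes = {d: fixed_field_size(d) for d in (1, 2, 3, 6)}
print(sizes)
```

The fixed field of $\langle \varphi^d \rangle$ has $2^d$ elements for each divisor $d$ of 6, so it has degree $d$ over $\mathbb{F}_2$ and $\mathbb{F}_{64}$ has degree $6/d = |\langle \varphi^d \rangle|$ over it, in agreement with Proposition 6.5.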



Theorem 6.12 is often used in the ‘groups to fields’ direction: finding the lattice 
of subgroups of a finite group is essentially a combinatorial problem, and through 
the Galois correspondence it determines the lattice of intermediate fields of a cor- 
responding Galois extension. 

Example 6.14. The extension $\mathbb{Q}(\sqrt{2}, \sqrt{3}) = \mathbb{Q}(\sqrt{2} + \sqrt{3})$ studied in Example 1.19
is the splitting field of the polynomial $t^4 - 10t^2 + 1$, so it is Galois. We found that
its Galois group is $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$; the lattice of this group has no mysteries for us:

$$\begin{array}{ccc}
 & \{(0,0)\} & \\
\langle (1,0) \rangle & \langle (1,1) \rangle & \langle (0,1) \rangle \\
 & \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z} &
\end{array}$$

and therefore the lattice of intermediate fields is just as transparent:

$$\begin{array}{ccc}
 & \mathbb{Q}(\sqrt{2}, \sqrt{3}) & \\
\mathbb{Q}(\sqrt{2}) & \mathbb{Q}(\sqrt{6}) & \mathbb{Q}(\sqrt{3}) \\
 & \mathbb{Q} &
\end{array}$$

(The intermediate fields are determined by recalling the generators of the
corresponding subgroups, as found in Example 1.19: the flip $\sqrt{3} \mapsto -\sqrt{3}$ leaves $\mathbb{Q}(\sqrt{2})$
fixed, etc.) Galois theory tells us that there is no other intermediate field. ⌟
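The matching of subgroups with fixed fields in Example 6.14 can be tabulated mechanically. In the sketch below, the element $(i, j)$ of $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$ is assumed to act by $\sqrt{3} \mapsto (-1)^i\sqrt{3}$, $\sqrt{2} \mapsto (-1)^j\sqrt{2}$; this labelling is chosen here to agree with the diagrams and may differ from the one in Example 1.19. Since every automorphism acts diagonally on the basis $1, \sqrt{2}, \sqrt{3}, \sqrt{6}$, the fixed field of a subgroup is spanned by the basis elements that all of its members fix.

```python
from itertools import combinations, product

BASIS = ("1", "sqrt2", "sqrt3", "sqrt6")
G = list(product((0, 1), repeat=2))

def signs(g):
    """Signs by which g = (i, j) scales 1, sqrt2, sqrt3, sqrt6 (assumed labelling)."""
    i, j = g
    s3, s2 = (-1) ** i, (-1) ** j
    return (1, s2, s3, s2 * s3)

def add(g, h):
    return ((g[0] + h[0]) % 2, (g[1] + h[1]) % 2)

def is_subgroup(H):
    # In a finite group, containing the identity and being closed suffices.
    return (0, 0) in H and all(add(g, h) in H for g in H for h in H)

subgroups = [set(S) for r in range(1, 5) for S in combinations(G, r) if is_subgroup(set(S))]

def fixed_basis(H):
    """Basis elements fixed by every automorphism in H."""
    return tuple(b for idx, b in enumerate(BASIS) if all(signs(g)[idx] == 1 for g in H))

lattice = {tuple(sorted(H)): fixed_basis(H) for H in subgroups}
for H, basis in lattice.items():
    print(H, "fixes the span of", basis)
```

The five subgroups produce five distinct fixed fields, of dimensions 4, 2, 2, 2, 1 over $\mathbb{Q}$, which is the full list of intermediate fields.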


6.3. The fundamental theorem of Galois theory, II. Another popular
notational device is to mark an extension by the corresponding Galois group, if the
extension is Galois:

$$\begin{array}{l}
F \\
|\ \ \mathrm{Aut}_E(F) \\
E \\
| \\
k
\end{array}$$

Example 6.4 shows that the extension $k \subseteq E$ need not be a Galois extension:
$\mathbb{Q}(\sqrt[3]{2})$ is an intermediate field for the extension of $\mathbb{Q}$ into the splitting field for the
polynomial $t^3 - 2$ (which is Galois, by part (1) of Theorem 6.9), but as we have
seen, it is not Galois. The splitting field has degree 6 over $\mathbb{Q}$ and is a quadratic
extension of $\mathbb{Q}(\sqrt[3]{2})$. This is in fact the typical situation: every finite separable
extension may be enlarged to a Galois extension (Exercise 6.4).

In any case, the question of whether the extension of the base field $k$ in an
intermediate field $E$ of a Galois extension $k \subseteq F$ is Galois is central to the theory.
The outcome of the discussion will be very neat, and this is possibly the most
striking part of the fundamental theorem of Galois theory:

Theorem 6.15. Let $k \subseteq F$ be a Galois extension, and let $E$ be an intermediate
field. Then $k \subseteq E$ is Galois if and only if $\mathrm{Aut}_E(F)$ is normal in $\mathrm{Aut}_k(F)$; in this
case, there is an isomorphism

$$\mathrm{Aut}_k(E) \cong \frac{\mathrm{Aut}_k(F)}{\mathrm{Aut}_E(F)}.$$

The 'problem' with $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$ is that the automorphism group of the splitting
field of $t^3 - 2$ is $S_3$ (as we will soon be able to verify: cf. Example 7.19). The
subgroup corresponding to $\mathbb{Q}(\sqrt[3]{2})$ has index 3; hence (as we know, since we are
well acquainted with $S_3$) it is not normal.
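For readers who would like to re-acquaint themselves with $S_3$, the non-normality claim is a finite check. In this sketch $S_3$ is modelled as the permutations of the three roots of $t^3 - 2$ (labelled 0, 1, 2, an arbitrary choice made here), and the subgroup corresponding to $\mathbb{Q}(\sqrt[3]{2})$ is taken to be the stabilizer of the root labelled 0.

```python
from itertools import permutations

S3 = list(permutations(range(3)))

def compose(p, q):
    """(p o q)(i) = p[q[i]]."""
    return tuple(p[q[i]] for i in range(3))

def inverse(p):
    inv = [0] * 3
    for i, pi in enumerate(p):
        inv[pi] = i
    return tuple(inv)

def is_normal(H):
    """H is normal iff g h g^{-1} lies in H for all g in S3 and h in H."""
    return all(compose(compose(g, h), inverse(g)) in H for g in S3 for h in H)

identity = (0, 1, 2)
H = {identity, (0, 2, 1)}                  # stabilizer of root 0: order 2, index 3
A3 = {identity, (1, 2, 0), (2, 0, 1)}      # the rotations: order 3, index 2

left_cosets = {frozenset(compose(g, h) for h in H) for g in S3}

print(len(left_cosets), is_normal(H), is_normal(A3))
```

The three left cosets of `H` match the degree $[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 3$, while the index 2 subgroup `A3` is normal, so the corresponding quadratic intermediate field is Galois over $\mathbb{Q}$.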

The fact that the Galois group of $k \subseteq E$ will turn out to be isomorphic to the
quotient $\mathrm{Aut}_k(F)/\mathrm{Aut}_E(F)$ in the Galois case should not be too surprising, since
we know already that in any case $[E : k]$ equals the index of $\mathrm{Aut}_E(F)$ in $\mathrm{Aut}_k(F)$,
and the index equals the order of the quotient if the subgroup is normal. In general,
$[E : k]$ equals the number of cosets of $\mathrm{Aut}_E(F)$ in $\mathrm{Aut}_k(F)$, and it is worth shooting
for a concrete interpretation of these cosets, even when $k \subseteq E$ does not turn out to
be a Galois extension.

We already know that the number of left-cosets of $\mathrm{Aut}_E(F)$ equals the index
of $\mathrm{Aut}_E(F)$ in $\mathrm{Aut}_k(F)$, hence $[E : k]$, hence $[E : k]_s$ (because $k \subseteq E$ is separable,
as a subextension of a separable extension), and hence there must exist a bijection
between the left-cosets $g\,\mathrm{Aut}_E(F)$ as $g$ ranges in $\mathrm{Aut}_k(F)$ and the different
embeddings $\iota : E \to \overline{k}$ in an algebraic closure of $k$, extending $\mathrm{id}_k$. We are now going to
find a 'meaningful' bijection between these two sets.

Fix an embedding $F \subseteq \overline{k}$ of $F$ in an algebraic closure of $k$, and let $I$ be the
set of embeddings $\iota : E \to \overline{k}$ extending the identity on $k$. It is a good idea to
view $k \subseteq E$ as an abstract extension, rather than identifying $E$ with a specific
intermediate field in $F$; the fact that we can view $E$ as an intermediate field of the
extension $k \subseteq F$ simply means that $\iota(E) \subseteq F$ for some $\iota \in I$.

The following general observations clarify the situation considerably:

• For all $\iota \in I$, $\iota(E) \subseteq F$.

Indeed, $k \subseteq E$ is simple (finite and separable $\implies$ simple, Proposition 5.19). If
$E = k(\alpha)$, then $\iota$ is determined by $\iota(\alpha)$, which is necessarily a root of the minimal
polynomial of $\alpha$ over $k$. But $k \subseteq F$ is normal and contains some root $\iota(\alpha)$ of this
polynomial; hence $F$ contains all of them (Definition 4.7). That is, $\iota(\alpha) \in F$ for all
$\iota \in I$, implying $\iota(k(\alpha)) \subseteq F$ for all $\iota \in I$.

• Let $\iota \in I$, and let $g \in \mathrm{Aut}_k(F)$; then $g \circ \iota \in I$. Therefore, $\mathrm{Aut}_k(F)$ acts on $I$.

Here $g \circ \iota$ is interpreted in the following way: $\iota(E) \subseteq F$, as we have seen; hence
$g$ restricts to a homomorphism $\iota(E) \to F$, and $g \circ \iota$ is the composition of this
homomorphism with $\iota$.

This is an excellent moment to review the material on group actions we covered
in §II.9, especially §II.9.3. Recall in particular that we were able to classify all
transitive actions of a group on a set (Proposition II.9.9); this is precisely our
situation:

• The action of $\mathrm{Aut}_k(F)$ on $I$ is transitive.

This follows from the 'uniqueness of splitting fields', Lemma 4.2. Indeed, let $\iota_1, \iota_2 \in
I$. Both $\iota_1(E)$, $\iota_2(E)$ contain $k$, and $F$ is a splitting field over $k$ (part (1) of
Theorem 6.9); hence $F$ is a splitting field (for the same polynomial) over both $\iota_1(E)$
and $\iota_2(E)$. By Lemma 4.2, there exists an automorphism $g : F \to F$ extending the
isomorphism

$$\iota_2 \circ \iota_1^{-1} : \iota_1(E) \to \iota_2(E).$$

Then $g \in \mathrm{Aut}_k(F)$ (since $\iota_2 \circ \iota_1^{-1}$ restricts to the identity on $k$), and the fact that
$g$ restricts to $\iota_2 \circ \iota_1^{-1}$ means precisely that $g \circ \iota_1 = \iota_2$.

Now choose one $\iota \in I$, and use it to identify $E$ with an intermediate field of
$k \subseteq F$. Then $\mathrm{Aut}_E(F) = \mathrm{Aut}_{\iota(E)}(F)$ is the subgroup of $\mathrm{Aut}_k(F)$ consisting of
those elements $g$ which restrict to the identity on $E = \iota(E)$, that is, the stabilizer
of $\iota$.

• As an $\mathrm{Aut}_k(F)$-set, $I$ is isomorphic to the set of left-cosets of $\mathrm{Aut}_E(F)$ in
$\mathrm{Aut}_k(F)$.

At this point, this follows immediately from Proposition II.9.9.

As promised, we have found a meaningful bijection between the two sets,
'explaining' the equality

$$[E : k] = [E : k]_s = |I| = [\mathrm{Aut}_k(F) : \mathrm{Aut}_E(F)]$$

that we had already discovered by numerical considerations (= Lagrange's theorem;
cf. the beginning of §6.2). Further, we are now in a position to prove Theorem 6.15.

Proof of Theorem 6.15. As observed above, the $\mathrm{Aut}_k(F)$-stabilizer of $\iota \in I$
equals $\mathrm{Aut}_{\iota(E)}(F)$. By Proposition II.9.12, stabilizers of different $\iota$ are conjugates
of each other; if $\mathrm{Aut}_E(F)$ is normal, it follows that for all $\iota \in I$

$$\mathrm{Aut}_{\iota(E)}(F) = \mathrm{Aut}_E(F).$$

This implies that $\iota(E) = E$ for all $\iota \in I$, since the Galois correspondence is bijective
for the Galois extension $k \subseteq F$. It follows that $k \subseteq E$ satisfies condition (6) of
Theorem 6.9; hence it is Galois.

Conversely, assume that $k \subseteq E$ is Galois; by the same condition (6), $\iota(E) = E$
for all $\iota \in I$. Restriction to $E = \iota(E)$ then defines a homomorphism

$$\rho : \mathrm{Aut}_k(F) \to \mathrm{Aut}_k(E).$$

This homomorphism is surjective (because the $\mathrm{Aut}_k(F)$-action on $I$ is transitive),
and its kernel consists of those $g \in \mathrm{Aut}_k(F)$ which restrict to the identity on $E$,
that is, precisely of $\mathrm{Aut}_E(F)$. This shows that $\mathrm{Aut}_E(F)$ is normal and establishes
the stated isomorphism by virtue of the 'first isomorphism theorem' for groups,
Corollary II.8.2. □

Remark 6.16. There is a parallel between Galois theory and the theory of covering
spaces in topology. In this analogy, Galois extensions correspond to regular covers;
the Galois group of an extension corresponds to the group of deck transformations;
and Theorem 6.15 corresponds to the fact that the quotient of a regular cover by a
normal subgroup of the group of deck transformations is again a regular cover.

More general (connected) covers correspond to more general algebraic extensions.
A space is simply connected if and only if it admits no nontrivial connected
covers, so this notion corresponds in field theory to the condition that a field $K$
admits no nontrivial algebraic extensions, that is, that $K$ is algebraically closed.
Viewing the fundamental group of a space as the group of deck transformations
of its universal cover suggests that we should think of the Galois group of the
algebraic closure²³ $k \subseteq \overline{k}$ as 'the fundamental group' of a field $k$.

In algebraic geometry this analogy is carried out to its natural consequences.
A covering map of algebraic varieties $X \to Y$ determines a field extension $K(Y) \subseteq
K(X)$, where $K(X)$, $K(Y)$ are the 'fields of rational functions' (in the affine case,
these are just the fields of fractions of the corresponding coordinate rings; cf.
Exercise 2.16). One can then use Galois theory to transfer to the algebro-geometric
environment notions such as the fundamental group, without appealing to topological
notions (such as 'continuous maps from $S^1$'), which would be problematic in,
e.g., positive characteristic. ⌟

6.4. Further remarks and examples. Suppose $k \subseteq F$ and $k \subseteq K$ are two finite
extensions contained in a larger extension of $k$ (for example, we could choose specific
embeddings $F \subseteq \overline{k}$, $K \subseteq \overline{k}$ in an algebraic closure of $k$). Then we can consider the
composite $FK$ (cf. Lemma 6.3) of $F$, $K$ in the larger extension. It is natural to
question how the Galois condition behaves under this operation.

Proposition 6.17. Suppose $k \subseteq F$ is a Galois extension and $k \subseteq K$ is any finite
extension. Then $K \subseteq KF$ is a Galois extension, and $\mathrm{Aut}_K(KF) \cong \mathrm{Aut}_{F \cap K}(F)$.

Pictorially,

$$\begin{array}{ccc}
 & KF & \\
K & & F \\
 & F \cap K & \\
 & k &
\end{array}$$


Proof. As $k \subseteq F$ is Galois, it is the splitting field of a separable polynomial
$f(x) \in k[x] \subseteq K[x]$. The roots of $f(x)$ generate $F$ over $k$, so they generate $KF$
over $K$; in other words, $KF$ is the splitting field of $f(x)$ over $K$, so $K \subseteq KF$ is
Galois.

If $\sigma : KF \to KF$ is an automorphism extending the identity on $K$ (and hence
on $k$), then $\sigma$ restricts to a homomorphism $F \to \sigma(F)$ extending the identity on $k$.
Since $k \subseteq F$ is Galois, $\sigma(F) = F$ ((6) in Theorem 6.9). Thus, 'restriction' defines
a natural group homomorphism

$$\rho : \mathrm{Aut}_K(KF) \to \mathrm{Aut}_k(F).$$

We claim that $\rho$ is injective. Indeed, by definition every $\sigma \in \mathrm{Aut}_K(KF)$ is the
identity on $K$; if $\sigma$ restricts to the identity on $F$, then $\sigma$ is the identity on the
composite $KF$; that is, $\sigma = \mathrm{id}$ in $\mathrm{Aut}_K(KF)$.

Therefore, $\mathrm{Aut}_K(KF)$ may be identified with a subgroup $G$ of $\mathrm{Aut}_k(F)$. Since
every $\sigma \in \mathrm{Aut}_K(KF)$ fixes $K$, the restriction $\rho(\sigma)$ fixes $F \cap K$; that is, $F^G \supseteq F \cap K$.
Conversely, suppose $\alpha \in F^G$: then $K(\alpha)$ is in the fixed field of $\mathrm{Aut}_K(KF)$, which
is $K$ itself since $K \subseteq KF$ is Galois (condition (4) of Theorem 6.9). That is, $\alpha \in K$,
and it follows that $F^G = F \cap K$. By Proposition 6.5, $G = \mathrm{Aut}_{F \cap K}(F)$, concluding
the proof. □

²³ Of course the extension $k \subseteq \overline{k}$ is not finite in general, so one needs the infinite version of
Galois theory; cf. Remark 6.8.

We will leave to the reader the pleasure of verifying that if both $k \subseteq F$ and
$k \subseteq K$ are Galois, then $FK$ is also Galois over $k$, and so is $F \cap K$ (Exercise 6.13).

Example 6.18. We have studied cyclotomic fields $\mathbb{Q}(\zeta_n)$ as extensions of $\mathbb{Q}$; $\mathbb{Q}(\zeta_n)$
is the splitting field of $x^n - 1$, so these extensions are Galois; we have proved
(Proposition 5.16) that $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ is isomorphic to the group of units in $\mathbb{Z}/n\mathbb{Z}$.

Now let $k$ be any field of characteristic zero. The splitting field of $x^n - 1$ over $k$
is the composite $k(\zeta)$ of $k$ and $\mathbb{Q}(\zeta)$. By Proposition 6.17 the extension $k \subseteq k(\zeta)$ is
Galois, and $\mathrm{Aut}_k(k(\zeta))$ is isomorphic to a subgroup of the group of units of $\mathbb{Z}/n\mathbb{Z}$.
In particular, it is abelian. ⌟

The previous example gives an important class of abelian Galois extensions.
Remarkably, we can classify all cyclic Galois extensions, if we impose some
restriction on the base field $k$.

Proposition 6.19. Let $k \subseteq F$ be an extension of degree $m$. Assume that $k$ contains
a primitive $m$-th root of 1 and $\operatorname{char} k$ does not divide²⁴ $m$. Then $k \subseteq F$ is Galois
and cyclic if and only if $F = k(\delta)$, with $\delta^m \in k$.

That is, under the given hypotheses on $k$, the cyclic Galois extensions of $k$
are precisely those of the form $k \subseteq k(\sqrt[m]{c})$, with $c \in k$. This is part of Kummer
theory, a basic tool in the study of abelian extensions. Proposition 6.19 will be a
key ingredient in our application of Galois theory to the solvability of polynomial
equations.

Proof. Let $\zeta \in k$ be a primitive $m$-th root of 1.

First assume that $F = k(\delta)$, with $\delta^m = c \in k$. Then all $m$ roots of the
polynomial $x^m - c$,

$$\delta,\ \zeta\delta,\ \zeta^2\delta,\ \dots,\ \zeta^{m-1}\delta,$$

are in $F$, and $F$ is generated by them; therefore $F$ is the splitting field of the
separable (since $\operatorname{char} k$ does not divide $m$) polynomial $x^m - c$ over $k$; hence (Theorem 6.9,
part (1)) $k \subseteq F$ is Galois.

To see that $\mathrm{Aut}_k(F)$ is cyclic, note that every $\varphi \in \mathrm{Aut}_k(F)$ is determined
by $\varphi(\delta)$, which is necessarily a root of $x^m - c$; therefore $\varphi(\delta) = \zeta^i\delta$ for some $i$,
determined up to a multiple of $m$. This defines an isomorphism of $\mathrm{Aut}_k(F)$ with
$\mathbb{Z}/m\mathbb{Z}$, as is immediately verified.

Conversely, assume that $k \subseteq F$ is Galois and $\mathrm{Aut}_k(F)$ is cyclic, generated by $\varphi$.
The automorphisms $\varphi^i$, $i = 0, \dots, m-1$, are pairwise distinct; thus they are linearly
independent over $F$ (Exercise 6.14). Therefore, there exists an $a \in F$ such that²⁵

$$\delta := a + \zeta^{-1}\varphi(a) + \cdots + \zeta^{-(m-2)}\varphi^{m-2}(a) + \zeta^{-(m-1)}\varphi^{m-1}(a) \ne 0.$$

Note that

$$\varphi(\delta) = \varphi(a) + \zeta^{-1}\varphi^2(a) + \cdots + \zeta^{-(m-2)}\varphi^{m-1}(a) + \zeta^{-(m-1)}a = \zeta\delta$$

(since $\zeta \in k$, so $\varphi(\zeta) = \zeta$; moreover $\varphi^m = \mathrm{id}$ and $\zeta^{-(m-1)} = \zeta$). This implies
that $\delta$ is not fixed by any nonidentity
element of $\mathrm{Aut}_k(F)$; that is, $\mathrm{Aut}_{k(\delta)}(F) = \{e\}$; that is, $k(\delta) = F$. Further,

$$\varphi(\delta^m) = \varphi(\delta)^m = \zeta^m\delta^m = \delta^m;$$

that is, $\delta^m$ is fixed by $\varphi$, and hence by the whole of $\mathrm{Aut}_k(F)$. Since $k \subseteq F$ is
Galois, this implies $\delta^m \in k$ (Theorem 6.9, part (4)). That is, $\delta$ satisfies the stated
conditions. □

²⁴ We will apply this result in §7.4; for convenience, we will take $\operatorname{char} k = 0$ in that application.
²⁵ This is called a Lagrange resolvent.
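The smallest instance of the resolvent construction is $m = 2$, $k = \mathbb{Q}$, $F = \mathbb{Q}(\sqrt{2})$, with $\zeta = -1$ and $\varphi$ the conjugation $x + y\sqrt{2} \mapsto x - y\sqrt{2}$. The sketch below works in exact rational arithmetic; the pair encoding $(x, y) = x + y\sqrt{2}$, the choice $a = 1 + \sqrt{2}$, and the function names are assumptions made for this illustration.

```python
from fractions import Fraction

# Elements of Q(sqrt2) are pairs (x, y) standing for x + y*sqrt2.
def add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def scale(c, u):
    return (c * u[0], c * u[1])

def mul(u, v):
    return (u[0] * v[0] + 2 * u[1] * v[1], u[0] * v[1] + u[1] * v[0])

def phi(u):
    """The nontrivial automorphism of Q(sqrt2) over Q."""
    return (u[0], -u[1])

m = 2
zeta = Fraction(-1)          # a primitive 2nd root of 1 in Q

def resolvent(a):
    """delta = sum over i = 0..m-1 of zeta^(-i) * phi^i(a)."""
    total = (Fraction(0), Fraction(0))
    term = a
    for i in range(m):
        total = add(total, scale(zeta ** (-i), term))
        term = phi(term)
    return total

a = (Fraction(1), Fraction(1))          # 1 + sqrt2
delta = resolvent(a)
print(delta, phi(delta), mul(delta, delta))
```

Here $\delta = 2\sqrt{2}$, so $\varphi(\delta) = -\delta = \zeta\delta$ and $\delta^2 = 8 \in \mathbb{Q}$, as in the proof. A choice such as $a = 1$ would give $\delta = 0$, which is why the proof needs linear independence to produce a suitable $a$.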


Exercises 


6.1. ▷ Prove Lemma 6.3. [§6.1]

6.2. Prove that quadratic extensions are Galois.

6.3. ▷ Let $k \subseteq F$ be a Galois extension, and let $E$ be an intermediate field. Prove
that $E \subseteq F$ is a Galois extension. [§6.2]

6.4. ▷ Let $k \subseteq E$ be a finite separable extension. Prove that $E$ may be identified
with an intermediate field of a Galois extension $k \subseteq F$ of $k$.

In fact, prove that there is a smallest such extension $k \subseteq F$, in the sense that
if $k \subseteq E \subseteq K$, with $k \subseteq K$ Galois, then there exists an embedding of $F$ in $K$ which
is the identity on $E$. (The extension $k \subseteq F$ is the Galois closure of the extension
$k \subseteq E$. It is clearly uniquely determined up to isomorphism.) [§6.3, 6.5]

6.5. We have proved (Proposition 5.19) that all finite separable extensions $k \subseteq E$
are simple. Let $k \subseteq F$ be a Galois closure of $k \subseteq E$ (Exercise 6.4). For $\alpha \in E$,
prove that $E = k(\alpha)$ if and only if $\alpha$ is moved by all $\sigma \in \mathrm{Aut}_k(F) \smallsetminus \mathrm{Aut}_E(F)$.

6.6. • Prove that $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2 + \sqrt{2}})$ is Galois, with cyclic Galois group. (Cf.
Exercise 1.25.)

• Prove that $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{3 + \sqrt{5}})$ is Galois and its Galois group is isomorphic to
$(\mathbb{Z}/2\mathbb{Z}) \times (\mathbb{Z}/2\mathbb{Z})$.

• Prove that $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{1 + \sqrt{2}})$ is not Galois, and compute its Galois closure
$\mathbb{Q} \subseteq F$. Prove that $\mathrm{Aut}_{\mathbb{Q}}(F) \cong D_8$. (Use Exercise IV.2.16.)

6.7. Let $p > 0$ be prime, and let $d \mid e$ be positive integers, so that there is an
extension $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^e}$. Prove that $\mathrm{Aut}_{\mathbb{F}_{p^d}}(\mathbb{F}_{p^e})$ is cyclic, and describe a generator of
this group. (This generalizes Proposition 5.8; use Galois theory.)

6.8. Let $k \subseteq F$ be a Galois extension of degree $n$, and let $E$ be an intermediate
field. Assume that $[E : k]$ is the smallest prime dividing $n$. Prove $k \subseteq E$ is Galois.

6.9. Let $k \subseteq F$ be a Galois extension of degree 75. Prove that there exists an
intermediate field $E$, with $k \subsetneq E \subsetneq F$, such that the extension $k \subseteq E$ is Galois.

6.10. Let $k \subseteq F$ be a Galois extension and $E$ an intermediate field. Prove that the
normalizer of $\mathrm{Aut}_E(F)$ in $\mathrm{Aut}_k(F)$ is the set of $\sigma \in \mathrm{Aut}_k(F)$ such that $\sigma(E) \subseteq E$.

Use this to give an alternative proof of the fact that $E$ is Galois over $k$ if and
only if $\mathrm{Aut}_E(F)$ is normal in $\mathrm{Aut}_k(F)$.

6.11. Let $k \subseteq E$ and $E \subseteq F$ be Galois extensions.

• Find an example showing that $k \subseteq F$ is not necessarily Galois.

• Prove that if every $\sigma \in \mathrm{Aut}_k(E)$ is the restriction of an element of $\mathrm{Aut}_k(F)$,
then $k \subseteq F$ is Galois.

6.12. Find two algebraic extensions $k \subseteq F$, $k \subseteq K$ and embeddings $F \subseteq \overline{k}$,
$\sigma_1 : K \subseteq \overline{k}$, $\sigma_2 : K \subseteq \overline{k}$ extending $k \subseteq \overline{k}$, such that the composites $F\sigma_1(K)$,
$F\sigma_2(K)$ are not isomorphic.

Prove that no such example exists if $F$ and $K$ are Galois over $k$.

6.13. ▷ Let $k \subseteq F$ and $k \subseteq K$ be Galois extensions, and assume $F$ and $K$ are
subfields of a larger field. Prove that $k \subseteq FK$ and $k \subseteq F \cap K$ are both Galois
extensions. [§6.4, §7.4]

6.14. ▷ Let $k \subseteq F$ be a field extension, and let $\varphi_1, \dots, \varphi_m \in \mathrm{Aut}_k(F)$ be pairwise
distinct automorphisms. Prove that $\varphi_1, \dots, \varphi_m$ are linearly independent over $F$.
(Recycle/adapt the hint for Exercise VI.6.15.) [§6.4, 6.16, §IX.7.6]

6.15. Let $k \subseteq F$ be a Galois extension, and let $\alpha \in F$. Prove that

$$N_{k \subseteq F}(\alpha) = \prod_{\sigma \in \mathrm{Aut}_k(F)} \sigma(\alpha), \qquad \mathrm{tr}_{k \subseteq F}(\alpha) = \sum_{\sigma \in \mathrm{Aut}_k(F)} \sigma(\alpha).$$

(Exercise 4.19.)

6.16. ▷ Let $k \subseteq F$ be a cyclic Galois extension of degree $d$, and let $\varphi$ be a generator
of $\mathrm{Aut}_k(F)$. Let $\alpha \in F$ be an element such that $N_{k \subseteq F}(\alpha) = 1$.

• Prove that the automorphisms $\mathrm{id}_F, \varphi, \dots, \varphi^{d-1}$ are linearly independent over $F$.
(Exercise 6.14.)

• Prove that there exists a $\gamma \in F$ such that

$$\beta := \gamma + \alpha\varphi(\gamma) + \alpha\varphi(\alpha)\varphi^2(\gamma) + \cdots + \alpha\varphi(\alpha) \cdots \varphi^{d-2}(\alpha)\varphi^{d-1}(\gamma) \ne 0.$$

• Prove that $\alpha\varphi(\alpha)\varphi^2(\alpha) \cdots \varphi^{d-1}(\alpha)\varphi^d(\gamma) = \gamma$, and deduce that $\alpha = \beta/\varphi(\beta)$.

Together with the result of Exercise 4.20, the conclusion is that an element $\alpha$ of
a cyclic Galois extension as above has norm 1 if and only if there exists a $\beta$ such
that $\alpha = \beta/\varphi(\beta)$.

This is Hilbert's theorem 90 (the 90-th theorem in Hilbert's Zahlbericht, a report
on the state of number theory at the end of the nineteenth century commissioned
by the German Mathematical Society). [6.17, §IX.7.6, IX.7.18]

6.17. Exercise 6.16 should remind the reader of the proof of Proposition 6.19, and
for good reasons. Assume that $k \subseteq F$ is a cyclic Galois extension of degree $m$ and
that $k$ contains a primitive $m$-th root $\zeta$ of 1. Use Hilbert’s theorem 90 to prove
that $F = k(\delta)$ for a $\delta$ such that $\delta^m \in k$, thereby recovering one direction of the
statement of Proposition 6.19. (Hint: What is $N_{k \subseteq F}(\zeta^{-1})$?)

The case of Hilbert’s theorem 90 needed here is due to Kummer.

6.18. Use Hilbert’s theorem 90 to find all rational roots $a$, $b$ of the equation $a^2 + b^2 d = 1$,
where $d$ is a positive integer that is not a square. (Hint: $a^2 + b^2 d$ equals
$N_{\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{-d})}(a + b\sqrt{-d})$; cf. Exercise 1.12.)

6.19. $\neg$ Let $k \subseteq F$ be a cyclic Galois extension of degree $d$, and let $\varphi$ be a generator
of $\mathrm{Aut}_k(F)$. Let $\alpha \in F$ be an element such that $\mathrm{tr}_{k \subseteq F}(\alpha) = 0$. Prove an ‘additive
version’ of Hilbert’s theorem 90:

• Prove that there exists a $\gamma \in F$ such that $\mathrm{tr}_{k \subseteq F}(\gamma) \neq 0$.

• Stare at the expression

\[ \alpha\varphi(\gamma) + (\alpha + \varphi(\alpha))\varphi^2(\gamma) + \cdots + (\alpha + \varphi(\alpha) + \cdots + \varphi^{d-2}(\alpha))\varphi^{d-1}(\gamma). \]

• Prove that there exists a $\beta \in F$ such that $\alpha = \beta - \varphi(\beta)$.

Together with Exercise 4.20, this says that an element $\alpha$ of a cyclic Galois extension
as above has trace 0 if and only if there exists a $\beta$ such that $\alpha = \beta - \varphi(\beta)$. For an
application, see Exercise 7.15. [7.15]


7. Short march through applications of Galois theory

Galois theory is a pervasive tool, with wide-ranging applications. We have neither 
the competence nor room in this book for an adequate survey of applications of the 
theory, but even the few examples reviewed in this section should suffice to convey 
its considerable power. 

7.1. Fundamental theorem of algebra. Theorem 6.15 and a little group theory
are essentially all that is needed$^{26}$ to give an algebraic proof of the fundamental
theorem of algebra, promised back in §V.5.3.

Theorem 7.1. $\mathbb{C}$ is algebraically closed.

Proof. Let $f(x) \in \mathbb{C}[x]$ be a nonconstant polynomial; we have to prove that $f(x)$
has roots in $\mathbb{C}$. Note that if $f(x)$ has no roots in $\mathbb{C}$, then neither does
$f(x)\overline{f}(x) \in \mathbb{R}[x]$ (where $\overline{f}$ is obtained from $f$ by conjugating its coefficients);
that is, we may assume that $f(x)$ has real coefficients. Let $F$ be a splitting
field for $f(x)$ over $\mathbb{R}$; embed $F$ in an algebraic closure of $\mathbb{R}$ (we ‘don’t know yet’
that $\mathbb{C}$ is algebraically closed!), and consider the extension

\[ \mathbb{R} \subseteq F(i). \]

This extension is Galois: it is the splitting field of the square-free part$^{27}$ of
$f(x)(x^2 + 1)$. Let $G = \mathrm{Aut}_{\mathbb{R}}(F(i))$.

$^{26}$ We are cheating a little, actually, since the intermediate value theorem will also be used
behind the scenes. This is not too surprising: the completeness of $\mathbb{R}$ must enter the proof somewhere.

$^{27}$ That is, take each irreducible factor of the polynomial with a power of exactly 1. Over a
perfect field, square-free polynomials are clearly separable.

By Sylow I (Theorem IV.2.5), $G$ has a 2-Sylow subgroup $H$. The index $[G : H]$
is odd, so $H$ corresponds to a finite, separable extension $\mathbb{R} \subseteq E$ with $[E : \mathbb{R}]$ odd.
However, every polynomial of odd degree over $\mathbb{R}$ has a real root (Exercise V.5.17,
which only uses elementary calculus), and it follows that $\mathbb{R} \subseteq E$ is trivial
(Exercise 5.23).

Therefore $[G : H] = 1$, proving that $G$ is a 2-group. Since $\mathbb{C} = \mathbb{R}(i) \subseteq F(i)$,
the Galois group of the (Galois) extension

\[ \mathbb{C} \subseteq F(i) \]

is a subgroup of $G$; therefore it is a 2-group: $|\mathrm{Aut}_{\mathbb{C}}(F(i))| = 2^n$ for some $n \ge 0$.

Now recall (Proposition IV.2.6) that every group of order $p^n$, with $p$ prime,
contains subgroups of order $p^m$ for all $0 \le m \le n$. In particular, if $n \ge 1$, then
$\mathrm{Aut}_{\mathbb{C}}(F(i))$ contains a subgroup of order $2^{n-1}$, which corresponds to a quadratic
extension of $\mathbb{C}$ via the Galois correspondence. However, there are no quadratic
extensions of $\mathbb{C}$, because there are no irreducible quadratic polynomials in $\mathbb{C}[x]$
(Exercise 5.23 again).

This contradiction shows that $n = 0$; that is, $\mathrm{Aut}_{\mathbb{C}}(F(i))$ is necessarily trivial.
This proves that $F(i) = \mathbb{C}$ and in particular that $\mathbb{C}$ contains the roots of $f(x)$. □

The argument can be simplified further, bypassing the use of Sylow’s theorem 
at the price of a slight increase in length. But why should we attempt to bypass 
Sylow’s theorem, after the effort put into proving it in Chapter IV? 

7.2. Constructibility of regular n-gons. We return one last time to the issue
of constructing regular $n$-gons. Simpler considerations have given us constraints
on $n$ that must necessarily be fulfilled for the regular $n$-gon to be constructible: if
$n = p$ is prime, we have found that $p - 1$ must be a power of 2 (§3.3); this was
upgraded for any $n$ to the condition that $\phi(n)$ is a power of 2, in §5.2 (and an
even more explicit condition is given in Exercise 5.19). But these are all ‘negative’
results: we have not as yet proved that if $\phi(n)$ is a power of 2, then the regular
$n$-gon is constructible. This is where Galois theory comes into the picture.

Proposition 7.2. Let $k \subseteq F$ be a Galois extension, and assume $[F : k] = p^r$ for
some prime $p$ and $r \ge 0$. Then there exist intermediate fields

\[ k = E_0 \subseteq E_1 \subseteq E_2 \subseteq \cdots \subseteq E_r = F \]

such that $[E_i : E_{i-1}] = p$ for $i = 1, \dots, r$.

Proof. As the Galois correspondence is bijective for Galois extensions
(Theorem 6.9, part (5)), this statement follows immediately from the fact that a group
of order $p^r$, with $p$ prime, has a complete series of $p$-subgroups; cf. for example the
discussion following the statement of Theorem IV.2.8. □

Theorem 7.3. The regular $n$-gon is constructible by straightedge and compass if
and only if $\phi(n)$ is a power of 2.

Proof. As recalled above, we have already established the $\Longrightarrow$ direction.

For the converse, assume $\phi(n) = 2^r$ for some $r$. The extension $\mathbb{Q} \subseteq \mathbb{Q}(\zeta_n)$
is Galois (it is the splitting field of $\Phi_n(x)$), of order $[\mathbb{Q}(\zeta_n) : \mathbb{Q}] = \phi(n) = 2^r$
(Proposition 5.14). Proposition 7.2 shows that the condition given in Theorem 3.4
is satisfied and therefore that $\zeta_n$ is constructible, as needed. □
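
The criterion of Theorem 7.3 is easy to test by machine. The following is a minimal Python sketch, not part of the original text: the function names `euler_phi`, `is_power_of_two`, and `constructible_ngons` are hypothetical, chosen here only for illustration. It lists the values of $n$ in a given range for which $\phi(n)$ is a power of 2.

```python
def euler_phi(n):
    """Euler's totient: the number of 1 <= k <= n with gcd(k, n) = 1."""
    result, m, p = n, n, 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p  # multiply result by (1 - 1/p)
        p += 1
    if m > 1:  # one prime factor larger than sqrt(n) may remain
        result -= result // m
    return result


def is_power_of_two(m):
    """True if m = 2^r for some r >= 0."""
    return m >= 1 and m & (m - 1) == 0


def constructible_ngons(limit):
    """All 3 <= n <= limit such that phi(n) is a power of 2 (Theorem 7.3)."""
    return [n for n in range(3, limit + 1) if is_power_of_two(euler_phi(n))]


print(constructible_ngons(20))
```

The values 7, 9, 11, 13, 14, 18, 19 are excluded in this range, in agreement with the ‘negative’ results recalled above.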


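For example, $\phi(17) = 16 = 2^4$; the Galois group of $\mathbb{Q}(\zeta_{17})$ over $\mathbb{Q}$ is isomorphic
to $(\mathbb{Z}/17\mathbb{Z})^*$ (by Proposition 5.16), and therefore (Theorem IV.6.10) it is cyclic, of
order 16. A generator of $(\mathbb{Z}/17\mathbb{Z})^*$ is $[6]_{17}$; it follows that $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_{17}))$ is generated
by the automorphism $\sigma$ defined by $\zeta_{17} \mapsto \zeta_{17}^6$. The sequence of subgroups

\[ \mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_{17})) = \langle\sigma\rangle \supsetneq \langle\sigma^2\rangle \supsetneq \langle\sigma^4\rangle \supsetneq \langle\sigma^8\rangle \supsetneq \{e\} \]

corresponds via the Galois correspondence to a sequence

\[ \mathbb{Q} \subsetneq \mathbb{Q}(\delta_1) \subsetneq \mathbb{Q}(\delta_2) \subsetneq \mathbb{Q}(\delta_3) \subsetneq \mathbb{Q}(\zeta_{17}). \]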

It is instructive to apply what we now know to see how one may determine a
generator $\delta_1$ for the first extension, that is, the fixed field of $\sigma^2$. Let $\zeta = \zeta_{17}$. As
a $\mathbb{Q}$-vector space, $\mathbb{Q}(\zeta)$ is generated by $\zeta, \zeta^2, \dots, \zeta^{16}$. To find the fixed field of $\sigma^2$,
write out a general element of $\mathbb{Q}(\zeta)$:

\[ a_1\zeta + a_2\zeta^2 + a_3\zeta^3 + a_4\zeta^4 + a_5\zeta^5 + a_6\zeta^6 + a_7\zeta^7 + a_8\zeta^8 + a_9\zeta^9 \]
\[ {} + a_{10}\zeta^{10} + a_{11}\zeta^{11} + a_{12}\zeta^{12} + a_{13}\zeta^{13} + a_{14}\zeta^{14} + a_{15}\zeta^{15} + a_{16}\zeta^{16} \]

and apply $\sigma^2 : \zeta \mapsto (\zeta^6)^6 = \zeta^{36} = \zeta^2$:

\[ a_1\zeta^2 + a_2\zeta^4 + a_3\zeta^6 + a_4\zeta^8 + a_5\zeta^{10} + a_6\zeta^{12} + a_7\zeta^{14} + a_8\zeta^{16} + a_9\zeta \]
\[ {} + a_{10}\zeta^3 + a_{11}\zeta^5 + a_{12}\zeta^7 + a_{13}\zeta^9 + a_{14}\zeta^{11} + a_{15}\zeta^{13} + a_{16}\zeta^{15}. \]

The element is fixed if and only if

\[ a_1 = a_2 = a_4 = a_8 = a_{16} = a_{15} = a_{13} = a_9, \]
\[ a_3 = a_6 = a_{12} = a_7 = a_{14} = a_{11} = a_5 = a_{10}, \]

and it follows that the fixed field of $\sigma^2$ is generated by

\[ \delta_1 = \zeta + \zeta^2 + \zeta^4 + \zeta^8 + \zeta^{16} + \zeta^{15} + \zeta^{13} + \zeta^9 \]

and by $\zeta^3 + \zeta^6 + \zeta^{12} + \zeta^7 + \zeta^{14} + \zeta^{11} + \zeta^5 + \zeta^{10} = -1 - \delta_1$ (remember that $\zeta$ is a
root of $\Phi_{17}(x) = x^{16} + \cdots + x + 1$). Pictorially,

[Figure: the 17th roots of 1 on the unit circle, marked by black and white dots.]

The sum of the roots marked by black dots is $\delta_1$; the white roots add up to $-1 - \delta_1$.
The theory tells us that $\delta_1$ has degree 2 over $\mathbb{Q}$, so $\delta_1^2$ must be a rational combination
of 1 and $\delta_1$. Indeed, pencil and paper and a bit of patience (and the relation $\zeta^{17} = 1$)
give

\[ \delta_1^2 = (\zeta + \zeta^2 + \zeta^4 + \zeta^8 + \zeta^{16} + \zeta^{15} + \zeta^{13} + \zeta^9)^2 \]
\[ = 3(\zeta + \zeta^2 + \zeta^4 + \zeta^8 + \zeta^{16} + \zeta^{15} + \zeta^{13} + \zeta^9) \]
\[ {} + 4(\zeta^3 + \zeta^6 + \zeta^{12} + \zeta^7 + \zeta^{14} + \zeta^{11} + \zeta^5 + \zeta^{10}) + 8 \]
\[ = 3\delta_1 + 4(-1 - \delta_1) + 8 \]
\[ = -\delta_1 + 4. \]

Thus, we find that $\delta_1^2 + \delta_1 - 4 = 0$. Solving for $\delta_1$ (note that $\delta_1 > 0$, as the black
dots to the right of 0 outweigh those to the left),

\[ \delta_1 = \frac{-1 + \sqrt{17}}{2}. \]

So we could easily construct $\delta_1$. The same procedure applied to the other extensions
may be used to produce $\delta_2$, $\delta_3$, and finally $\zeta_{17}$, whose real part is the complicated
expression given at the end of §3.3.
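
The ‘pencil and paper’ computation for the 17-gon can be double-checked mechanically. The sketch below is an illustration added here, not the book's method; the names `orbit`, `black`, `white`, and `counts` are hypothetical. It computes the two orbits of exponents under $\zeta \mapsto \zeta^2$, expands the square of the first period exactly by counting exponents modulo 17, and compares the period numerically with $(-1 + \sqrt{17})/2$.

```python
import cmath
import math
from collections import Counter

P = 17
G = 6  # generator of (Z/17Z)^*, as in the text


def orbit(start, step, modulus=P):
    """Orbit of `start` under repeated multiplication by `step` mod `modulus`."""
    seen, e = [], start % modulus
    while e not in seen:
        seen.append(e)
        e = (e * step) % modulus
    return seen


# sigma^2 sends zeta to zeta^(6*6 mod 17) = zeta^2
black = orbit(1, (G * G) % P)  # exponents appearing in delta_1
white = orbit(3, (G * G) % P)  # exponents appearing in -1 - delta_1

# Exact expansion of delta_1^2: each product zeta^a * zeta^b is zeta^((a+b) mod 17).
counts = Counter((a + b) % P for a in black for b in black)

# Floating-point value of delta_1 for comparison with the closed formula.
zeta = cmath.exp(2j * math.pi / P)
delta1 = sum(zeta ** e for e in black)

print(black, white)
print(counts[0], {counts[e] for e in black}, {counts[e] for e in white})
print(delta1.real, (-1 + math.sqrt(17)) / 2)
```

The multiplicities found by `counts` are the coefficients 8, 3, and 4 of the expansion above: the exponent 0 occurs 8 times, each black exponent 3 times, and each white exponent 4 times.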


7.3. Fundamental theorem on symmetric functions. Let $t_1, \dots, t_n$ be
indeterminates. The polynomial

\[ P_n(x) := (x - t_1) \cdots (x - t_n) \in \mathbb{Z}[t_1, \dots, t_n][x] \]

is universal in the sense that every polynomial of degree $n$ with coefficients in (say)
an integral domain may be obtained from $P_n(x)$ by suitably specifying $t_1, \dots, t_n$,
possibly in a larger ring (Exercise 7.2). The coefficients of the expansion

\[ P_n(x) = x^n - s_1(t_1, \dots, t_n)x^{n-1} + \cdots + (-1)^n s_n(t_1, \dots, t_n) \]

are (up to sign) the elementary symmetric functions of $t_1, \dots, t_n$. For example, for
$n = 3$,

\[ s_1(t_1, t_2, t_3) = t_1 + t_2 + t_3, \]
\[ s_2(t_1, t_2, t_3) = t_1 t_2 + t_1 t_3 + t_2 t_3, \]
\[ s_3(t_1, t_2, t_3) = t_1 t_2 t_3. \]
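
As a sanity check on the expansion of $P_n(x)$, here is a small Python sketch evaluating the elementary symmetric functions on integer values of the $t_i$. It is an illustration only; the function names `elementary_symmetric`, `P_factored`, and `P_expanded` are hypothetical and not taken from the text.

```python
from itertools import combinations
from math import prod


def elementary_symmetric(ts):
    """[s_1, ..., s_n], where s_i is the sum of all products of i distinct entries of ts."""
    n = len(ts)
    return [sum(prod(c) for c in combinations(ts, i)) for i in range(1, n + 1)]


def P_factored(ts, x):
    """(x - t_1) ... (x - t_n), evaluated at x."""
    return prod(x - t for t in ts)


def P_expanded(ts, x):
    """x^n - s_1 x^(n-1) + ... + (-1)^n s_n, evaluated at x."""
    n = len(ts)
    s = [1] + elementary_symmetric(ts)  # s[0] = 1 is the leading coefficient
    return sum((-1) ** i * s[i] * x ** (n - i) for i in range(n + 1))


ts = (2, 3, 5)
print(elementary_symmetric(ts))
print(all(P_factored(ts, x) == P_expanded(ts, x) for x in range(-5, 6)))
```

Permuting the entries of `ts` does not change the output of `elementary_symmetric`, which is the symmetry discussed in this subsection.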

The functions $s_i(t_1, \dots, t_n)$ are symmetric in the sense that they are invariant
under permutations of $t_1, \dots, t_n$ (because $P_n(x)$ is invariant); they are elementary
by virtue of the following important result, which has a particularly simple proof
thanks to Galois theory.

Theorem 7.4 (Fundamental theorem on symmetric functions). Let $K$ be a field,
and let $\varphi \in K(t_1, \dots, t_n)$. Then $\varphi$ is symmetric if and only if it is a rational
function (with coefficients in $K$) of the elementary symmetric functions $s_1, \dots, s_n$.

Here we are viewing the $s_i$’s as elements of $K[t_1, \dots, t_n]$, which we can do
unambiguously since $\mathbb{Z}$ is initial in Ring.

Proof. Let $F = K(t_1, \dots, t_n)$, and let $k = K(s_1, \dots, s_n)$ be the subfield generated
by the elementary symmetric functions over $K$.

Then $F$ is a splitting field of the separable polynomial $P_n(x)$ over $k$. In particular
$k \subseteq F$ is a Galois extension, and

\[ |\mathrm{Aut}_k(F)| = [F : k] \le n! \]

by Lemma 4.2. The symmetric group $S_n$ acts (faithfully) on $F$ by permuting
$t_1, \dots, t_n$, and this action is the identity on $k = K(s_1, \dots, s_n)$ since each $s_i$ is
symmetric. Thus $S_n$ may be identified with a subgroup of $\mathrm{Aut}_k(F)$; but $|S_n| = n!$,
so it follows that $\mathrm{Aut}_k(F) = S_n$.

Since $k \subseteq F$ is Galois, we obtain $k = F^{S_n}$. This says precisely that if
$\varphi \in K(t_1, \dots, t_n)$, then $\varphi$ is invariant under the $S_n$ action on $t_1, \dots, t_n$ if and only if
$\varphi \in k = K(s_1, \dots, s_n)$, which is the statement. □

It is not difficult to strengthen the result of Theorem 7.4 and prove that every 
symmetric polynomial is in fact a polynomial in the elementary symmetric functions. 

Note that the argument given in the proof amounts to the following result, 
which should be recorded for individual attention: 

Lemma 7.5. Let $K$ be a field, $t_1, \dots, t_n$ indeterminates, and let $s_1, \dots, s_n$ be the
elementary symmetric functions on $t_1, \dots, t_n$. Then the extension

\[ K(s_1, \dots, s_n) \subseteq K(t_1, \dots, t_n) \]

is Galois, with Galois group $S_n$.

This result and Cayley’s theorem (Theorem II.9.5) immediately yield the
following:

Corollary 7.6. Let $G$ be a finite group. Then there exists a Galois extension $k \subseteq F$
such that $\mathrm{Aut}_k(F) \cong G$.

Proof. Exercise 7.4. □


It is not known whether $k$ may be chosen to be $\mathbb{Q}$ in Corollary 7.6. The proof
hinted at above realizes $k$ as a rather large field, and it is not at all clear how one
may ‘descend’ from this field to $\mathbb{Q}$. This is the inverse Galois problem, a subject
of current research. The reader can appreciate the difficulty of this problem by
noting that even realizing $S_n$ as a Galois group over $\mathbb{Q}$ (for all $n$) is not so easy,
although it is true that for every $n$ there are infinitely many polynomials of degree $n$
in $\mathbb{Z}[x]$ whose splitting fields have Galois group $S_n$ over $\mathbb{Q}$ (see Example 7.20 and
Exercise 7.11 for the case $n = p$ prime). If you want a memorable example to carry
with you, Schur showed (1931) that the splitting field of the ‘truncated exponential’

\[ 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!} \]

has Galois group $S_n$ over $\mathbb{Q}$ for $n \not\equiv 0 \pmod 4$ (and $A_n$ for $n \equiv 0 \pmod 4$).

It is known (Shafarevich, 1954) that every finite solvable group is the Galois
group of a Galois extension over $\mathbb{Q}$.

The inverse Galois problem may be rephrased as asking whether every finite
group is a quotient of the Galois group of $\overline{\mathbb{Q}}$ over $\mathbb{Q}$. We have already mentioned
that this object (which may be interpreted as the ‘fundamental group of $\mathbb{Q}$’; cf.
Remark 6.16) is mysterious and extremely interesting$^{28}$.

Going back to Lemma 7.5, the alternating group $A_n$ has index 2 in $S_n$, so
it corresponds to a quadratic extension of $K(s_1, \dots, s_n)$ via the Galois
correspondence. It is easy to determine a generator of this extension: it must be a function
of $t_1, \dots, t_n$ which is invariant under the action of all even permutations and not
invariant under the action of odd permutations. We have used such a function
precisely to define even/odd permutations, in §IV.4.3:

\[ \Delta = \prod_{1 \le i < j \le n} (t_i - t_j). \]

To see that $\Delta$ has degree 2 over $K(s_1, \dots, s_n)$, simply note that the discriminant

\[ D = \Delta^2 = \prod_{1 \le i < j \le n} (t_i - t_j)^2 \]

is invariant under the action of all permutations, and therefore it belongs to the
field $K(s_1, \dots, s_n)$ by Theorem 7.4. This proves

Corollary 7.7. With notation as above, the extension

\[ K(s_1, \dots, s_n)(\sqrt{D}) \subseteq K(t_1, \dots, t_n) \]

is Galois, with Galois group $A_n$.


$^{28}$ According to J. S. Milne, the ‘most interesting object in mathematics’ is the étale
fundamental group of the projective line over $\mathbb{Q}$ with the three points $0, 1, \infty$ removed. This is an
extension of the Galois group of $\overline{\mathbb{Q}}$ over $\mathbb{Q}$.

7.4. Solvability of polynomial equations by radicals. This is possibly the
most famous elementary application of Galois theory and one that is very close to
Galois’ own original motivation.

Quadratic polynomial equations

\[ x^2 + bx + c = 0 \]

are famously solved (in$^{29}$ characteristic $\neq 2$) by the quadratic formula:

\[ x = \frac{-b \pm \sqrt{b^2 - 4c}}{2}. \]

Analogous, but more complicated, formulas exist for equations of degree 3 and 4
(Tartaglia/Cardano/Ferrari, around 1540; this generated a priority dispute which
degenerated into base personal attacks). Equations of degree 5 and higher resisted
the best efforts of mathematicians for centuries. The most ambitious goal, in line
with what occurs for degrees 2, 3, and 4, would be to produce a formula for the
solutions to the ‘general’ polynomial equation

\[ x^n + a_{n-1}x^{n-1} + \cdots + a_0 = 0 \]

in terms of the coefficients $a_i$ and basic operations such as taking roots. This would
be a solution ‘by radicals’ of the equation.

Well, such a formula simply does not exist: 

Theorem 7.8. The general polynomial equation of degree 5 or higher admits no 
solution by radicals. 

We are being intentionally vague about the ground field in which the problem
is posed; a ‘formula’ such as the quadratic formula ought to hold over any field
in which it makes sense; for example, the field should not have characteristic 2
in the case of the quadratic formula. An easy way to avoid any such snag is
to work in characteristic zero: this has the main technical advantage of ensuring
automatic separability of polynomials $x^m - 1$, whose roots define the ‘radicals’;
cf. Proposition 6.19. Therefore, we will assume in what follows that the ground
field has characteristic 0.

The ‘general polynomial’ is the ‘universal’ polynomial $P_n(x)$ introduced in §7.3,
taken over a chosen ground field $K$: the general polynomial

\[ x^n - s_1 x^{n-1} + \cdots + (-1)^n s_n \in k[x], \]

with $k = K(s_1, \dots, s_n)$ and $s_i$ elementary symmetric functions in (‘indeterminate’)
roots $t_1, \dots, t_n$. It is in fact no harder to deal with the more general case of arbitrary
irreducible polynomials $f(x) \in k[x]$, over any field $k$ (of characteristic zero); this is
what we will do.

A formula by radicals expresses a root $t_i$ of $f(x)$ as an element of an extension
obtained from $k$ in the following fashion:

\[ k \subseteq k(\delta_1) \subseteq k(\delta_1, \delta_2) \subseteq \cdots \subseteq k(\delta_1, \dots, \delta_r) \]

where for all $i = 1, \dots, r$ we have $\delta_i^{m_i} \in k(\delta_1, \dots, \delta_{i-1})$ for some exponent $m_i$.

$^{29}$ In characteristic 2, one can give a formula in terms of ‘2-roots’, which are roots of the
Artin-Schreier equation, and avoid the problematic division by 2; see Exercise 7.16.

Definition 7.9. Extensions of this type are called radical extensions. $\lrcorner$

Some facts about radical extensions are immediate from the definition: for
example, if $k \subseteq F$ and $k \subseteq K$ are two radical extensions and $F$, $K$ are subfields of
a larger field, then $k \subseteq FK$ is radical (Exercise 7.5). Also,

Lemma 7.10. Every separable radical extension is contained in a Galois radical
extension.

Proof. Let $k \subseteq F$ be a separable radical extension. In particular $k \subseteq F$ is finite
and separable, so $F = k(\alpha)$ for some $\alpha \in F$ (Proposition 5.19). Let $p(x)$ be the
minimal polynomial of $\alpha$ over $k$. The splitting field $L$ of $p(x)$ over $k$ may be obtained
as the composite of $\sigma(F)$ as $\sigma$ ranges over the embeddings of $F$ in $\overline{k}$, extending $\mathrm{id}_k$:
indeed, this composite is generated over $k$ by all $\sigma(\alpha)$, which range over all roots of
$p(x)$. As a composite of radical extensions, $L$ is radical over $k$. As a splitting field
of a separable polynomial, $L$ is Galois over $k$ (Theorem 6.9, part (1)), and we are
done. □

Galois theory will allow us to relate solvability by radicals of a polynomial $f(x)$
with a precise statement on the splitting field of $f(x)$. Here is a first formalization
of this connection:

Lemma 7.11. Let $k$ be a field of characteristic 0, and let $f(x) \in k[x]$ be an
irreducible polynomial. Then there exists a formula solving $f(x)$ by radicals if and only
if the splitting field of $f(x)$ is contained in a Galois radical extension.

Proof. If the splitting field of $f(x)$ is contained in a radical extension (Galois or
not), then the roots may be written as combinations of field operations and radicals,
as needed.

For the converse, assume $f(x)$ is solvable by radicals. A formula for a root $t_1$
can be turned into a formula for any other root $t_i$ by applying an element $\sigma$ of
(for example) $\mathrm{Aut}_k(\overline{k})$ sending $t_1$ to $t_i$ (such an automorphism exists since $f(x)$ is
irreducible). Thus, if a formula exists for one root, then a formula exists for all
roots. The composite of all the corresponding radical extensions gives one, possibly
larger, radical extension of $k$ containing all roots of $f(x)$. Therefore, the splitting
field of $f(x)$ is contained in a radical extension of $k$. By Lemma 7.10, this extension
may be assumed to be Galois, as needed. □

In a nutshell, we will understand solvability of polynomial equations by radicals 
if we understand radical Galois extensions. As demanded by the general philosophy 
underlying Galois theory, we should aim to characterize these extensions in terms 
of their Galois groups. It turns out that the right condition (modulo technical 
considerations) is that the Galois group should be solvable. 

Definition 7.12. A Galois extension $k \subseteq F$ is solvable if $\mathrm{Aut}_k(F)$ is a solvable
group. $\lrcorner$

The reader may want to revisit §IV.3.3 now, for a reminder on solvability in
group theory. (We are approaching an explanation for the ‘solvable’ terminology!)

Just as radical extensions decompose as sequences of cyclic extensions, solvable
groups have cyclic composition factors (Proposition IV.3.11). We will say$^{30}$ that
a radical, resp., solvable, extension $k \subseteq F$ ‘has enough roots of 1’ if $k$ contains a
primitive $M$-th root of 1 for a common multiple $M$ of the order of the corresponding
cyclic factors.

Lemma 7.13. Let $k \subseteq F$ be a Galois extension, with $\mathrm{char}\, k = 0$. Provided it has
enough roots of 1, $k \subseteq F$ is radical if and only if it is solvable.

Proof. Modulo the fundamental theorem of Galois theory, this is a straightforward
generalization of Proposition 6.19.

Indeed, assume that $k \subseteq F$ is radical:

\[ k \subseteq k(\delta_1) \subseteq \cdots \subseteq k(\delta_1, \dots, \delta_r) = F \]

with $\delta_i^{m_i} \in k(\delta_1, \dots, \delta_{i-1})$. By hypothesis, $k$ contains primitive $m_i$-th roots of 1 for
all $i$. Consider the corresponding series of subgroups of $G = \mathrm{Aut}_k(F)$:

\[ G = G_0 \supseteq G_1 \supseteq \cdots \supseteq G_r = \{e\}, \]

with $G_i = \mathrm{Aut}_{k(\delta_1, \dots, \delta_i)}(F)$. Note that $F$ is Galois over each intermediate field.
Each extension

\[ k(\delta_1, \dots, \delta_{i-1}) \subseteq k(\delta_1, \dots, \delta_i) \]

satisfies the hypothesis of Proposition 6.19, and it follows that it is Galois and cyclic.
By the fundamental theorem (Theorem 6.15) each $G_i$ is normal in the previous
$G_{i-1}$, with cyclic quotient, and it follows that $G$ is solvable (Proposition IV.3.11 (ii)).

The same argument in reverse, using the converse implication in
Proposition 6.19, proves that solvable extensions with enough roots of 1 are radical. □

How do we account for the possible absence of the needed roots of 1?
Surprisingly, this affects the statement in a rather minor way.

Proposition 7.14. Let $k$ be a field of characteristic 0, and let $k \subseteq F$ be a Galois
extension. Then $k \subseteq F$ is solvable if and only if it is contained in a Galois radical
extension.

The characteristic 0 in Proposition 7.14 is an overshoot; as in Proposition 6.19,
it would suffice to require that $\mathrm{char}\, k$ does not divide the relevant degrees.

Proof. Assume $\mathrm{Aut}_k(F)$ is solvable. Let $M$ be a common multiple of the order of
the cyclic quotients, and let $\zeta$ be a primitive $M$-th root of 1 in an algebraic closure
$\overline{k}$ of $k$. The splitting field $k(\zeta)$ of $x^M - 1$ over $k$ is Galois over $k$ ($x^M - 1$ is separable
since $k$ has characteristic 0). By Proposition 6.17, the extension $k(\zeta) \subseteq F(\zeta)$ is
also Galois, with Galois group isomorphic to $\mathrm{Aut}_{k(\zeta) \cap F}(F)$. As $k(\zeta) \cap F$ is Galois
over $k$ (Exercise 6.13), this subgroup is normal, and it follows (Corollary IV.3.13)
that $\mathrm{Aut}_{k(\zeta)}(F(\zeta))$ is solvable.


$^{30}$ Warning! This is nonstandard terminology.

Since $k(\zeta) \subseteq F(\zeta)$ has enough roots of 1, Lemma 7.13 implies that it is radical.
So is $k \subseteq F(\zeta)$, since $k \subseteq k(\zeta)$ is itself trivially radical. This proves that $k \subseteq F$ is
contained in a radical extension.

Conversely, assume that $k \subseteq F$ is Galois and contained in a radical Galois
extension $k \subseteq L$. Let $M$ be a common multiple of the order of cyclic factors in this
extension, and let $\zeta$ be a primitive $M$-th root of 1. The composite $L(\zeta)$ is Galois
and radical over $k(\zeta)$ (Proposition 6.17 and Exercise 7.5); further, it has enough
roots of 1.

By Lemma 7.13, $k(\zeta) \subseteq L(\zeta)$ is solvable. It follows that $k \subseteq L(\zeta)$ is solvable.
Indeed, this extension is Galois (Exercise 6.13); apply the fundamental theorem to

\[ k \subseteq k(\zeta) \subseteq L(\zeta) \]

to establish that $\mathrm{Aut}_k(k(\zeta))$ is isomorphic to the quotient of $\mathrm{Aut}_k(L(\zeta))$ by its
normal subgroup $\mathrm{Aut}_{k(\zeta)}(L(\zeta))$. Both $\mathrm{Aut}_{k(\zeta)}(L(\zeta))$ (as shown above) and $\mathrm{Aut}_k(k(\zeta))$
(an abelian group; cf. Example 6.18) are solvable, and it follows that $\mathrm{Aut}_k(L(\zeta))$
is solvable as promised, by Corollary IV.3.13.

Since $F$ is an intermediate field and Galois over $k$, by the fundamental theorem
$\mathrm{Aut}_k(F)$ is a homomorphic image of $\mathrm{Aut}_k(L(\zeta))$. This shows that $\mathrm{Aut}_k(F)$ is a
homomorphic image of a solvable group; hence it is itself solvable (Corollary IV.3.13
again), and we are done. □

We may have lost sight of where we were heading; but we are ready to reconnect 
with the study of solutions to polynomial equations. 

Definition 7.15. The Galois group $\mathrm{Gal}_k(f(x))$ of a separable polynomial
$f(x) \in k[x]$ is the Galois group of the splitting field of $f(x)$ over $k$. $\lrcorner$

Corollary 7.16. Let $k$ be a field of characteristic 0, and let $f(x) \in k[x]$ be an
irreducible polynomial. Then $f(x)$ is solvable by radicals if and only if its Galois
group is solvable.

Proof. This is an immediate consequence of Lemma 7.11 and Proposition 7.14. □

Corollary 7.16 is called Galois’ criterion. Ruffini (1799) and Abel (1824) had
previously established that general formulas in radicals for the solutions of
equations of degree $\ge 5$ do not exist (that is, Theorem 7.8); but it was Galois who
identified the precise condition given in Corollary 7.16. Of course, Theorem 7.8
follows immediately from Galois’ criterion:

Proof of Theorem 7.8. By Lemma 7.5, the Galois group of the general
polynomial of degree $n$ is $S_n$. The group $S_n$ is not solvable for $n \ge 5$ (Corollary IV.4.21);
hence the statement follows from Corollary 7.16. □
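
The group-theoretic input to this proof can be observed by brute force for small $n$. The following Python sketch is an illustration under stated assumptions, not the argument of Corollary IV.4.21: permutations are represented as tuples, and the helper names `compose`, `inverse`, `generated_subgroup`, `derived_subgroup`, and `derived_series_orders` are hypothetical. It computes the orders of the successive commutator subgroups of $S_n$; the group is solvable precisely when this series reaches the trivial group.

```python
from itertools import permutations


def compose(p, q):
    """Composition of permutations given as tuples: (p o q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def generated_subgroup(gens, identity):
    """Subgroup generated by gens (closure under right multiplication suffices in a finite group)."""
    group = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group


def derived_subgroup(group, identity):
    """Subgroup generated by all commutators a b a^-1 b^-1 with a, b in group."""
    comms = {
        compose(compose(a, b), compose(inverse(a), inverse(b)))
        for a in group
        for b in group
    }
    return generated_subgroup(comms, identity)


def derived_series_orders(n):
    """Orders of S_n, [S_n, S_n], ... until the series stabilizes or becomes trivial."""
    identity = tuple(range(n))
    group = set(permutations(range(n)))
    orders = [len(group)]
    while True:
        nxt = derived_subgroup(group, identity)
        orders.append(len(nxt))
        if len(nxt) == len(group) or len(nxt) == 1:
            return orders
        group = nxt


for n in (3, 4, 5):
    print(n, derived_series_orders(n))
```

For $n = 3, 4$ the series ends at order 1; for $n = 5$ it stabilizes at order 60, the order of $A_5$, so $S_5$ is not solvable.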

In fact, we could now do more: we know that $S_3$ and $S_4$ are solvable (cf.
Exercise IV.3.16): from a composition series with cyclic quotients we could in principle
decompose explicitly the splitting field of general polynomials of degree 3 and 4
as radical extensions and as a consequence recover the Tartaglia/Cardano/Ferrari
formulas for their solutions.

7.5. Galois groups of polynomials. Impressive as Galois’ criterion is, it does
not in itself produce a single polynomial of degree (say) 5 over $\mathbb{Q}$ that is not
solvable by radicals. Finding such polynomials requires a certain amount of extra
work; we will be able to do this by the end of this subsection.

Computing Galois groups of polynomials is in fact a popular sport among
algebraists, excelling at which would require much more information than the reader
will glean here. We will just list a few straightforward observations.

To begin with, recall that an element of $\mathrm{Aut}_k(F)$ must send roots of a
polynomial $f(x) \in k[x]$ to roots of the same polynomial, and if $f(x)$ is irreducible and
factors completely in $F$, then there are automorphisms sending any root of $f(x)$ to
any other root (Proposition 1.5, Corollary 1.7). This elementary observation may
be rephrased as follows:

Lemma 7.17. Let $f(x) \in k[x]$ be a separable irreducible polynomial of degree $n$.
Then $\mathrm{Gal}_k(f(x))$ acts transitively on the set of roots of $f(x)$ in $\overline{k}$. In particular,
$\mathrm{Gal}_k(f(x))$ may be identified with a transitive subgroup of the symmetric group $S_n$.

Of course a subgroup of $S_n$ is transitive if the corresponding action on $\{1, \dots, n\}$
is transitive; the diligent reader has run across this terminology in Exercise IV.4.12.

The reader will easily produce a statement analogous to Lemma 7.17 in case
$f(x)$ is reducible. Of course the action is not transitive in this case, and if the
factors of $f(x)$ have degrees $n_1, \dots, n_r$, then $\mathrm{Gal}_k(f(x))$ is contained in a subgroup
of $S_n$ isomorphic to $S_{n_1} \times \cdots \times S_{n_r}$.

Lemma 7.17 (or its ‘reducible’ variations) already gives some information. For
example, we see that the Galois group of a separable irreducible cubic can only be
$A_3$ or $S_3$, since these are the only transitive subgroups of $S_3$. Similarly, the range
of possibilities for irreducible polynomials of degree 4 is rather restricted: $S_4$, $A_4$,
and isomorphic copies of the dihedral group $D_8$, of $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$, and of $\mathbb{Z}/4\mathbb{Z}$
(Exercise 7.8). One approach to the computation of Galois groups of polynomials
amounts to defining invariants imposing further restrictions, leading to algorithms
deciding which of such possibilities occurs for a given polynomial.

We have already encountered the most important of these invariants: if the
roots of a separable polynomial $f(x) \in k[x]$ in its splitting field are $\alpha_1, \dots, \alpha_n$, the
discriminant of $f(x)$ is the element $D = \Delta^2$, where

\[ \Delta = \prod_{1 \le i < j \le n} (\alpha_i - \alpha_j). \]

Every permutation of the roots fixes $D$, so $D$ must be fixed by the whole Galois
group; therefore, $D \in k$. Odd permutations move $\Delta$ (if $\mathrm{char}\, k \neq 2$), and even
permutations fix it; therefore $\Delta$ is fixed by the Galois group $G$ (in other words,
$\Delta \in k$) if and only if $G \subseteq A_n$. This proves

Lemma 7.18. Let $k$ be a field of characteristic $\neq 2$, and let $f(x) \in k[x]$ be a
separable polynomial, with discriminant $D$. Then the Galois group of $f(x)$ is contained
in the alternating group $A_n$ if and only if $D$ is a square in $k$.

Example 7.19. Lemma 7.18 and a discriminant computation are all that is needed
to compute the Galois group of an irreducible cubic polynomial

\[ f(x) = x^3 + ax^2 + bx + c. \]

It would be futile to try to remember the discriminant

\[ D = a^2 b^2 - 4a^3 c - 4b^3 + 18abc - 27c^2; \]

but one may remember the trick of shifting $x$ by $a/3$ (in characteristic $\neq 3$), with
the effect of killing the coefficient of $x^2$:

\[ f\left(x - \frac{a}{3}\right) = x^3 + px + q \]

for suitable $p$ and $q$. This does not change $D$ (shifting all roots $\alpha_i$ by the same
amount has no effect on the differences $\alpha_i - \alpha_j$), yet

\[ D = -4p^3 - 27q^2 \]

is a little more memorable.

By Lemma 7.18 and the preceding considerations, the Galois group is $A_3$ if $D$
is a square and $S_3$ otherwise. For example, $x^3 - 2$ has Galois group $S_3$ over $\mathbb{Q}$, since
$D = -108$ is not a square in $\mathbb{Q}$; $x^3 - 3x + 1$ has discriminant $81 = 9^2$, and therefore
it has Galois group $A_3 \cong \mathbb{Z}/3\mathbb{Z}$. $\lrcorner$
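
The recipe of Example 7.19 is short enough to automate for monic cubics with integer coefficients. The sketch below is a minimal illustration, not a general algorithm: the names `cubic_discriminant`, `has_integer_root`, and `galois_group_of_cubic` are hypothetical, and irreducibility over $\mathbb{Q}$ is tested through the rational root theorem, which suffices for monic integer cubics.

```python
from math import isqrt


def cubic_discriminant(a, b, c):
    """Discriminant of x^3 + a x^2 + b x + c."""
    return a * a * b * b - 4 * a**3 * c - 4 * b**3 + 18 * a * b * c - 27 * c * c


def has_integer_root(a, b, c):
    """A monic integer cubic is reducible over Q iff it has an integer root dividing c."""
    if c == 0:
        return True  # x = 0 is a root
    for d in range(1, abs(c) + 1):
        if c % d == 0:
            for r in (d, -d):
                if r**3 + a * r * r + b * r + c == 0:
                    return True
    return False


def galois_group_of_cubic(a, b, c):
    """'A3' or 'S3' for an irreducible x^3 + a x^2 + b x + c in Z[x], by Lemma 7.18."""
    if has_integer_root(a, b, c):
        raise ValueError("polynomial is reducible over Q")
    D = cubic_discriminant(a, b, c)
    # An integer is a square in Q exactly when it is a square in Z.
    is_square = D >= 0 and isqrt(D) ** 2 == D
    return "A3" if is_square else "S3"


print(cubic_discriminant(0, 0, -2), galois_group_of_cubic(0, 0, -2))
print(cubic_discriminant(0, -3, 1), galois_group_of_cubic(0, -3, 1))
```

The two calls reproduce the examples $x^3 - 2$ and $x^3 - 3x + 1$ of the text.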

The reader will have no difficulty locating a discussion of the different
possibilities for polynomials of degree 4 and detailed information for higher degree
polynomials. We will just highlight the following simple observation.

Example 7.20. Let $f(x) \in \mathbb{Q}[x]$ be an irreducible polynomial of degree $p$, where
$p$ is prime. Assume that $f(x)$ has $p - 2$ real roots and 2 nonreal, complex roots.
Then the Galois group of $f(x)$ is $S_p$.

Indeed, complex conjugation induces an automorphism of the splitting field and
acts by interchanging the two nonreal roots, so the Galois group $G$, as a subgroup
of $S_p$, contains a transposition. On the other hand, the degree of the splitting field
(and hence $|G|$) is divisible by $p$, because it contains a simple extension of order $p$,
obtained by adjoining any one root to $\mathbb{Q}$. Since $p$ is prime, $G$ contains an element of
order $p$ by Cauchy’s theorem (Theorem IV.2.1); the only elements of order $p$ in $S_p$
are $p$-cycles, so $G$ contains a $p$-cycle. It follows that $G = S_p$, by (a simple variation
of) Exercise IV.4.7.

For example, the Galois group of $f(x) = x^5 - 5x - 1$ over $\mathbb{Q}$ is $S_5$, giving a
concrete example of a quintic that cannot be solved by radicals (Exercise 7.10).

The reader will use this technique to produce polynomials of every prime
degree $p$ in $\mathbb{Z}[x]$ whose Galois group is $S_p$ (Exercise 7.11). $\lrcorner$
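
The hypothesis of Example 7.20 on the number of real roots can be checked exactly with Sturm's theorem, a technique not used in the text and swapped in here only for illustration. The sketch assumes a square-free polynomial with rational coefficients, given as a list of coefficients from highest to lowest degree; the names `poly_rem`, `sturm_chain`, `sign_variations`, and `count_real_roots` are hypothetical.

```python
from fractions import Fraction


def poly_rem(a, b):
    """Remainder of a divided by b (coefficient lists, highest degree first)."""
    a = [Fraction(c) for c in a]
    while len(a) >= len(b):
        factor = a[0] / b[0]
        for i in range(len(b)):
            a[i] -= factor * b[i]
        a.pop(0)  # the leading coefficient is now zero
    while a and a[0] == 0:  # strip remaining leading zeros
        a.pop(0)
    return a


def sturm_chain(f):
    """Sturm chain f, f', -rem(f, f'), ... for a nonconstant polynomial f."""
    n = len(f) - 1
    df = [Fraction(c) * (n - i) for i, c in enumerate(f[:-1])]
    chain = [[Fraction(c) for c in f], df]
    while True:
        r = poly_rem(chain[-2], chain[-1])
        if not r:
            return chain
        chain.append([-c for c in r])


def sign_variations(signs):
    signs = [s for s in signs if s != 0]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)


def count_real_roots(f):
    """Number of distinct real roots, from the signs of the chain at -infinity and +infinity."""
    chain = sturm_chain(f)
    at_plus = [1 if p[0] > 0 else -1 for p in chain]
    at_minus = [s * (-1) ** (len(p) - 1) for s, p in zip(at_plus, chain)]
    return sign_variations(at_minus) - sign_variations(at_plus)


# x^5 - 5x - 1, the quintic mentioned in Example 7.20
print(count_real_roots([1, 0, 0, 0, -5, -1]))
```

For $x^5 - 5x - 1$ the count is $3 = p - 2$, so once irreducibility is established (Exercise 7.10) the example applies with $p = 5$.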

7.6. Abelian groups as Galois groups over Q. Having mentioned the inverse
Galois problem in §7.3, we should point out that the reader is now in the position
of proving that every finite abelian group may be realized as the Galois group of
an extension over $\mathbb{Q}$.

In fact, the reader can prove a much more precise result: every finite abelian
group may be realized as the Galois group over $\mathbb{Q}$ of some intermediate field of the
extension $\mathbb{Q} \subseteq \mathbb{Q}(\zeta_n)$ of $\mathbb{Q}$ in a cyclotomic field. This uses a number-theoretic fact:

For every integer $N$, there are infinitely many primes $p$ such that $p \equiv 1$
mod $N$.

This is a particular case of Dirichlet’s theorem (1837), which states that if $a$, $b$ are
positive integers and $\gcd(a, b) = 1$, then there are infinitely many primes of the form
$a + nb$ with $n > 0$. The particular case $a = 1$, $b = N$ needed here was apparently
already known to Euler and can be proven by elementary means (in fact, there is a
proof using cyclotomic polynomials: see Exercise 5.18). Assuming this fact, argue
as follows:

• By the classification theorem (Theorem IV.6.6), every finite abelian group $G$
is isomorphic to a product of cyclic groups

\[ (\mathbb{Z}/n_1\mathbb{Z}) \times \cdots \times (\mathbb{Z}/n_r\mathbb{Z}). \]

• By Dirichlet’s theorem, we can then choose distinct primes $p_i$ such that $p_i \equiv 1$
mod $n_i$.

• Let $n = p_1 \cdots p_r$; by the Chinese Remainder Theorem (Theorem V.6.1)

\[ (\mathbb{Z}/n\mathbb{Z})^* \cong (\mathbb{Z}/p_1\mathbb{Z})^* \times \cdots \times (\mathbb{Z}/p_r\mathbb{Z})^*, \]

and the $i$-th factor on the right-hand side is cyclic of order $p_i - 1$, a multiple of $n_i$.

• It follows that $(\mathbb{Z}/n\mathbb{Z})^*$ has a subgroup $H$ such that $G \cong (\mathbb{Z}/n\mathbb{Z})^*/H$.

• Since $(\mathbb{Z}/n\mathbb{Z})^* \cong \mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ (Proposition 5.16) and cyclotomic fields are
Galois over $\mathbb{Q}$, $H$ corresponds to an intermediate field $F$, $\mathbb{Q} \subseteq F \subseteq \mathbb{Q}(\zeta_n)$.

• Since $\mathrm{Aut}_{\mathbb{Q}}(\mathbb{Q}(\zeta_n))$ is abelian, $H$ is automatically normal; hence, $\mathbb{Q} \subseteq F$ is
Galois, and $\mathrm{Aut}_{\mathbb{Q}}(F) \cong G$, as needed.

Thus, the ‘inverse Galois problem’ has an easy solution for abelian groups. 
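The first steps of this argument are easy to try numerically. The following Python sketch is only an illustration: the function names `is_prime`, `primes_for_orders`, and `unit_group_order` are our own, and the primes are found by brute-force search rather than by anything resembling a proof of Dirichlet’s theorem. For $G = \mathbb{Z}/3\mathbb{Z} \times \mathbb{Z}/3\mathbb{Z}$ it picks distinct primes $p_i \equiv 1 \mod 3$ and checks that $|(\mathbb{Z}/n\mathbb{Z})^*|$ is a multiple of $|G|$.

```python
from math import gcd, prod

def is_prime(m):
    """Trial division; adequate for the small numbers used here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def primes_for_orders(orders):
    """For each n_i, pick the smallest prime p_i = 1 mod n_i not already used."""
    chosen = []
    for n_i in orders:
        p = n_i + 1
        while not is_prime(p) or p in chosen:
            p += n_i
        chosen.append(p)
    return chosen

def unit_group_order(n):
    """Order of (Z/nZ)^*, counted directly."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

# G = Z/3Z x Z/3Z: the two primes must be distinct, so 7 cannot be reused.
orders = [3, 3]
primes = primes_for_orders(orders)
n = prod(primes)
print(primes, n, unit_group_order(n))
```

With these choices $G$ is realized inside the cyclotomic field $\mathbb{Q}(\zeta_{91})$; the requirement that the primes be distinct is what forces the second prime to be 13 rather than 7.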

A much stronger result is true: it can be proven that every finite abelian
extension of $\mathbb{Q}$ is contained in some cyclotomic field. This is the Kronecker-Weber
theorem, dating back to the second half of the nineteenth century. The natural
context for this result is class field theory and is well beyond the scope of this book.
The reader will be able to verify (Exercise 7.19) that every quadratic extension of $\mathbb{Q}$
is contained in some cyclotomic field.


Exercises 


7.1. Find explicitly a generator of a quadratic intermediate field of the extension
$\mathbb{Q} \subseteq \mathbb{Q}(\zeta_{10})$.

7.2. ▷ Let $R$ be an integral domain, and let $f(x) \in R[x]$ be a polynomial of degree $n$.
Show how to obtain $f(x)$ by specializing the ‘universal’ polynomial $P_n(x)$ defined
in §7.3. [§7.3]

7.3. Prove that the elementary symmetric functions $s_1, \dots, s_n$ (see §7.3) are
algebraically independent. (Hint: Use Exercise 1.28.)

7.4. ▷ Prove that every finite group is isomorphic to the group of automorphisms
of some Galois extension. [§7.3]

7.5. ▷ Let $k \subseteq F$ be a radical extension, and let $k \subseteq K$ be any extension; assume
$F$ and $K$ are contained in a larger field. Prove that $K \subseteq FK$ is radical. [§7.4]

7.6. Let $f(x) \in \mathbb{Q}[x]$ be an irreducible cubic, and let $\rho$ be a real root of $f(x)$. Prove
that the splitting field of $f(x)$ over $\mathbb{Q}$ is $\mathbb{Q}(\rho, \sqrt{D})$, where $D$ is the discriminant
of $f(x)$.

7.7. Let $f(x) \in \mathbb{Q}[x]$ be an irreducible cubic with exactly one real root. Prove that
the discriminant of $f(x)$ is not a square in $\mathbb{Q}$.

7.8. ▷ (Cf. Exercise IV.4.12.) Find (mathematically or by library search) the lattice
of subgroups of $S_4$, and verify that the transitive subgroups of $S_4$ are (isomorphic
to) $S_4$, $A_4$, $D_8$, $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$, and $\mathbb{Z}/4\mathbb{Z}$. [§7.5]

7.9. Compute the Galois group of the polynomial $x^4 - 2$.

7.10. ▷ Prove that the polynomial $x^5 - 5x - 1$ has exactly 3 real roots (this is a
calculus exercise!) and is irreducible over $\mathbb{Q}$. Prove that its Galois group is $S_5$.
[§7.5]

7.11. ▷ Let $n > 0$ be an integer. Note that

$$f_n(x) := (x^2 + 2) \cdot x \cdot (x - 2) \cdots (x - 2(n - 4)) \cdot (x - 2(n - 3))$$

has $n - 2$ rational roots and 2 nonreal, complex roots. Prove that for infinitely
many integers $q$, the polynomial

$$q f_n(x) + 2 \in \mathbb{Z}[x]$$

has $(n - 2)$ real roots, 2 nonreal complex roots, and is irreducible over $\mathbb{Q}$. Conclude
that for each prime $p$ there are infinitely many polynomials of degree $p$ in $\mathbb{Z}[x]$
whose Galois group is $S_p$. [§7.3, §7.5]

7.12. ¬ Let $f(x) \in k[x]$ be a separable irreducible polynomial of prime degree $p$
over a field $k$, and let $\alpha_1, \dots, \alpha_p$ be the roots of $f(x)$ in its splitting field $F$. Prove
that the Galois group of $f(x)$ contains an element $\sigma$ of order $p$, ‘cycling’ through
the roots. [7.13]

7.13. ¬ Let $f(x) \in k[x]$ be a separable irreducible polynomial of prime degree $p$
over a field $k$. Let $\alpha$ be a root of $f(x)$ in $\overline{k}$, and suppose you can express another root
of $f(x)$ as a polynomial in $\alpha$, with coefficients in $k$. Prove that you can express
all roots as polynomials in $\alpha$ and that the Galois group of $f(x)$ is $\mathbb{Z}/p\mathbb{Z}$. (Use
Exercise 7.12.) [7.14]

7.14. ¬ Let $k$ be a field of characteristic $p > 0$, and let $f(x) = x^p - x - a \in k[x]$.
Assume that $f(x)$ has no roots in $k$. Prove that $f(x)$ is irreducible and that its
Galois group is $\mathbb{Z}/p\mathbb{Z}$. (Note that if $\alpha$ is a root of $f(x)$, so is $\alpha + 1$; use Exercises 1.8
and 7.13.)

The splitting field of $f(x)$ is an Artin-Schreier extension of $k$. [7.16]

7.15. ¬ Let $k \subseteq F$ be a cyclic Galois extension of degree $p$, where $\mathrm{char}\, k = p > 0$.
Prove that it is an Artin-Schreier extension. (Hint: What is $\mathrm{tr}_{k \subseteq F}(1)$? Use the
additive version of Hilbert’s theorem 90, Exercise 6.19.) [6.19]

7.16. ▷ The Artin-Schreier map on a field $k$ of positive characteristic $p$ is the
function

$$x \mapsto x^p - x.$$

Denote this function by $AS$, and let $AS^{-1}$ denote its inverse (which is only defined
up to the addition of an element of $\mathbb{F}_p$: see Exercise 7.14).

Express the solutions of a quadratic equation $x^2 + bx + c = 0$ in characteristic 2
in terms of $AS^{-1}$ for $b \neq 0$. (For $b \neq 0$, the roots must live in an Artin-Schreier
extension of $k$, since the polynomial is then separable. It is inseparable for $b = 0$;
solving the equation in this case amounts to extending $k$ by the unique square root
of $c$.) [§7.4]

7.17. ¬ Let $f(x) \in k[x]$ be a separable irreducible polynomial of degree $n$ over a
field $k$, and let $F$ be its splitting field. Assume $\mathrm{Aut}_k(F) \cong S_n$, and let $\alpha$ be a root
of $f(x)$ in $F$.

• Prove that $\mathrm{Aut}_{k(\alpha)}(F) \cong S_{n-1}$.

• Prove that there are no proper subfields of $k(\alpha)$ properly containing $k$. (Hint:
Exercise IV.4.8.)

[7.18]

7.18. Let $f(x) \in k[x]$ be a separable irreducible polynomial of degree $n$ over a
field $k$, with Galois group $S_n$. Let $\alpha$ be a root of $f(x)$ in $\overline{k}$, and let $g(x) \in k[x]$ be
any nonconstant polynomial of degree $< n$. Prove that $\alpha$ may be expressed as a
polynomial in $g(\alpha)$, with coefficients in $k$. (Use Exercise 7.17.)

7.19. ▷ Let $d$ be a positive integer.

• Prove that the discriminant of a polynomial $f(x) = \prod_{i=1}^{d} (x - \alpha_i)$ equals the
product $\pm \prod_{i=1}^{d} f'(\alpha_i)$.

• Prove that the discriminant of $x^d - 1$ is $\pm d^d$, and note that this is a square
in $\mathbb{Q}(\zeta_d)$.

• Prove that $\mathbb{Q}(\zeta_{4d})$ contains square roots of $d$ and $-d$.

• Conclude that every quadratic extension of $\mathbb{Q}$ may be embedded in a cyclotomic
field $\mathbb{Q}(\zeta_n)$, for a suitable $n$.

(This is a very special case of the Kronecker-Weber theorem.) [§7.6]




Chapter VIII 


Linear algebra, reprise 


We come back to linear algebra, with the task of studying slightly more sophisticated
constructions than those reviewed in Chapter VI. Our main goals are to
discuss some aspects of multilinear algebra, tensor products, and Hom, with special
attention devoted to dual modules. Along the way we will meet several other
interesting notions, including a first taste of projective and injective modules and
of Tor and Ext.

As in Chapter VI, we will attempt to deal with the more general case of R- 
modules, specializing to vector spaces only when this provides a drastic simpli- 
fication or conforms to inveterate habits. Covering the material at this level of 
generality gives us an excellent excuse to complement the preliminary notions de- 
scribed in the very first chapter of this book. 


1. Preliminaries, reprise 

1.1. Functors. In this book we have given an unusually early introduction to the
notion of category, motivated by the near omnipresence of universal problems at the
very foundation of algebra. The hope was that the unifying viewpoint offered by
categories would help the reader in the task of organizing the information carried
by the basic constructions we have encountered.

However, so far we have never seriously had to ‘go from one category to another’,
and therefore we have not felt compelled to complete the elementary presentation
of categories by discussing more formally how this is done. The careful reader has
likely seen it happen between the lines, at least in noninteresting ways: for example,
if we strip a ring of its multiplicative structure, we are left with an abelian group;
and if a field $F$ is an extension of a field $k$, then we may view $F$ as a $k$-vector space:
thus, there are evident ways to go from $\mathsf{Ring}$ to $\mathsf{Ab}$ or from the category $\mathsf{Fld}_k$ of
extensions of $k$ to $k\text{-}\mathsf{Vect}$. More interesting examples are the group of units of a
ring, the homology of a complex, the automorphism group of a Galois field extension
(and note that this latter ‘turns arrows around’; see below), etc. Implicitly, we have
in fact encountered many instances of such ‘functions between categories’.

The notion of functor formalizes these operations. A functor between two 
categories C, D will ‘send objects of C to objects of D’, and since categories carry 
the information of morphisms, functors will have to deal with this information as 
well. 

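Definition 1.1. Let $\mathsf{C}$, $\mathsf{D}$ be two categories. A covariant functor

$$\mathscr{F} : \mathsf{C} \to \mathsf{D}$$

is an assignment of an object $\mathscr{F}(A) \in \mathrm{Obj}(\mathsf{D})$ for every $A \in \mathrm{Obj}(\mathsf{C})$ and of a
function

$$\mathrm{Hom}_{\mathsf{C}}(A, B) \to \mathrm{Hom}_{\mathsf{D}}(\mathscr{F}(A), \mathscr{F}(B))$$

for every pair of objects $A$, $B$ in $\mathsf{C}$; this function is also denoted¹ $\mathscr{F}$ and must
preserve identities and compositions. That is,

$$\mathscr{F}(1_A) = 1_{\mathscr{F}(A)}$$

$\forall A \in \mathrm{Obj}(\mathsf{C})$, and

$$\mathscr{F}(\beta \circ \alpha) = \mathscr{F}(\beta) \circ \mathscr{F}(\alpha)$$

$\forall A, B, C \in \mathrm{Obj}(\mathsf{C})$, $\forall \alpha \in \mathrm{Hom}_{\mathsf{C}}(A, B)$, $\forall \beta \in \mathrm{Hom}_{\mathsf{C}}(B, C)$.

A contravariant functor $\mathsf{C} \to \mathsf{D}$ is a covariant functor $\mathsf{C}^{op} \to \mathsf{D}$ from the
opposite category. ⌟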
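The two requirements in Definition 1.1 can be tested on a familiar example. The sketch below is an illustration under stated assumptions: Python types and functions stand in (loosely) for the objects and morphisms of a category, and the hypothetical helper `fmap` plays the role of the functor sending a type to lists over that type and a function to its entrywise application. None of these names come from the text.

```python
def fmap(f):
    """Action of the list functor on a morphism f: apply f entrywise."""
    return lambda xs: [f(x) for x in xs]

def compose(g, f):
    """Composition g o f (first f, then g)."""
    return lambda x: g(f(x))

identity = lambda x: x

alpha = lambda n: n + 1        # alpha : int -> int
beta = lambda n: str(n)        # beta  : int -> str

sample = [1, 2, 3]

# F(1_A) = 1_{F(A)}
assert fmap(identity)(sample) == identity(sample)
# F(beta o alpha) = F(beta) o F(alpha)
assert fmap(compose(beta, alpha))(sample) == compose(fmap(beta), fmap(alpha))(sample)
```

Checking the identities on one sample list is of course not a proof; it only shows what the two equations say in a concrete case.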

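The opposite category $\mathsf{C}^{op}$ is obtained from $\mathsf{C}$ by ‘reversing the arrows’; cf. Exercise I.3.1.
The fact that contravariant functors $\mathscr{G} : \mathsf{C} \to \mathsf{D}$ (that is, covariant
functors $\mathsf{C}^{op} \to \mathsf{D}$) preserve compositions means that $\forall A, B, C \in \mathrm{Obj}(\mathsf{C})$,
$\forall \alpha \in \mathrm{Hom}_{\mathsf{C}}(A, B)$, $\forall \beta \in \mathrm{Hom}_{\mathsf{C}}(B, C)$:

$$\mathscr{G}(\beta \circ \alpha) = \mathscr{G}(\alpha) \circ \mathscr{G}(\beta).$$

Pictorially,

$$A \xrightarrow{\ \alpha\ } B \xrightarrow{\ \beta\ } C \qquad (\text{with composite } \beta \circ \alpha : A \to C)$$

is sent to

$$\mathscr{G}(A) \xleftarrow{\ \mathscr{G}(\alpha)\ } \mathscr{G}(B) \xleftarrow{\ \mathscr{G}(\beta)\ } \mathscr{G}(C) \qquad (\text{with composite } \mathscr{G}(\beta \circ \alpha) : \mathscr{G}(C) \to \mathscr{G}(A))$$

by a contravariant functor $\mathscr{G}$. Note the switch in the order of composition.

A covariant functor ‘preserves’ arrows, in the sense that every diagram in $\mathsf{C}$, say

[Diagram: objects $A$, $B$, $C$, $D$ of $\mathsf{C}$ joined by arrows]

is mapped by a covariant functor $\mathscr{F} : \mathsf{C} \to \mathsf{D}$ to a diagram with arrows in like
directions:

[Diagram: the objects $\mathscr{F}(A)$, $\mathscr{F}(B)$, $\mathscr{F}(C)$, $\mathscr{F}(D)$, with each arrow pointing the same way as in $\mathsf{C}$]

A contravariant functor $\mathscr{G} : \mathsf{C} \to \mathsf{D}$ ‘reverses’ the arrows:

[Diagram: the objects $\mathscr{G}(A)$, $\mathscr{G}(B)$, $\mathscr{G}(C)$, $\mathscr{G}(D)$, with each arrow reversed]

In both cases, commutative diagrams are sent to commutative diagrams.

¹ This is a little unfortunate, since the function depends on $A$ and $B$, but in practice it does
not lead to confusion. Some authors prefer $\mathscr{F}_{A,B}$, for clarity.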

In many contexts, contravariant functors are called presheaves. Thus, a presheaf
of sets on a category $\mathsf{C}$ is simply a contravariant functor $\mathsf{C} \to \mathsf{Set}$.

If $\mathrm{Hom}_{\mathsf{C}}(A, B)$ carries more structure, we may require functors to preserve this
structure. For example, we have seen that for $\mathsf{C} = R\text{-}\mathsf{Mod}$, the category of left-modules
over a ring $R$, $\mathrm{Hom}_{R\text{-}\mathsf{Mod}}(A, B)$ is itself an abelian group (end of §III.5.2);
a functor $\mathscr{F} : R\text{-}\mathsf{Mod} \to S\text{-}\mathsf{Mod}$ is additive if it preserves this structure, that is, if
the corresponding function

$$\mathrm{Hom}_{R\text{-}\mathsf{Mod}}(A, B) \to \mathrm{Hom}_{S\text{-}\mathsf{Mod}}(\mathscr{F}(A), \mathscr{F}(B))$$

is a homomorphism of abelian groups (for all $R$-modules $A$, $B$).


1.2. Examples of functors. In practice, an assignment of objects of a category
for objects of another category often comes with a (psychologically) ‘natural’ companion
assignment for morphisms, which is evidently covariant or contravariant.
We say that this assignment is ‘functorial’ if it preserves compositions.

All the simple-minded operations mentioned at the beginning of §1.1 (such
as viewing a ring as an abelian group under addition) are covariant functors: for
example, the fact that a homomorphism of rings is in particular a homomorphism
of the underlying abelian groups amounts to the statement that labeling each ring
with its underlying abelian group defines a covariant functor $\mathsf{Ring} \to \mathsf{Ab}$. Since
such functors are obtained by ‘forgetting’ part of the structure of a given object,
they are memorably called forgetful functors.

We have already mentioned briefly a few slightly more imaginative examples; 
here they are again. 

Example 1.2. If $R$ is a ring, we have denoted by $R^*$ the group of units in $R$;
every ring homomorphism $R \to S$ induces a group homomorphism $R^* \to S^*$, and
this assignment is compatible with compositions; therefore this operation defines a
covariant functor $\mathsf{Ring} \to \mathsf{Grp}$. ⌟

Example 1.3. The operation $\mathrm{Aut}_k(\_)$ from Galois field extensions to groups is
contravariantly functorial. Indeed, if $k \subseteq E \subseteq F$ is viewed as a morphism of two
Galois extensions ($k \subseteq E$ to $k \subseteq F$), we have a corresponding group homomorphism

$$\mathrm{Aut}_k(F) \to \mathrm{Aut}_k(E)$$

defined by restriction: $\varphi \mapsto \varphi|_E$; the latter maps $E$ to $E$ by virtue of the Galois
condition. The composition of two restrictions is a restriction, so the assignment is
indeed functorial. ⌟

Example 1.4. In §III.4.3 we have defined the spectrum of a commutative ring $R$,
$\mathrm{Spec}\, R$, as the set of prime ideals of $R$. If $R$, $S$ are commutative rings and $\varphi : R \to S$
is a ring homomorphism, then the inverse image $\varphi^{-1}(\mathfrak{p})$ of a prime ideal $\mathfrak{p}$ is a prime
ideal; thus, $\varphi$ induces a set-function

$$\varphi^* : \mathrm{Spec}(S) \to \mathrm{Spec}(R).$$

This assignment is clearly functorial, so we can view $\mathrm{Spec}$ as a contravariant functor
from the category of commutative rings to $\mathsf{Set}$. The reader will assign a topology
to $\mathrm{Spec}(R)$ (Exercise 1.7), making this example (even) more interesting. ⌟

Example 1.5. If $S$ is a set, recall (Example I.3.4) that one can construct a category
$\hat{S}$ whose objects are subsets of $S$ and where morphisms correspond to inclusions
of subsets. A presheaf of sets on $\hat{S}$ is a contravariant functor $\hat{S} \to \mathsf{Set}$. The prototypical
example consists of the functor which associates to $U \subseteq S$ the set of
functions from $U$ to a fixed set; the inclusion $U \subseteq V$ is mapped to the restriction
$\rho \mapsto \rho|_U$. More interesting (and very useful) examples may be obtained by considering
richer structures. For instance, if $T$ is a topological space, one can define a
category whose objects are the open sets in $T$ and whose morphisms are inclusions;
this leads to the notion of presheaf on a topological space. Standard notions such
as ‘(the groups of) continuous complex-valued functions on open sets of $T$’ are most
naturally viewed as the datum of a presheaf of abelian groups on $T$. Such concrete
examples often satisfy additional hypotheses, which make them sheaves, but that is
a topic for another book. In a sense, studying a branch of geometry (e.g., complex
differential geometry) amounts to studying spaces endowed with a suitable sheaf
(e.g., the sheaf of complex differentiable functions). ⌟

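An important class of functors that are available on any category may be obtained
as follows, and this will be a source of examples in the next few sections.
Let $\mathsf{C}$ be a category, and let $X$ be an object of $\mathsf{C}$. Then the assignments

$$A \mapsto \mathrm{Hom}_{\mathsf{C}}(X, A), \qquad A \mapsto \mathrm{Hom}_{\mathsf{C}}(A, X)$$

define, respectively, covariant and contravariant functors $\mathsf{C} \to \mathsf{Set}$. These are
denoted $\mathrm{Hom}_{\mathsf{C}}(X, \_)$, $\mathrm{Hom}_{\mathsf{C}}(\_, X)$, respectively. For example, if

$$A \xrightarrow{\ \alpha\ } B \xrightarrow{\ \beta\ } C$$

is a diagram in $\mathsf{C}$, stare at

[Diagram: $A \xrightarrow{\alpha} B \xrightarrow{\beta} C$, together with arrows $\varphi_A$, $\varphi_B$, $\varphi_C$ from $A$, $B$, $C$ to $X$]

Every $\alpha : A \to B$ determines a function

$$\mathrm{Hom}_{\mathsf{C}}(B, X) \to \mathrm{Hom}_{\mathsf{C}}(A, X)$$

defined by mapping $\varphi_B$ to $\varphi_A := \varphi_B \circ \alpha$, that is, by requiring the triangle on the left
to be commutative. The associativity of composition,

$$\varphi_C \circ (\beta \circ \alpha) = (\varphi_C \circ \beta) \circ \alpha,$$

expresses precisely the contravariant functoriality of this assignment. The functor
$\mathrm{Hom}_{\mathsf{C}}(\_, X)$ is often denoted $h_X$ for short, if $\mathsf{C}$ is understood.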
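The reversal of composition under $\mathrm{Hom}_{\mathsf{C}}(\_, X)$ can be watched in a small Python sketch. As an assumption made only for illustration, Python functions stand in for morphisms, and the hypothetical helper `h_X` sends $\alpha : A \to B$ to the precomposition map $\varphi \mapsto \varphi \circ \alpha$.

```python
def compose(g, f):
    """Composition g o f (first f, then g)."""
    return lambda x: g(f(x))

def h_X(alpha):
    """Action of Hom(_, X) on alpha : A -> B; sends phi : B -> X to phi o alpha : A -> X."""
    return lambda phi: compose(phi, alpha)

alpha = lambda a: len(a)       # alpha : str -> int   (A = str, B = int)
beta = lambda b: b % 3         # beta  : int -> int   (B = int, C = int)
phi_C = lambda c: c == 0       # phi_C : int -> bool  (X = bool)

lhs = h_X(compose(beta, alpha))(phi_C)          # h_X(beta o alpha) applied to phi_C
rhs = compose(h_X(alpha), h_X(beta))(phi_C)     # (h_X(alpha) o h_X(beta)) applied to phi_C

# Both sides are the function phi_C o beta o alpha : str -> bool.
for word in ["", "ab", "abc", "abcd"]:
    assert lhs(word) == rhs(word)
```

Note that `h_X(beta)` is applied first on the right-hand side, even though `beta` is applied last in `compose(beta, alpha)`: this is the switch in the order of composition.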

The brave reader may contemplate the fact that this technique associates to
each object $X$ of a category $\mathsf{C}$ a presheaf of sets $h_X$ on $\mathsf{C}$. This raises natural and
important questions, such as how to tell whether a given presheaf of sets $\mathscr{F}$ in fact
equals $h_X$ for some object $X$ of $\mathsf{C}$ (this makes $\mathscr{F}$ ‘representable’) or whether an
object $X$ may be reconstructed from the corresponding presheaf $h_X$. For example,
if $\mathsf{Top}$ is the category of topological spaces and $p$ denotes the final object of $\mathsf{Top}$
(that is, the one-point topological space), then $h_X(p)$ is nothing but $X$, viewed
as a set. For this reason, in such ‘geometric’ contexts (for example, in algebraic
geometry) $h_X(A)$ is called the ‘set of $A$-points of $X$’; $h_X$ is the functor of points
of $X$.


1.3. When are two categories ‘equivalent’ ? Essentially every notion of ‘iso- 
morphism’ encountered so far has boiled down to ‘structure-preserving bijection’, 
and the reader may well expect that the natural notion identifying two categories 
should be drawn from the same model: a functor matching objects of one category 
precisely with objects of another, preserving the structure of morphisms. 

One could certainly introduce such a notion, but it would be exceedingly restrictive.
Recall that solutions to universal problems, that is, just about every
construction we have run across, are only defined up to isomorphism (Proposition I.5.4).
Requiring solutions in one context to match exactly with solutions in
another would be problematic. The structure of an object in a category is adequately
carried by its isomorphism class², and a natural notion of ‘equivalence’
of categories should aim at matching isomorphism classes, rather than individual
objects. The morphisms are a more essential piece of information; the quality of a
functor is first of all measured on how it acts on morphisms.


² One should not take this viewpoint too far. It is tempting to think of isomorphic objects
in a category as ‘the same’, but this is also problematic and does not lead (as far as we know) to
a workable alternative. For example, some objects in a category may have ‘more automorphisms’
than others, and this information would be discarded if we simply chopped up the category into
isomorphism classes, draconianly promoting all isomorphisms to identity morphisms.

Definition 1.6. Let $\mathsf{C}$, $\mathsf{D}$ be two categories. A covariant functor $\mathscr{F} : \mathsf{C} \to \mathsf{D}$ is
faithful if for all objects $A$, $B$ of $\mathsf{C}$, the induced function

$$\mathrm{Hom}_{\mathsf{C}}(A, B) \to \mathrm{Hom}_{\mathsf{D}}(\mathscr{F}(A), \mathscr{F}(B))$$

is injective; it is full if this function is surjective, for all objects $A$, $B$. ⌟

‘Fully faithful’ functors $\mathscr{F} : \mathsf{C} \to \mathsf{D}$ are bijective on morphisms, and it follows
easily that they preserve isomorphism classes: $A \cong B$ in $\mathsf{C}$ if and only if $\mathscr{F}(A) \cong
\mathscr{F}(B)$ in $\mathsf{D}$ (Exercise 1.2). For all intents and purposes, a fully faithful functor
$\mathscr{F} : \mathsf{C} \to \mathsf{D}$ identifies $\mathsf{C}$ with a full³ subcategory of $\mathsf{D}$, even if ‘on objects’ $\mathscr{F}$ may
be neither injective nor surjective onto that subcategory. This idea is captured by
the following definition:

Definition 1.7. A covariant functor $\mathscr{F} : \mathsf{C} \to \mathsf{D}$ is an equivalence of categories if
it is fully faithful and ‘essentially surjective’; that is, $\mathscr{F}$ induces bijections on Hom-sets,
and for every object $Y$ of $\mathsf{D}$ there is an object $X$ of $\mathsf{C}$ such that $\mathscr{F}(X) \cong Y$
in $\mathsf{D}$. ⌟

Two categories $\mathsf{C}$, $\mathsf{D}$ are equivalent if there is a functor $\mathscr{F} : \mathsf{C} \to \mathsf{D}$ satisfying
the requirements in Definition 1.7. It is not at all immediate that this is indeed an
equivalence relation (why is it symmetric?), but this turns out to be the case. In
fact, an equivalence of categories $\mathscr{F}$ has an ‘inverse’, in a suitably weakened sense:
a functor $\mathscr{G} : \mathsf{D} \to \mathsf{C}$ such that there are ‘natural isomorphisms’ (see §1.5) from
$\mathscr{F} \circ \mathscr{G}$ and $\mathscr{G} \circ \mathscr{F}$ to the identity. Since we will not use these notions too extensively,
we will let the interested reader research the issue and find proofs.

Example 1.8. (Cf. Exercise I.3.6.) Construct a category by taking the objects to
be nonnegative integers and $\mathrm{Hom}(m, n)$ to be the set of $n \times m$ matrices with entries
in a field $k$, with composition defined by product of matrices (and suitable care
concerning matrices with no rows or columns). The resulting category is equivalent
to the category of finite-dimensional $k$-vector spaces. Indeed, we obtain a functor
from the former to the latter by sending $n$ to the vector space $k^n$ (endowed with
the standard basis) and each matrix to the corresponding linear map. This functor
is clearly fully faithful, and it is essentially surjective because vector spaces are
classified by their dimension. This example makes (more) precise the heuristic
considerations at the end of §VI.2.1. ⌟
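The functoriality claimed in Example 1.8 is the familiar statement that the matrix product corresponds to the composition of linear maps. The sketch below illustrates this in plain Python; the helper names `mat_mul` and `linear_map` are our own, matrices are stored as lists of rows, integer entries stand in for elements of $k = \mathbb{Q}$, and matrices with no rows or columns are not handled.

```python
def mat_mul(B, A):
    """Product B*A of matrices stored as lists of rows (nonempty matrices only)."""
    return [[sum(B[i][k] * A[k][j] for k in range(len(A)))
             for j in range(len(A[0]))] for i in range(len(B))]

def linear_map(M):
    """The functor on morphisms: an n x m matrix becomes a function k^m -> k^n."""
    return lambda v: [sum(row[j] * v[j] for j in range(len(v))) for row in M]

A = [[1, 2], [0, 1], [3, 0]]   # a morphism 2 -> 3 (a 3 x 2 matrix)
B = [[1, 0, 1], [2, 1, 0]]     # a morphism 3 -> 2 (a 2 x 3 matrix)
v = [5, 7]

composite = linear_map(mat_mul(B, A))(v)      # image of the composite morphism B o A
stepwise = linear_map(B)(linear_map(A)(v))    # composite of the two images
assert composite == stepwise
```

The convention that $\mathrm{Hom}(m, n)$ consists of $n \times m$ matrices is what makes the composite of $A : 2 \to 3$ and $B : 3 \to 2$ the product $BA$, in that order.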

Example 1.9. Recall (Definition VII.2.19) that the coordinate ring of an affine
algebraic set over a field $K$ is a reduced, commutative, finite-type $K$-algebra. We
can define the category $K\text{-}\mathsf{Aff}$ of affine $K$-algebraic sets by prescribing that the
objects be algebraic subsets of some affine $K$-space and defining⁴ $\mathrm{Hom}_{K\text{-}\mathsf{Aff}}(S, T)$
to be $\mathrm{Hom}_{K\text{-}\mathsf{Alg}}(K[T], K[S])$.

Thus, $K\text{-}\mathsf{Aff}$ is defined in such a way that the functor $K\text{-}\mathsf{Aff}^{op} \to K\text{-}\mathsf{Alg}$ that
maps an affine algebraic set $S$ to its coordinate ring $K[S]$ is an equivalence of the
opposite category of $K\text{-}\mathsf{Aff}$ with the subcategory of reduced, commutative, finite-type
$K$-algebras.

³ See Exercise I.3.8 for the definition of ‘full’ subcategory.

⁴ The switch in the order of $S$ and $T$ is reasonable: as the reader has verified in Exercise VII.2.12,
$K[S]$ may be interpreted as the $K$-algebra of ‘polynomial functions’ on $S$; any
function $\varphi : S \to T$ determines a pull-back of functions, $K[T] \to K[S]$:

[Diagram: $\varphi : S \to T$, with $\alpha : S \to K$ and $\beta : T \to K$]

1.4. Limits and colimits. The various universal properties encountered along
the way in this book are all particular cases of the notion of categorical limit,
which is worth mentioning explicitly. Let $\mathscr{F} : \mathsf{I} \to \mathsf{C}$ be a covariant functor, where
one thinks of $\mathsf{I}$ as a category ‘of indices’. The limit of $\mathscr{F}$ is (if it exists) an object $L$
of $\mathsf{C}$, endowed with morphisms $\lambda_I : L \to \mathscr{F}(I)$ for all objects $I$ of $\mathsf{I}$, satisfying the
following properties.

• If $\alpha : I \to J$ is a morphism in $\mathsf{I}$, then $\lambda_J = \mathscr{F}(\alpha) \circ \lambda_I$:

$$\begin{array}{ccc} & L & \\ {\scriptstyle\lambda_I}\swarrow & & \searrow{\scriptstyle\lambda_J} \\ \mathscr{F}(I) & \xrightarrow{\ \mathscr{F}(\alpha)\ } & \mathscr{F}(J) \end{array}$$

• $L$ is final with respect to this property: that is, if $M$ is another object, endowed
with morphisms $\mu_I$, also satisfying the previous requirement, then there exists
a unique morphism $M \to L$ making all relevant diagrams commute⁵.

The limit $L$ (if it exists) is unique up to isomorphism, as is every notion defined
by a universal property. It is denoted $\varprojlim \mathscr{F}$. The ‘left-pointing’ arrow reminds
us that $L$ stands ‘before’ all objects of $\mathsf{C}$ indexed by $\mathsf{I}$ via $\mathscr{F}$ (but it is, up to
isomorphisms, the ‘last’ object that may do so). This notion is also called inverse,
or projective, limit.

Example 1.10 (Products). Let $\mathsf{I}$ be the ‘discrete category’ consisting of two objects
1, 2, with only identity morphisms⁶, and let $\mathscr{A}$ be a functor from $\mathsf{I}$ to any category $\mathsf{C}$;
let $A_1 = \mathscr{A}(1)$, $A_2 = \mathscr{A}(2)$ be the two objects of $\mathsf{C}$ ‘indexed’ by $\mathsf{I}$. Then $\varprojlim \mathscr{A}$ is
nothing but the product of $A_1$ and $A_2$ in $\mathsf{C}$, as defined in §I.5.4: a limit exists if
and only if a product of $A_1$ and $A_2$ exists in $\mathsf{C}$.

We can similarly define the product of any (possibly infinite) family of objects in
a category as the limit over the corresponding discrete indexing category, provided
of course that this limit exists. ⌟

The limit notion is a little more interesting if the indexing category $\mathsf{I}$ carries
more structure.

⁴ (cont.) $\beta \mapsto \alpha := \beta \circ \varphi$. The definition of $\mathrm{Hom}_{K\text{-}\mathsf{Aff}}(S, T)$ is concocted so as to ensure that this pull-back
operation is a homomorphism of $K$-algebras.

⁵ Any such datum of $M$ and morphisms $\mu_I$ is called a cone for $\mathscr{F}$. Therefore, the limit of $\mathscr{F}$ is
a ‘universal cone’: it is a final object in the (easily defined) category of cones of $\mathscr{F}$.

⁶ This latter prescription is what makes the category ‘discrete’: cf. Example I.3.3.

Example 1.11 (Equalizers and kernels). Let $\mathsf{I}$ again be a category with two objects
1, 2, but assume that morphisms look like this:

[Diagram: the objects 1 and 2, joined by two parallel arrows $\alpha$, $\beta$]

That is, add to the discrete category two ‘parallel’ morphisms $\alpha$, $\beta$ from one of the
objects to the other. A functor $\mathscr{K} : \mathsf{I} \to \mathsf{C}$ amounts to the choice of two objects
$A_1$, $A_2$ in $\mathsf{C}$ and two parallel morphisms between them. Limits of such functors are
called equalizers. For a concrete example, assume $\mathsf{C} = R\text{-}\mathsf{Mod}$ is the category of
$R$-modules for some ring $R$; let $\varphi : A_2 \to A_1$ be a homomorphism, and choose $\mathscr{K}$
as above, with $\mathscr{K}(\alpha) = \varphi$ and $\mathscr{K}(\beta) =$ the zero-morphism. Then $\varprojlim \mathscr{K}$ is nothing
but the kernel of $\varphi$, as the reader will verify (Exercise 1.14). ⌟
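On underlying sets, the equalizer of two parallel maps is simply the subset on which they agree, and taking the second map to be zero recovers the kernel. The following Python sketch illustrates this for one finite example; the helper `equalizer` and the choice of $\mathbb{Z}/12\mathbb{Z}$ with multiplication by 4 are assumptions made for illustration, not taken from the text, and the code computes only the underlying set, not the universal property.

```python
def equalizer(domain, f, g):
    """Elements of the domain on which the parallel maps f and g agree."""
    return [x for x in domain if f(x) == g(x)]

Z12 = list(range(12))            # underlying set of Z/12Z
phi = lambda x: (4 * x) % 12     # a homomorphism Z/12Z -> Z/12Z
zero = lambda x: 0               # the zero morphism

kernel = equalizer(Z12, phi, zero)
print(kernel)
```

Any map $h$ into $\mathbb{Z}/12\mathbb{Z}$ with $\varphi \circ h = 0$ must take values in this subset, which is the set-level shadow of the universal property in Exercise 1.14.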

Example 1.12 (Limits over chains). In another typical situation, $\mathsf{I}$ may consist of
a totally ordered set, for example:

$$\cdots \to 4 \to 3 \to 2 \to 1$$

(that is, the objects are $i$, for all positive integers $i$, and there is a unique morphism
$i \to j$ whenever $i \ge j$; we are only drawing the morphisms $j + 1 \to j$). Choosing
$\mathscr{F} : \mathsf{I} \to \mathsf{C}$ is equivalent to choosing objects $A_i$ of $\mathsf{C}$ for all positive integers $i$ and
morphisms $\varphi_{ji} : A_i \to A_j$ for all $i \ge j$, with the requirement that $\varphi_{ii} = 1_{A_i}$, and
$\varphi_{kj} \circ \varphi_{ji} = \varphi_{ki}$ for all $i \ge j \ge k$. That is, the choice of $\mathscr{F}$ amounts to the choice of
a diagram

$$\cdots \to A_4 \xrightarrow{\ \varphi_{34}\ } A_3 \xrightarrow{\ \varphi_{23}\ } A_2 \xrightarrow{\ \varphi_{12}\ } A_1$$

in $\mathsf{C}$. An inverse limit $\varprojlim \mathscr{F}$ (which may also be denoted $\varprojlim_i A_i$, when the morphisms
$\varphi_{ji}$ are evident from the context) is then an object $A$ endowed with morphisms
$\varphi_i : A \to A_i$ such that the whole diagram

[Diagram: $A$ with arrows $\varphi_i$ to each $A_i$ in the chain $\cdots \to A_4 \to A_3 \to A_2 \to A_1$]

commutes and such that any other object satisfying this requirement factors uniquely
through $A$.

Such limits exist in many standard situations. For example, let $\mathsf{C} = R\text{-}\mathsf{Mod}$ be
the category of left-modules over a fixed ring $R$, and let $A_i$, $\varphi_{ji}$ be as above.

Claim 1.13. The limit $\varprojlim_i A_i$ exists in $R\text{-}\mathsf{Mod}$.

Proof. The product $\prod_i A_i$ consists of arbitrary sequences $(a_i)_{i>0}$ of elements $a_i \in A_i$.
Say that a sequence $(a_i)_{i>0}$ is coherent if for all $i > 0$ we have $a_i = \varphi_{i,i+1}(a_{i+1})$.
Coherent sequences form an $R$-submodule $A$ of $\prod_i A_i$; the canonical projections
restrict to $R$-module homomorphisms $\varphi_i : A \to A_i$. The reader will check that $A$
is a limit $\varprojlim_i A_i$ (Exercise 1.15). □

This example easily generalizes to families indexed by more general posets. ⌟
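The coherence condition in the proof of Claim 1.13 is easy to test on finite truncations. The sketch below is an illustration under assumptions not made in the text: it takes $A_i = \mathbb{Z}/p^i\mathbb{Z}$ with $\varphi_{i,i+1}$ the reduction modulo $p^i$ (the tower whose limit is usually called the $p$-adic integers), and the helper names `is_coherent` and `from_integer` are hypothetical. Only the first few terms of a sequence can be examined this way.

```python
def is_coherent(seq, p=2):
    """seq[i-1] is an element a_i of Z/p^i Z; check a_i = a_{i+1} mod p^i for each i."""
    return all(seq[i] % p ** i == seq[i - 1] for i in range(1, len(seq)))

def from_integer(n, length, p=2):
    """Truncated coherent sequence (n mod p, n mod p^2, ..., n mod p^length)."""
    return [n % p ** i for i in range(1, length + 1)]

minus_one = from_integer(-1, 5)
print(minus_one, is_coherent(minus_one))
print(is_coherent([1, 2, 7]))
```

Every integer gives a coherent sequence in this tower, but most sequences in the product do not satisfy the condition, as the second call shows.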

The ‘dual notion’ to limit is the colimit of a functor $\mathscr{F} : \mathsf{I} \to \mathsf{C}$. The colimit is
an object $C$ of $\mathsf{C}$, endowed with morphisms $\gamma_I : \mathscr{F}(I) \to C$ for all objects $I$ of $\mathsf{I}$,
such that $\gamma_I = \gamma_J \circ \mathscr{F}(\alpha)$ for all $\alpha : I \to J$ and that $C$ is initial with respect to
this requirement.

Again, we have already encountered several instances of this construction: for
example, the coproduct of two objects $A_1$, $A_2$ of a category is (if it exists) the
colimit over the discrete category with two objects, just as their product is the
corresponding limit. Cokernels are colimits, just as kernels are limits (cf. Example 1.11).

The colimit of $\mathscr{F}$ is (not surprisingly) denoted $\varinjlim \mathscr{F}$ and is called the direct,
or injective, limit of $\mathscr{F}$.

For a typical situation consider again the case of a totally ordered set $\mathsf{I}$, for
example:

$$1 \to 2 \to 3 \to 4 \to \cdots$$

A functor $\mathscr{F} : \mathsf{I} \to \mathsf{C}$ consists of the choice of objects and morphisms

$$A_1 \xrightarrow{\ \psi_{12}\ } A_2 \xrightarrow{\ \psi_{23}\ } A_3 \xrightarrow{\ \psi_{34}\ } A_4 \to \cdots$$

and the direct limit $\varinjlim_i A_i$ will be an object $A$ with morphisms $\psi_i : A_i \to A$ such
that the diagram

[Diagram: the chain $A_1 \to A_2 \to A_3 \to A_4 \to \cdots$ with arrows $\psi_i$ from each $A_i$ to $A$]

commutes and such that $A$ is initial with respect to this requirement.

Example 1.14. If $\mathsf{C} = \mathsf{Set}$ and all the $\psi_{ij}$ are injective, we are talking about a
‘nested sequence of sets’:

$$A_1 \subseteq A_2 \subseteq A_3 \subseteq A_4 \subseteq \cdots;$$

the direct limit of this sequence would be the ‘infinite union’ $\bigcup_i A_i$. ⌟

We have shamelessly relied on the reader’s intuition and used this notion already,
for example when we constructed the algebraic closure of a field (in §VII.2.1)
or whenever we have mentioned polynomial rings with ‘infinitely many variables’
(e.g., in Example III.6.5). More formally, $\bigcup_i A_i$ consists of equivalence classes of
pairs $(i, a_i)$, where $a_i \in A_i$ and $(i, a_i)$ is equivalent to $(j, a_j)$ for $i < j$ if $a_j = \psi_{ij}(a_i)$.

In the context of $R$-modules, one can consider the direct sum $D = \bigoplus_i A_i$ and
the submodule $K \subseteq D$ generated by elements of the form $a_j - \psi_{ij}(a_i)$ for all $i < j$
(where each $A_i$ is identified with its image in $D$). Then the quotient $D/K$ satisfies
the needed universal property, as the reader will check. In fact, it is no harder
to perform these constructions for more general posets (Exercise 1.16). They are
somewhat more explicit and better behaved in the case of directed sets: partially
ordered sets $I$ such that $\forall i, j \in I$ there exists $k \in I$ such that $i \le k$, $j \le k$.
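The description of a direct limit by equivalence classes of pairs $(i, a_i)$ can be made concrete. In the sketch below, which rests on assumptions chosen only for illustration, every $A_i$ is $\mathbb{Z}$ and every $\psi_{i,i+1}$ is multiplication by 2; the helper names `psi`, `equivalent`, and `as_fraction` are hypothetical. Since these maps are injective, two pairs are equivalent exactly when they agree after being pushed to the larger of the two indices, and the class of $(i, a)$ may be labeled by the fraction $a/2^i$.

```python
from fractions import Fraction

def psi(i, j, a):
    """Transition map psi_ij : A_i -> A_j for i <= j; here multiplication by 2^(j-i)."""
    return a * 2 ** (j - i)

def equivalent(x, y):
    """(i, a) ~ (j, b) when both have the same image in A_k, k = max(i, j)."""
    (i, a), (j, b) = x, y
    k = max(i, j)
    return psi(i, k, a) == psi(j, k, b)

def as_fraction(x):
    """Label the class of (i, a) by a / 2^i; equivalent pairs get equal labels."""
    i, a = x
    return Fraction(a, 2 ** i)

print(equivalent((1, 3), (3, 12)), as_fraction((1, 3)), as_fraction((3, 12)))
```

With this choice of maps the labels fill out the fractions whose denominator is a power of 2, so the direct limit can be identified with that subring of $\mathbb{Q}$.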

Several standard constructions rely on a direct limit: for example, germs of
functions (or, more generally, ‘stalks of presheaves’) are defined by means of a
direct limit.

1.5. Comparing functors. Having introduced functors as the natural notion of
‘morphisms between categories’, the next natural step is to consider ‘morphisms
between functors’ and other ways to compare two given functors. A detailed description
of these notions is not needed in this book, but we want to mention briefly
a few concepts and essential remarks, again because they offer a unifying viewpoint.


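Definition 1.15. Let $\mathsf{C}$, $\mathsf{D}$ be categories, and let $\mathscr{F}$, $\mathscr{G}$ be (say, covariant) functors
$\mathsf{C} \to \mathsf{D}$. A natural transformation $\nu : \mathscr{F} \leadsto \mathscr{G}$⁷ is the datum of a morphism
$\nu_X : \mathscr{F}(X) \to \mathscr{G}(X)$ in $\mathsf{D}$ for every object $X$ in $\mathsf{C}$, such that $\forall \alpha : X \to Y$ in $\mathsf{C}$ the
diagram

$$\begin{array}{ccc} \mathscr{F}(X) & \xrightarrow{\ \mathscr{F}(\alpha)\ } & \mathscr{F}(Y) \\ {\scriptstyle\nu_X}\downarrow & & \downarrow{\scriptstyle\nu_Y} \\ \mathscr{G}(X) & \xrightarrow{\ \mathscr{G}(\alpha)\ } & \mathscr{G}(Y) \end{array}$$

commutes. A natural isomorphism is a natural transformation $\nu$ such that $\nu_X$ is
an isomorphism for every $X$. ⌟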
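The commutativity of the square in Definition 1.15 can be checked on a toy example. The sketch below is an illustration under our own assumptions: `list_map` and `option_map` are hypothetical stand-ins for two functors on Python types (lists, and ‘a value or `None`’), and `head` is the candidate natural transformation between them, returning the first entry of a list if there is one.

```python
def list_map(f):
    """F on morphisms: apply f to every entry of a list."""
    return lambda xs: [f(x) for x in xs]

def option_map(f):
    """G on morphisms: apply f to a value, leaving None (no value) alone."""
    return lambda x: None if x is None else f(x)

def head(xs):
    """Component nu_X : F(X) -> G(X); the first entry, or None for an empty list."""
    return xs[0] if xs else None

alpha = lambda n: n * n        # a morphism alpha : X -> Y with X = Y = int

for xs in [[], [3], [3, 1, 2]]:
    # nu_Y o F(alpha) agrees with G(alpha) o nu_X on each sample list
    assert head(list_map(alpha)(xs)) == option_map(alpha)(head(xs))
```

The same function `head` serves as the component at every type, which is the informal content of the word ‘natural’.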


Natural transformations arise in many contexts, for example when comparing
different notions of homology or cohomology in topology. The reader is likely
to have run into the Hurewicz homomorphism (from $\pi_1$ to $H_1$); it is a natural
transformation. In fact, any statement such as ‘there is a natural (or canonical)
homomorphism...’ is likely hiding a natural transformation between two functors.
This technical meaning of the word ‘natural’ matches, as a rule, its psychological
use.

A particularly basic and important example is the notion of adjoint functors,
which we will mention at a rather informal level, leaving to the inextinguishable
reader the task of making it formal by suitable use of natural transformations. Let
$\mathsf{C}$, $\mathsf{D}$ be categories, and let $\mathscr{F} : \mathsf{C} \to \mathsf{D}$, $\mathscr{G} : \mathsf{D} \to \mathsf{C}$ be functors. We say that $\mathscr{F}$ and
$\mathscr{G}$ are adjoint (and we say that $\mathscr{G}$ is right-adjoint to $\mathscr{F}$ and $\mathscr{F}$ is left-adjoint to $\mathscr{G}$)
if there are natural isomorphisms

$$\mathrm{Hom}_{\mathsf{C}}(X, \mathscr{G}(Y)) \cong \mathrm{Hom}_{\mathsf{D}}(\mathscr{F}(X), Y)$$

for all objects $X$ of $\mathsf{C}$ and $Y$ of $\mathsf{D}$. (More precisely, there should be a natural
isomorphism of ‘bifunctors’ $\mathsf{C} \times \mathsf{D} \to \mathsf{Set}$: $\mathrm{Hom}_{\mathsf{C}}(\_, \mathscr{G}(\_)) \cong \mathrm{Hom}_{\mathsf{D}}(\mathscr{F}(\_), \_)$.)

Once again, several constructions we have encountered along the way may be 
recast in these terms, and we will run into more instances of this situation in this 
chapter. 

⁷ The symbol $\Rightarrow$ is more standard than $\leadsto$, but it looks a little too much like the logical
connective $\Longrightarrow$ for our taste.

Example 1.16. The construction of the free group on a given set (§II.5) is concocted
so that giving a set-function from a set $A$ to a group $G$ is ‘the same as’
giving a group homomorphism from $F(A)$ to $G$ (§II.5.2). What this really means
is that for all sets $A$ and all groups $G$ there are natural identifications

$$\mathrm{Hom}_{\mathsf{Set}}(A, S(G)) \cong \mathrm{Hom}_{\mathsf{Grp}}(F(A), G),$$

where $S(G)$ ‘forgets’ the group structure of $G$. That is, the functor $F : \mathsf{Set} \to \mathsf{Grp}$
constructing free groups is left-adjoint to the forgetful functor $S : \mathsf{Grp} \to \mathsf{Set}$.

This of course applies to every other construction of ‘free’ objects we have
encountered: the free functor is, as a rule, left-adjoint to the forgetful functor. ⌟
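The identification of Hom-sets in a free/forgetful adjunction can be sketched in a few lines of Python. To keep the code short we deliberately swap in the free monoid for the free group of Example 1.16: words are Python lists, concatenation is the operation, and no inverses or reduced words are involved. The helper names `extend` and `restrict` are our own, and the target monoid is the integers under addition.

```python
from functools import reduce

def extend(f, op, unit):
    """Set-function f : A -> M  |->  monoid homomorphism F(A) -> M on words (lists)."""
    return lambda word: reduce(op, (f(a) for a in word), unit)

def restrict(h):
    """Monoid homomorphism h : F(A) -> M  |->  set-function A -> M via one-letter words."""
    return lambda a: h([a])

# M = (int, +, 0); f assigns a weight to each letter of A = {"a", "b"}.
f = {"a": 1, "b": 10}.get
h = extend(f, lambda x, y: x + y, 0)

u, v = ["a", "b"], ["b", "a", "a"]
assert h(u + v) == h(u) + h(v)                         # h respects concatenation
assert h([]) == 0                                      # h sends the empty word to the unit
assert all(restrict(h)(a) == f(a) for a in ["a", "b"]) # restricting h recovers f
```

The two helpers are mutually inverse on the data shown, which is the set-level content of the bijection between the two Hom-sets; naturality in $A$ and in the target is not checked here.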

Thus, interesting functors may turn out to be adjoints of harmless-looking 
functors. This has technical advantages: properties of the interesting ones may be 
translated into properties of the harmless ones, thereby giving easier proofs of these 
properties. 

In fact, the very fact that a functor has an adjoint will endow that functor
with convenient features. We say that $\mathscr F$ is a left-adjoint functor if it has a
right-adjoint and that $\mathscr G$ is a right-adjoint functor if it has a left-adjoint. Some properties
of (say) right-adjoint functors may be established without even knowing what the
companion left-adjoint functor may be. Here is the prototypical example of this
phenomenon.

Lemma 1.17. Right-adjoint functors commute with limits. 

That is, if $\mathscr G : D \to C$ has a left-adjoint $\mathscr F : C \to D$ and $\mathscr A : I \to D$ is another
functor, then there is a canonical isomorphism

$$\mathscr G(\varprojlim \mathscr A) \cong \varprojlim (\mathscr G \circ \mathscr A)$$

(if the limits exist, of course). As every good calculus student should readily
understand, this says that right-adjoint functors are continuous. (Needless to say,
left-adjoint functors turn out to commute with colimits and would rightly be termed
cocontinuous.)

We will not prove Lemma 1.17 in gory detail, but we will endeavor to convince
the reader that such statements are easier than they look and just boil down to
suitable applications of the universal properties defining the various concepts. By
contrast, proving that a specific given functor preserves products (for instance),
without appealing to abstract nonsense, may sometimes appear to involve some
‘real’ work.

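Assume $\mathscr G : D \to C$ is right-adjoint to $\mathscr F : C \to D$ and $\mathscr A : I \to D$ is a given
functor. As we have seen, the limit of $\mathscr A$ is final subject to fitting in commutative
diagrams

$$\begin{array}{ccc}
 & \varprojlim \mathscr A & \\
 \swarrow & & \searrow \\
 \mathscr A(I) & \xrightarrow{\ \mathscr A(\alpha)\ } & \mathscr A(J)
\end{array}$$

(with hopefully evident notation). Applying $\mathscr G$, we get commutative diagrams (in C)

$$\begin{array}{ccc}
 & \mathscr G(\varprojlim \mathscr A) & \\
 \swarrow & & \searrow \\
 \mathscr G \circ \mathscr A(I) & \xrightarrow{\ \mathscr G \circ \mathscr A(\alpha)\ } & \mathscr G \circ \mathscr A(J)
\end{array}$$

and hence, by the universal property defining the limit of $\mathscr G \circ \mathscr A$, a unique morphism

$$\mathscr G(\varprojlim \mathscr A) \longrightarrow \varprojlim (\mathscr G \circ \mathscr A).$$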

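Now, the morphisms (in C)

$$\varprojlim (\mathscr G \circ \mathscr A) \longrightarrow \mathscr G \circ \mathscr A(I)$$

determine, via the adjunction identification

$$\mathrm{Hom}_C(X, \mathscr G(Y)) \cong \mathrm{Hom}_D(\mathscr F(X), Y),$$

morphisms (in D)

$$\mathscr F(\varprojlim (\mathscr G \circ \mathscr A)) \longrightarrow \mathscr A(I).$$

By the universal property defining $\varprojlim \mathscr A$ we get a unique morphism

$$\mathscr F(\varprojlim (\mathscr G \circ \mathscr A)) \longrightarrow \varprojlim \mathscr A.$$

Applying adjunction again, the new morphism

$$\mathscr F(\varprojlim (\mathscr G \circ \mathscr A)) \longrightarrow \varprojlim \mathscr A$$

determines a unique morphism

$$\varprojlim (\mathscr G \circ \mathscr A) \longrightarrow \mathscr G(\varprojlim \mathscr A).$$

Summarizing, we have obtained natural morphisms

$$\mathscr G(\varprojlim \mathscr A) \to \varprojlim (\mathscr G \circ \mathscr A), \qquad \varprojlim (\mathscr G \circ \mathscr A) \to \mathscr G(\varprojlim \mathscr A).$$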

The compositions of these are easily checked to be the identity (by virtue of the 
uniqueness part of various universality properties invoked in the construction), con- 
cluding the verification of Lemma 1.17. 

What is missing from this outline is the explicit verification of the fact that 
every needed diagram commutes, which is necessary in order to apply the universal 
properties as stated. The reader will likely agree that such verifications, while 
possibly somewhat involved, must be routine. 
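
As a sanity check on Lemma 1.17 in a case where the answer is already known, consider the forgetful functor $S : \mathrm{Grp} \to \mathrm{Set}$ of Example 1.16, which is right-adjoint to the free group functor F. The chain below is only a sketch: it uses the adjunction twice and the universal property of products twice, and it assumes the standard fact (a consequence of the Yoneda lemma, Exercise 1.10) that an object is determined up to isomorphism by the functor of morphisms into it.

```latex
\begin{align*}
\mathrm{Hom}_{\mathrm{Set}}(A, S(G \times H))
 &\cong \mathrm{Hom}_{\mathrm{Grp}}(F(A), G \times H)
   && \text{(adjunction)} \\
 &\cong \mathrm{Hom}_{\mathrm{Grp}}(F(A), G) \times \mathrm{Hom}_{\mathrm{Grp}}(F(A), H)
   && \text{(product in Grp)} \\
 &\cong \mathrm{Hom}_{\mathrm{Set}}(A, S(G)) \times \mathrm{Hom}_{\mathrm{Set}}(A, S(H))
   && \text{(adjunction)} \\
 &\cong \mathrm{Hom}_{\mathrm{Set}}(A, S(G) \times S(H))
   && \text{(product in Set)}
\end{align*}
```

Since these identifications are natural in A, it follows that $S(G \times H) \cong S(G) \times S(H)$: the underlying set of a product of groups is the product of the underlying sets, as expected.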

Lemma 1.17 implies, for example, that right-adjoint functors preserve products
and that they must ‘preserve kernels’ when kernels make sense.

Example 1.18 (Exact functors). A functor is exact if it preserves exactness, that
is, it sends exact sequences to exact sequences. So far we have only studied exactness
in the context of modules over a ring (§III.7.1), so let R, S be rings and let
$\mathscr F$ be an additive functor R-Mod → S-Mod; $\mathscr F$ is exact if and only
if whenever

$$0 \to A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C \to 0$$

is an exact sequence of R-modules, then

$$0 \to \mathscr F(A) \xrightarrow{\ \mathscr F(\varphi)\ } \mathscr F(B) \xrightarrow{\ \mathscr F(\psi)\ } \mathscr F(C) \to 0$$

is an exact sequence of S-modules. It follows easily that the image of every exact
complex is exact (Exercise 1.23).

A common and very interesting situation occurs when a functor is not exact
but preserves ‘some’ of the exactness of sequences (we will encounter such examples
later in this chapter, for example in §2.3). We say that an additive functor $\mathscr F$ is
left-exact if whenever

$$0 \to A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C$$

is exact, then so is

$$0 \to \mathscr F(A) \xrightarrow{\ \mathscr F(\varphi)\ } \mathscr F(B) \xrightarrow{\ \mathscr F(\psi)\ } \mathscr F(C).$$

It turns out that a left-exact functor gives rise to a whole sequence of ‘new’, related
(‘derived’) functors; this hugely important phenomenon is studied in homological
algebra, and we will come back to it in Chapter IX.

Claim 1.19. Right-adjoint additive functors R-Mod → S-Mod are left-exact.

As the reader will verify (Exercise 1.27), this follows from the fact that
right-adjoint functors preserve kernels. (Note that the exactness of the sequence amounts
to the fact that $\varphi : A \to B$ identifies A with $\ker \psi$.)
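
A standard example, not worked out in the text at this point, shows that left-exactness is all one can expect in general. It assumes a fact established later in this chapter (Corollary 2.5): the functor $\mathrm{Hom}_{\mathbb Z}(\mathbb Z/2\mathbb Z, \_)$ is right-adjoint to $\_ \otimes_{\mathbb Z} \mathbb Z/2\mathbb Z$. Applying it to a short exact sequence of abelian groups gives:

```latex
\[
0 \to \mathbb Z \xrightarrow{\cdot 2} \mathbb Z \to \mathbb Z/2\mathbb Z \to 0
\quad\leadsto\quad
0 \to \underbrace{\mathrm{Hom}_{\mathbb Z}(\mathbb Z/2\mathbb Z, \mathbb Z)}_{=\,0}
  \to \underbrace{\mathrm{Hom}_{\mathbb Z}(\mathbb Z/2\mathbb Z, \mathbb Z)}_{=\,0}
  \to \underbrace{\mathrm{Hom}_{\mathbb Z}(\mathbb Z/2\mathbb Z, \mathbb Z/2\mathbb Z)}_{\cong\, \mathbb Z/2\mathbb Z}
\]
```

The resulting sequence is exact, as Claim 1.19 predicts, but its last map $0 \to \mathbb Z/2\mathbb Z$ is not surjective, so this right-adjoint functor is left-exact without being exact.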

Of course the corresponding co-statements hold: since left-adjoint functors
preserve colimits and cokernels are colimits, it follows that left-adjoint additive functors
$\mathscr F$ : R-Mod → S-Mod are necessarily right-exact, in the sense that if

$$A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C \to 0$$

is exact, then

$$\mathscr F(A) \xrightarrow{\ \mathscr F(\varphi)\ } \mathscr F(B) \xrightarrow{\ \mathscr F(\psi)\ } \mathscr F(C) \to 0$$

is also exact.




VIII. Linear algebra, reprise 


Exercises 


1.1. Let $\mathscr F : C \to D$ be a covariant functor, and assume that both C and D
have products. Prove that for all objects A, B of C, there is a unique morphism
$\mathscr F(A \times B) \to \mathscr F(A) \times \mathscr F(B)$ such that the relevant diagram involving natural
projections commutes.

If D has coproducts (denoted $\amalg$) and $\mathscr G : C \to D$ is contravariant, prove that
there is a unique morphism $\mathscr G(A) \amalg \mathscr G(B) \to \mathscr G(A \times B)$ (again, such that an
appropriate diagram commutes).

1.2. ▷ Let $\mathscr F : C \to D$ be a fully faithful functor. If A, B are objects in C, prove
that $A \cong B$ in C if and only if $\mathscr F(A) \cong \mathscr F(B)$ in D. [§1.3]

1.3. Recall (§II.1) that a group G may be thought of as a groupoid G with a single
object. Prove that defining the action of G on an object of a category C is equivalent
to defining a functor G → C.

1.4. ¬ Let R be a commutative ring, and let $S \subseteq R$ be a multiplicative subset in the
sense of Exercise V.4.7. Prove that ‘localization is a functor’: associating with every
R-module M the localization $S^{-1}M$ (Exercise V.4.8) and with every R-module
homomorphism $\varphi : M \to N$ the naturally induced homomorphism $S^{-1}M \to S^{-1}N$
defines a covariant functor from the category of R-modules to the category of
$S^{-1}R$-modules. [1.25]

1.5. For F a field, denote by $F^*$ the group of nonzero elements of F, with
multiplication. The assignment Fld → Grp mapping F to $F^*$ and a homomorphism of
fields $\varphi : k \to F$ to the restriction $\varphi|_{k^*} : k^* \to F^*$ is clearly a covariant functor.

On the other hand, a homomorphism of fields⁸ $k \to F$ is nothing but a field
extension $k \subseteq F$. Prove that the assignment $F \mapsto F^*$ on objects, together with
the prescription associating with every $k \subseteq F$ the norm $N_{k \subseteq F} : F^* \to k^*$
(cf. Exercise VII.1.12), gives a contravariant functor Fld → Grp. State and prove an
analogous statement for the trace (cf. Exercise VII.1.13).

1.6. ¬ Formalize the notion of presheaf of abelian groups on a topological space T.

If $\mathscr F$ is a presheaf on T, elements of $\mathscr F(U)$ are called sections of $\mathscr F$ on U. The
homomorphism $\rho_{UV} : \mathscr F(U) \to \mathscr F(V)$ induced by an inclusion $V \subseteq U$ is called the
restriction map.

Show that an example of presheaf is obtained by letting $\mathscr F(U)$ be the additive
abelian group of continuous complex-valued functions on U, with restriction of
sections defined by ordinary restriction of functions.

For this presheaf, prove that one can uniquely glue sections agreeing on
overlapping open sets. That is, if U and V are open sets and $s_U \in \mathscr F(U)$, $s_V \in \mathscr F(V)$
agree after restriction to $U \cap V$, prove that there exists a unique $s \in \mathscr F(U \cup V)$ such
that s restricts to $s_U$ on U and to $s_V$ on V.

This is essentially the condition making $\mathscr F$ a sheaf. [IX.1.15]

8 Recall that there are no ‘zero-homomorphisms’ between fields; cf. §VII.1.1.

1.7. ▷ Define a topology on Spec R by declaring the closed sets to be the sets V(I),
where $I \subseteq R$ is an ideal and V(I) denotes the set of prime ideals containing I.

• Verify that this indeed defines a topology on Spec R. (This is the Zariski topology
on Spec R.)

• Relate this topology to the Zariski topology defined in §VII.2.3.

• Prove that Spec is then a contravariant functor from the category of commutative
rings to the category of topological spaces (where morphisms are continuous
functions).

[§1.2]

1.8. Let K be an algebraically closed field, and consider the category K-Aff defined
in Example 1.9.

• Denote by $h_S$ the functor $\mathrm{Hom}_{K\text{-Aff}}(\_, S)$ (as in §1.2), and let $p = \mathbb A^0_K$ be a
point. Show that there is a natural bijection between S and $h_S(p)$. (Use
Exercise VII.2.14.)

• Show how every $\varphi \in \mathrm{Hom}_{K\text{-Aff}}(S, T)$ determines a function of sets $S \to T$.

• If $S \subseteq \mathbb A^m_K$, $T \subseteq \mathbb A^n_K$, show that the function $S \to T$ determined by a morphism
$\varphi \in \mathrm{Hom}_{K\text{-Aff}}(S, T)$ is the restriction of a ‘polynomial function’ $\mathbb A^m_K \to \mathbb A^n_K$.
(Part of this exercise is to make sense of what this means!)

1.9. ¬ Let C, D be categories, and assume C to be small. Define a functor
category $D^C$, whose objects are covariant functors C → D and whose morphisms are
natural transformations⁹.

Prove that the assignment $X \mapsto h_X := \mathrm{Hom}_C(\_, X)$ (cf. §1.2) defines a covariant
functor $C \to \mathrm{Set}^{C^{op}}$. (Define the action on morphisms in the natural way.)
[1.11, IX.1.11]

1.10. ¬ Let C be a category, X an object of C, and consider the contravariant
functor $h_X := \mathrm{Hom}_C(\_, X)$ (cf. §1.2). For every contravariant functor $\mathscr F : C \to \mathrm{Set}$,
prove that there is a bijection between the set of natural transformations $h_X \leadsto \mathscr F$
and $\mathscr F(X)$, defined as follows. The datum of a natural transformation $h_X \leadsto \mathscr F$
consists of a morphism from $h_X(A) = \mathrm{Hom}_C(A, X)$ to $\mathscr F(A)$ for every object A
of C. Map $h_X \leadsto \mathscr F$ to the image of $\mathrm{id}_X \in h_X(X)$ in $\mathscr F(X)$. (Hint: Produce an inverse
of the specified map. For every $f \in \mathscr F(X)$ and every $\varphi \in \mathrm{Hom}_C(A, X)$, how do you
construct an element of $\mathscr F(A)$?)

This result is called the Yoneda lemma. [1.11, IX.2.17]

1.11. (Cf. Exercise 1.9.) Let C be a small category. A contravariant functor
C → Set is representable if it is naturally isomorphic to a functor $h_X$ (cf. §1.2). In
this case, X ‘represents’ the functor. Prove that C is equivalent to the subcategory
of representable functors in $\mathrm{Set}^{C^{op}}$. (Hint: Yoneda; see Exercise 1.10.)

Thus, every (small) category is equivalent to a subcategory of a functor
category.

9 The reason why C is assumed to be small is that this ensures that the natural transformations
between two functors do form a set.

1.12. Let C, D be categories, and let $\mathscr F : C \to D$, $\mathscr G : D \to C$ be functors. Prove
that $\mathscr F$ is left-adjoint to $\mathscr G$ if and only if, for every object Y in D, the object $\mathscr G(Y)$
represents the functor $h_Y \circ \mathscr F$.

1.13. Let Z be the ‘Zen’ category consisting of no objects and no morphisms. One
can contemplate a functor $\mathscr Z$ from Z to any category C: no datum whatsoever need
be specified. What is $\varprojlim \mathscr Z$ (when such an object exists)?

1.14. ▷ Verify that the construction described in Example 1.11 indeed recovers the
kernel of a homomorphism of R-modules, as claimed. [§1.4]

1.15. ▷ Verify that the construction given in the proof of Claim 1.13 is an inverse
limit, as claimed. [§1.4]

1.16. ▷ Flesh out the sketch of the constructions of colimits in Set and R-Mod
given in §1.4, for any indexing poset. In Set, observe that the construction of the
colimit is simpler if the poset I is directed; that is, if $\forall i, j \in I$, there exists a $k \in I$
such that $i \le k$, $j \le k$. [§1.4]

1.17. ¬ Let R be a commutative ring, and let $I \subseteq R$ be an ideal. Note that $I^n \subseteq I^m$
if $n \ge m$, and hence we have natural homomorphisms $\varphi_{mn} : R/I^n \to R/I^m$ for
$n \ge m$.

• Prove that the inverse limit $\hat R_I := \varprojlim_n R/I^n$ exists as a commutative ring. This
is called the I-adic completion of R.

• By the universal property of inverse limits, there is a unique homomorphism
$R \to \hat R_I$. Prove that the kernel of this homomorphism is $\bigcap_n I^n$.

• Let $I = (x)$ in $R[x]$. Prove that the completion $\widehat{R[x]}_I$ is isomorphic to the power
series ring $R[[x]]$ defined in §III.1.3.

[1.18, 1.19]

1.18. Let R be a commutative Noetherian ring, and let $I \subseteq R$ be an ideal. Then
$I \cdot \bigcap_n I^n = \bigcap_n I^n$; the reader will prove this in Exercise 4.20.

Assume this, and prove that $\bigcap_n I^n$ equals the set of $r \in R$ such that $(1 - a)r = 0$
for some $a \in I$. (Hint: One inclusion is elementary. For the other, use the Nakayama
lemma in the form of Exercise VI.3.7. This result is attributed to Krull.)

For example, if I is proper, then $\bigcap_n I^n = (0)$ if R is an integral domain or if
it is local. In these cases, the natural map $R \to \hat R_I$ to the I-adic completion is
injective (cf. Exercise 1.17).

1.19. ¬ An important example of the construction presented in Exercise 1.17 is the
ring $\mathbb Z_p$ of p-adic integers: this is the limit $\varprojlim_r \mathbb Z/p^r\mathbb Z$, for a positive prime integer p.
The field of fractions of $\mathbb Z_p$ is denoted $\mathbb Q_p$; elements of $\mathbb Q_p$ are called p-adic numbers.

• Show that giving a p-adic integer A is equivalent to giving a sequence of integers
$A_r$, $r \ge 1$, such that $0 \le A_r < p^r$, and that $A_s \equiv A_r \bmod p^s$ if $s < r$.

• Equivalently, show that every p-adic integer has a unique infinite expansion

$$A = a_0 + a_1 \cdot p + a_2 \cdot p^2 + a_3 \cdot p^3 + \cdots, \quad \text{where } 0 \le a_i \le p - 1.$$

The arithmetic of p-adic integers may be carried out with these expansions in
precisely the same way as ordinary arithmetic is carried out with ordinary decimal
expansions.

• With notation as in the previous point, prove that $A \in \mathbb Z_p$ is invertible if and
only if $a_0 \ne 0$.

• Prove that $\mathbb Z_p$ is a local domain, with maximal ideal generated by (the image in
$\mathbb Z_p$ of) p.

• Prove that $\mathbb Z_p$ is a DVR (cf. Exercise V.2.19). (There is an evident valuation on
$\mathbb Q_p$.)

[§II.2.3]
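
The compatible sequences and digit expansions of Exercise 1.19 are easy to experiment with. The sketch below is an illustration only and does not replace the exercise: the helper names padic_digits and padic_inverse_digits are hypothetical, only finitely many digits are computed, and the three-argument form pow(a, -1, m) assumes Python 3.8 or later.

```python
# Sketch: truncated p-adic expansions of ordinary integers and their inverses.

def padic_digits(a, p, r):
    """First r digits a_0, ..., a_{r-1} of the integer a viewed in Z_p."""
    residue = a % p**r          # A_r: the class of a modulo p^r, in [0, p^r)
    digits = []
    for _ in range(r):
        digits.append(residue % p)
        residue //= p
    return digits

def padic_inverse_digits(a, p, r):
    """First r digits of 1/a in Z_p; requires a_0 = a mod p to be nonzero."""
    if a % p == 0:
        raise ValueError("a is not invertible in Z_p")
    return padic_digits(pow(a, -1, p**r), p, r)

print(padic_digits(-1, 5, 4))          # [4, 4, 4, 4]
print(padic_inverse_digits(3, 5, 4))   # [2, 3, 1, 3]
```

The first output reflects the identity -1 = 4 + 4·5 + 4·5² + ··· in $\mathbb Z_5$; the second says that 2 + 3·5 + 1·5² + 3·5³ = 417 is an inverse of 3 modulo 5⁴.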

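1.20. ¬ If m, n are positive integers and $m \mid n$, then $(n) \subseteq (m)$, and there is an
onto ring homomorphism $\mathbb Z/n\mathbb Z \twoheadrightarrow \mathbb Z/m\mathbb Z$. The limit ring $\varprojlim_n \mathbb Z/n\mathbb Z$ exists and is
denoted by $\hat{\mathbb Z}$. Prove that $\hat{\mathbb Z} \cong \mathrm{End}_{\mathrm{Ab}}(\mathbb Q/\mathbb Z)$. (Every $f \in \mathrm{End}_{\mathrm{Ab}}(\mathbb Q/\mathbb Z)$ is determined
by $f(\frac 1n)$; note that since $n \cdot \frac 1n = 1 \equiv 0 \bmod \mathbb Z$, $f(\frac 1n) = \frac{g(n)}{n}$ for some integer g(n),
which may be chosen so that $0 \le g(n) < n$. Show that $g(m) \equiv g(n) \bmod m$ if
$m \mid n$, and think about how elements of $\hat{\mathbb Z}$ may be described.) [1.21]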

1.21. Let $\hat{\mathbb Z}$ be as in Exercise 1.20.

• If R is a commutative ring endowed with homomorphisms $R \to \mathbb Z/p^r\mathbb Z$ for all
primes p and all r, compatible with all projections $\mathbb Z/p^r\mathbb Z \to \mathbb Z/p^s\mathbb Z$ for $s \le r$,
prove that there are ring homomorphisms $R \to \mathbb Z/n\mathbb Z$ for all n, compatible with
all projections $\mathbb Z/n\mathbb Z \to \mathbb Z/m\mathbb Z$ for $m \mid n$.

• Deduce that $\hat{\mathbb Z}$ satisfies the universal property for the product of $\mathbb Z_p$, as p ranges
over all positive prime integers.

It follows that $\prod_p \mathbb Z_p \cong \hat{\mathbb Z} \cong \mathrm{End}_{\mathrm{Ab}}(\mathbb Q/\mathbb Z)$.

1.22. Let C be a category, and consider the ‘product category’ C × C (make sense
of such a notion!). There is a ‘diagonal’ functor associating to each object X of C
the pair (X, X) as an object of C × C. On the other hand, there may be a ‘product
functor’ C × C → C, associating to (X, Y) a product X × Y; for example, this is
the case in Grp. Convince yourself that the product functor is right-adjoint to the
diagonal functor. If there is a coproduct functor, verify that it is left-adjoint to the
diagonal functor.

1.23. ▷ Let R, S be rings. Prove that an additive covariant functor $\mathscr F$ : R-Mod →
S-Mod is exact if and only if $\mathscr F(A) \to \mathscr F(B) \to \mathscr F(C)$ is exact in S-Mod
whenever $A \to B \to C$ is exact in R-Mod. Deduce that an exact functor sends exact
complexes to exact complexes. [§1.5, IX.3.7]

1.24. Let R, S be rings. An additive covariant functor $\mathscr F$ : R-Mod → S-Mod is
faithfully exact if ‘$\mathscr F(A) \to \mathscr F(B) \to \mathscr F(C)$ is exact in S-Mod if and only if
$A \to B \to C$ is exact in R-Mod’. Prove that an exact functor $\mathscr F$ : R-Mod →
S-Mod is faithfully exact if and only if $\mathscr F(M) \ne 0$ for every nonzero R-module M,
if and only if $\mathscr F(\varphi) \ne 0$ for every nonzero morphism $\varphi$ in R-Mod.

1.25. ¬ Prove that localization (Exercise 1.4) is an exact functor.

In fact, prove that localization ‘preserves homology’: if

$$M_\bullet : \quad \cdots \to M_{i+1} \xrightarrow{\ d_{i+1}\ } M_i \xrightarrow{\ d_i\ } M_{i-1} \to \cdots$$

is a complex of R-modules and S is a multiplicative subset of R, then the localization
$S^{-1}H_i(M_\bullet)$ of the i-th homology of $M_\bullet$ is the i-th homology $H_i(S^{-1}M_\bullet)$ of the
localized complex

$$\cdots \to S^{-1}M_{i+1} \xrightarrow{\ S^{-1}d_{i+1}\ } S^{-1}M_i \xrightarrow{\ S^{-1}d_i\ } S^{-1}M_{i-1} \to \cdots.$$

[2.12, 2.21, 2.22]

1.26. Prove that localization is faithfully exact in the following sense: let R be a
commutative ring, and let

$$(*) \qquad 0 \to A \to B \to C \to 0$$

be a sequence of R-modules. Then (*) is exact if and only if the induced sequence
of $R_{\mathfrak p}$-modules

$$0 \to A_{\mathfrak p} \to B_{\mathfrak p} \to C_{\mathfrak p} \to 0$$

is exact for every prime ideal $\mathfrak p$ of R, if and only if it is exact for every maximal
ideal $\mathfrak p$. (Cf. Exercise V.4.12.)

1.27. ▷ Let R, S be rings. Prove that right-adjoint functors R-Mod → S-Mod are
left-exact and left-adjoint functors are right-exact. [§1.5]


2. Tensor products and the Tor functors 

In the rest of the chapter we will work in the category R-Mod of modules over a
commutative ring R. Essentially everything we will see can be upgraded to the
noncommutative case without difficulty, but a bit of structure is lost in that case.
For example, if R is not commutative, then in the category R-Mod of left-R-modules
the Hom-sets $\mathrm{Hom}_{R\text{-Mod}}(M, N)$ are ‘only’ abelian groups (cf. the end of §III.5.2).
A tensor product $M \otimes_R N$ can only be defined if M is a right-R-module and N is
a left-R-module (in a sense, the two module structures annihilate each other, and
what is left is an abelian group). By contrast, in the commutative case we will be
able to define $M \otimes_R N$ simply as an R-module. In general, the theory goes through
as in the commutative case if the modules carry compatible left- and right-module
structures, except in questions such as the commutativity of tensors, where it would
be unreasonable to expect the commutativity of R to have no bearing. All in all, the
commutative case is a little leaner, and (we believe) it suffices in terms of conveying
the basic intuition on the general features of the theory.

Thus, R will denote a fixed commutative ring, unless stated otherwise.

2.1. Bilinear maps and the definition of tensor product. If M and N are
R-modules, we observed in the distant past (§III.6.1) that $M \oplus N$ serves as both
the product and coproduct of M and N: a situation in which a limit coincides with
a colimit. As a set, $M \oplus N$ is just $M \times N$; the R-module structure on $M \oplus N$
is defined by componentwise addition and multiplication by scalars. An R-module
homomorphism

$$M \oplus N \to P$$

is determined by R-module homomorphisms $M \to P$ and $N \to P$ (this is what
makes $M \oplus N$ into a coproduct).

But there is another way to map $M \times N$ to an R-module P, compatibly with
the R-module structures.

Definition 2.1. Let M, N, P be R-modules. A function $\varphi : M \times N \to P$ is
R-bilinear if

• $\forall m \in M$, the function $n \mapsto \varphi(m, n)$ is an R-module homomorphism $N \to P$,

• $\forall n \in N$, the function $m \mapsto \varphi(m, n)$ is an R-module homomorphism $M \to P$.

Thus, if $\varphi : M \times N \to P$ is R-bilinear, then $\forall m \in M$, $\forall n_1, n_2 \in N$, $\forall r_1, r_2 \in R$,

$$\varphi(m, r_1 n_1 + r_2 n_2) = r_1 \varphi(m, n_1) + r_2 \varphi(m, n_2),$$

and similarly for $\varphi(\_, n)$.

Note that $\varphi$ itself is not linear, even if we view $M \times N$ as the R-module $M \oplus N$,
as recalled above. On the other hand, there ought to be a way to deal with
R-bilinear maps ‘as if’ they were R-linear, because such maps abound in the context
of R-modules. For example, the very multiplication on R is itself an R-bilinear map

$$R \times R \to R.$$
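
The difference between bilinear and linear can be checked numerically for the multiplication map. This is a minimal sketch, assuming R = Z and a handful of arbitrarily chosen sample values; the name phi is just a stand-in for the multiplication map.

```python
# Sketch over R = Z: the multiplication map phi(m, n) = m * n on Z x Z.

def phi(m, n):
    return m * n

m, n1, n2, r1, r2 = 3, 5, -2, 4, 7

# R-linear in the second argument for fixed m (and symmetrically in the first).
assert phi(m, r1 * n1 + r2 * n2) == r1 * phi(m, n1) + r2 * phi(m, n2)

# Not R-linear on the module Z (+) Z: scaling the pair (1, 1) by 2
# scales the value by 4, not by 2.
assert phi(2 * 1, 2 * 1) == 4 * phi(1, 1)
assert phi(2 * 1, 2 * 1) != 2 * phi(1, 1)
```

A linear map on $M \oplus N$ would satisfy $\varphi(r(m, n)) = r\varphi(m, n)$; a bilinear map picks up a factor of r from each argument instead.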

Our experience with universal properties suggests the natural way to approach this
question. What we need is a new R-module $M \otimes_R N$, with an R-bilinear map

$$\otimes : M \times N \to M \otimes_R N,$$

such that every R-bilinear map $\varphi : M \times N \to P$ factors uniquely through this new
module $M \otimes_R N$,

$$\begin{array}{ccc}
M \times N & \xrightarrow{\ \varphi\ } & P \\
{\scriptstyle \otimes}\downarrow & \nearrow{\scriptstyle \exists!\,\overline\varphi} & \\
M \otimes_R N & &
\end{array}$$

in such a way that the map $\overline\varphi$ is a usual R-module homomorphism.

Thus, $M \otimes_R N$ would be the ‘best approximation’ to $M \times N$ available in R-Mod,
if we want to view R-bilinear maps from $M \times N$ as R-linear. The module $M \otimes_R N$
is called the tensor product of M and N over R. The subscript R is very important:
if M and N are modules over two rings R, S, then S-bilinearity is not the same
as R-bilinearity, so $M \otimes_R N$ and $M \otimes_S N$ may be completely different objects. In
context, it is not unusual to drop the subscript if the base ring is understood, but
we do not recommend this practice.

The prescription given above expresses the tensor product as the solution to
a universal problem; therefore we know right away that it will be unique up to
isomorphism, if it exists (Proposition I.5.4 once more), and we could proceed to
study it by systematically using the universal property.

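Example 2.2. For all R-modules N, $R \otimes_R N \cong N$.

Indeed, every R-bilinear $\varphi : R \times N \to P$ factors through N (as is immediately
verified):

$$\begin{array}{ccc}
R \times N & \xrightarrow{\ \varphi\ } & P \\
{\scriptstyle \otimes}\downarrow & \nearrow & \\
N & &
\end{array}$$

where $\otimes(r, n) = rn$. By the uniqueness property of universal objects, necessarily
$N \cong R \otimes_R N$.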

For another example, it is easy to see that there must be a canonical
isomorphism

$$M \otimes_R N \cong N \otimes_R M.$$

Indeed, every R-bilinear $\varphi : M \times N \to P$ may be decomposed as¹⁰

$$M \times N \longrightarrow N \times M \xrightarrow{\ \psi\ } P,$$

where $\psi(n, m) = \varphi(m, n)$; $\psi$ is also R-bilinear, so it factors uniquely through
$N \otimes_R M$. Therefore, $\varphi$ factors uniquely through $N \otimes_R M$, and this is enough to
conclude that there is a canonical isomorphism $N \otimes_R M \cong M \otimes_R N$.

However, such considerations are a little moot unless we establish that $M \otimes_R N$
exists to begin with. This requires a bit of work.

Lemma 2.3. Tensor products exist in R-Mod.

Proof. Given R-modules M and N, we construct ‘by hand’ a module satisfying
the universal requirement. Let $F^R(M \times N) = R^{\oplus (M \times N)}$ be the free R-module on
$M \times N$ (§III.6.3). This module comes equipped with a set-map

$$j : M \times N \to F^R(M \times N),$$

universal with respect to all set-maps from $M \times N$ to any R-module P; the main task
is to make this into an R-bilinear map. For example, we have to identify elements
in $F^R(M \times N)$ of the form $j(m, n_1 + n_2)$ with elements $j(m, n_1) + j(m, n_2)$, etc.

Thus, let K be the R-submodule of $F^R(M \times N)$ generated by all elements

$$j(m, r_1 n_1 + r_2 n_2) - r_1 j(m, n_1) - r_2 j(m, n_2)$$

and

$$j(r_1 m_1 + r_2 m_2, n) - r_1 j(m_1, n) - r_2 j(m_2, n)$$


10 Here is one situation in which the commutativity of R does play a role: if R is not
commutative, then this decomposition becomes problematic, even if M and N carry bimodule structures.
One can therefore not draw the conclusion $M \otimes_R N \cong N \otimes_R M$ in that case.

as $m, m_1, m_2$ range in M, $n, n_1, n_2$ range in N, and $r_1, r_2$ range in R. Let

$$M \otimes_R N := \frac{F^R(M \times N)}{K},$$

endowed with the map $\otimes : M \times N \to M \otimes_R N$ obtained by composing j with the
natural projection:

$$\otimes : M \times N \xrightarrow{\ j\ } F^R(M \times N) \longrightarrow M \otimes_R N = F^R(M \times N)/K.$$

The element $\otimes(m, n)$ (that is, the class of $j(m, n)$ modulo K) is denoted $m \otimes n$.

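It is evident that $(m, n) \mapsto m \otimes n$ defines an R-bilinear map. We have to check
that $M \otimes_R N$ satisfies the universal property, and this is also straightforward. If
$\varphi : M \times N \to P$ is any R-bilinear map, we have a unique induced R-linear map $\tilde\varphi$
from the free R-module, by the universal property of the latter:

$$\begin{array}{ccc}
M \times N & \xrightarrow{\ \varphi\ } & P \\
{\scriptstyle j}\downarrow & \nearrow{\scriptstyle \exists!\,\tilde\varphi} & \\
F^R(M \times N) & &
\end{array}$$

We claim that $\tilde\varphi$ restricts to 0 on K. Indeed, to verify this, it suffices to verify
that $\tilde\varphi$ sends to zero every generator of K, and this follows from the fact that $\varphi$ is
R-bilinear. For example,

$$\begin{aligned}
\tilde\varphi\bigl(j(m, r_1 n_1 + r_2 n_2) - r_1 j(m, n_1) - r_2 j(m, n_2)\bigr)
&= \tilde\varphi(j(m, r_1 n_1 + r_2 n_2)) - r_1 \tilde\varphi(j(m, n_1)) - r_2 \tilde\varphi(j(m, n_2)) \\
&= \varphi(m, r_1 n_1 + r_2 n_2) - r_1 \varphi(m, n_1) - r_2 \varphi(m, n_2) \\
&= 0.
\end{aligned}$$

It follows (by the universal property of quotients!) that $\tilde\varphi$ factors uniquely through
the quotient by K:

$$\begin{array}{ccc}
M \times N & \xrightarrow{\ \varphi\ } & P \\
{\scriptstyle j}\downarrow & \nearrow{\scriptstyle \tilde\varphi} & \uparrow{\scriptstyle \exists!\,\overline\varphi} \\
F^R(M \times N) & \longrightarrow & M \otimes_R N = F^R(M \times N)/K
\end{array}$$

and we are done. □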


As is often the case with universal objects, the explicit construction used to
prove the existence of $M \otimes_R N$ is almost never invoked. It is however good to
keep in mind that elements of $M \otimes_R N$ arise from elements of the free R-module on
$M \times N$, and therefore an arbitrary element of $M \otimes_R N$ is a finite linear combination

$$(*) \qquad \sum_i r_i (m_i \otimes n_i)$$

with $r_i \in R$, $m_i \in M$, and $n_i \in N$. The R-bilinearity of $\otimes : M \times N \to M \otimes_R N$
amounts to the rules:

$$\begin{aligned}
m \otimes (n_1 + n_2) &= m \otimes n_1 + m \otimes n_2, \\
(m_1 + m_2) \otimes n &= m_1 \otimes n + m_2 \otimes n, \\
m \otimes (rn) &= (rm) \otimes n = r(m \otimes n),
\end{aligned}$$

for all $m, m_1, m_2 \in M$, $n_1, n_2, n \in N$, and $r \in R$. In particular, note that the
coefficients $r_i$ in (*) are not necessary, since they can be absorbed into the corresponding
terms $m_i \otimes n_i$:

$$\sum_i r_i (m_i \otimes n_i) = \sum_i (r_i m_i) \otimes n_i.$$

Elements of the form $m \otimes n$ (that is, needing only one summand in the
expression) are called pure tensors. Dear reader, please remember that pure tensors are
special: usually, not every element of the tensor product is a pure tensor. See
Exercise 2.1 for one situation in which every tensor happens to be pure, and appreciate
how special that is.

Pure tensors are nevertheless very useful, as a set of generators for the tensor
product. For example, if two homomorphisms $\alpha, \beta : M \otimes_R N \to P$ coincide on
pure tensors, then $\alpha = \beta$. Frequently, computations involving tensor products are
reduced to simple verifications for pure tensors.

2.2. Adjunction with Hom and explicit computations. The tensor product
is left-adjoint to Hom. Once we parse what this rough statement means, it will
be a near triviality; but as we have found out in §1.5, the mere fact that $\_ \otimes_R N$ is
left-adjoint to any functor is enough to draw interesting conclusions about it.

First, we note that every R-module N defines, via $\otimes_R$, a new covariant functor
R-Mod → R-Mod, defined on objects by

$$M \mapsto M \otimes_R N.$$

To see how this works on morphisms, let

$$\alpha : M_1 \to M_2$$

be an R-module homomorphism. Crossing with N and composing with $\otimes$ defines
an R-bilinear map

$$M_1 \times N \to M_2 \times N \to M_2 \otimes N,$$

and hence an induced R-linear map

$$\alpha \otimes N : M_1 \otimes N \to M_2 \otimes N.$$

On pure tensors, this map is simply given by $m \otimes n \mapsto \alpha(m) \otimes n$, and functoriality
follows immediately: if $\beta : M_0 \to M_1$ is a second homomorphism, then
$(\alpha \otimes N) \circ (\beta \otimes N)$ and $(\alpha \circ \beta) \otimes N$ both map pure tensors $m \otimes n$ to
$\alpha(\beta(m)) \otimes n$, so they must agree on all tensors.

The adjunction statement given at the beginning of this subsection compares
this functor with the covariant functor $P \mapsto \mathrm{Hom}_{R\text{-Mod}}(N, P)$; cf. §1.2. Let’s see
more precisely how it works.

We have defined $M \otimes_R N$ so that giving an R-linear map $M \otimes_R N \to P$ to an
R-module P is ‘the same as’ giving an R-bilinear map $M \times N \to P$. Now recall
the definition of R-bilinear map: $\varphi : M \times N \to P$ is R-bilinear if both $\varphi(m, \_)$
and $\varphi(\_, n)$ are R-linear maps, for all $m \in M$ and $n \in N$. The first part of this
prescription says that $\varphi$ determines a function

$$M \to \mathrm{Hom}_R(N, P);$$

the second part says that this is an R-module homomorphism. Therefore, an
R-bilinear map is ‘the same as’ an element of

$$\mathrm{Hom}_R(M, \mathrm{Hom}_R(N, P)).$$

These simple considerations should be enough to make the following seemingly
complicated statement rather natural:

Lemma 2.4. For all R-modules M, N, P, there is an isomorphism of R-modules

$$\mathrm{Hom}_R(M, \mathrm{Hom}_R(N, P)) \cong \mathrm{Hom}_R(M \otimes_R N, P).$$

Proof. As noted before the statement, every $\alpha \in \mathrm{Hom}_R(M, \mathrm{Hom}_R(N, P))$
determines an R-bilinear map $\varphi : M \times N \to P$, by

$$(m, n) \mapsto \alpha(m)(n).$$

By the universal property, $\varphi$ factors uniquely through an R-linear map
$\overline\varphi : M \otimes_R N \to P$. Therefore, $\alpha$ determines a well-defined element
$\overline\varphi \in \mathrm{Hom}_R(M \otimes_R N, P)$.

The reader will check (Exercise 2.11) that this map $\alpha \mapsto \overline\varphi$ is R-linear and
construct an inverse. □

Corollary 2.5. For every R-module N, the functor $\_ \otimes_R N$ is left-adjoint to the
functor $\mathrm{Hom}_R(N, \_)$.

Proof. The claim is that the isomorphism found in Lemma 2.4 is natural in the
sense hinted at, but not fully explained, in §1.5; the interested reader should have
no problems checking this naturality. □
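
At the level of plain functions, the correspondence behind Lemma 2.4 is what programmers call currying. The following sketch is only an analogy made concrete: it assumes M = N = P = Z, works with functions rather than module homomorphisms, and the names curry and uncurry are introduced here for illustration.

```python
# Sketch of the bijection behind Lemma 2.4 at the level of functions:
# a two-variable map phi(m, n) corresponds to alpha with alpha(m)(n) = phi(m, n).

def curry(phi):
    """From phi: M x N -> P to alpha: M -> Hom(N, P)."""
    return lambda m: (lambda n: phi(m, n))

def uncurry(alpha):
    """From alpha: M -> Hom(N, P) back to phi: M x N -> P."""
    return lambda m, n: alpha(m)(n)

# Example with M = N = P = Z and the bilinear map phi(m, n) = 6 * m * n.
phi = lambda m, n: 6 * m * n
alpha = curry(phi)

assert alpha(2)(5) == phi(2, 5) == 60
assert uncurry(alpha)(2, 5) == phi(2, 5)
# alpha is additive in m, and each alpha(m) is additive in n.
assert alpha(2 + 3)(5) == alpha(2)(5) + alpha(3)(5)
assert alpha(2)(5 + 7) == alpha(2)(5) + alpha(2)(7)
```

Bilinearity of phi is exactly what makes each alpha(m) a homomorphism and makes alpha itself a homomorphism, which is the content of the two bullet points in Definition 2.1.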

By Lemma 1.17 (or rather its co-version), we can conclude that for each
R-module N, the functor $\_ \otimes_R N$ preserves colimits, and so does $M \otimes_R \_$, by the basic
commutativity of tensor products verified in §2.1. In particular, and this is good
material for another Pavlovian reaction,

$M \otimes_R \_$ and $\_ \otimes_R N$ are right-exact functors

(cf. Example 1.18).

These observations have several consequences, which make ‘computations’ with 
tensor products more reasonable. Here is a sample: 

Corollary 2.6. For all R-modules $M_1$, $M_2$, N,

$$(M_1 \oplus M_2) \otimes_R N \cong (M_1 \otimes_R N) \oplus (M_2 \otimes_R N).$$

(Moreover, by commutativity, $M \otimes_R (N_1 \oplus N_2) \cong (M \otimes_R N_1) \oplus (M \otimes_R N_2)$ just
as well.) Indeed, coproducts are colimits. In fact, $\otimes$ must then commute with
arbitrary (possibly infinite) direct sums:

$$\Bigl(\bigoplus_\alpha M_\alpha\Bigr) \otimes_R N \cong \bigoplus_\alpha (M_\alpha \otimes_R N).$$

This computes all tensors for free R-modules:

Corollary 2.7. For any two sets A, B:

$$R^{\oplus A} \otimes_R R^{\oplus B} \cong R^{\oplus A \times B}.$$

Indeed, ‘distributing’ the direct sum identifies the left-hand side with the direct
sum $(R^{\oplus A})^{\oplus B}$, which is isomorphic to the right-hand side (Exercise III.6.5). For
finitely generated free modules, this simply says that $R^{\oplus m} \otimes R^{\oplus n} \cong R^{\oplus mn}$.

Note that if $e_1, \dots, e_m$ generate M and $f_1, \dots, f_n$ generate N, then the pure
tensors $e_i \otimes f_j$ must generate $M \otimes_R N$. In the free case, if the $e_i$’s and $f_j$’s form bases
of $R^{\oplus m}$, $R^{\oplus n}$, resp., then the mn elements $e_i \otimes f_j$ must be a basis for $R^{\oplus m} \otimes R^{\oplus n}$.
Indeed they generate it; hence they must be linearly independent since this module
is free of rank mn. In particular, this is all that can happen if R is a field k and the
modules are, therefore, just k-vector spaces (Proposition VI.1.7). Tensor products
are more interesting over more general rings.
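
In the free case the basis $e_i \otimes f_j$ gives a concrete model that is convenient for experiments. The sketch below assumes R = Z and stores an element of $\mathbb Z^{\oplus m} \otimes \mathbb Z^{\oplus n}$ as an m × n matrix of coefficients; the helper names pure, add, and is_pure are hypothetical. In this model a pure tensor is an outer product, and the purity test used (vanishing of all 2 × 2 minors) is valid over a field and over Z, but should not be assumed for arbitrary rings.

```python
# Sketch: elements of Z^m (x) Z^n stored as m x n integer matrices,
# the coefficient of e_i (x) f_j sitting in row i, column j.

def pure(v, w):
    """The pure tensor v (x) w, as the outer product of coordinate vectors."""
    return [[a * b for b in w] for a in v]

def add(s, t):
    """Sum of two tensors, entry by entry."""
    return [[x + y for x, y in zip(rs, rt)] for rs, rt in zip(s, t)]

def is_pure(t):
    """True when all 2 x 2 minors vanish (rank <= 1)."""
    rows, cols = len(t), len(t[0])
    return all(
        t[i][j] * t[k][l] == t[i][l] * t[k][j]
        for i in range(rows) for k in range(i + 1, rows)
        for j in range(cols) for l in range(j + 1, cols)
    )

e1, e2 = [1, 0], [0, 1]

# Bilinearity rule (m1 + m2) (x) n = m1 (x) n + m2 (x) n in coordinates.
assert pure([1 + 4, 2 + 5], [7, 8]) == add(pure([1, 2], [7, 8]), pure([4, 5], [7, 8]))

# e1 (x) e1 + e2 (x) e2 is the identity matrix: not a pure tensor.
t = add(pure(e1, e1), pure(e2, e2))
assert is_pure(pure([1, 2], [7, 8]))
assert not is_pure(t)
```

The last assertion is a small instance of the warning in §2.1: a sum of two pure tensors need not be pure.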


Corollary 2.8. For all R-modules N and all ideals I of R,

$$\frac{R}{I} \otimes_R N \cong \frac{N}{IN}.$$

Indeed, $\_ \otimes_R N$ is right-exact; thus, the exact sequence

$$0 \to I \to R \to \frac{R}{I} \to 0$$

induces an exact sequence

$$I \otimes_R N \to R \otimes_R N \to \frac{R}{I} \otimes_R N \to 0.$$

The image of $I \otimes_R N$ in $R \otimes_R N \cong N$ is generated by the image of the pure tensors
$a \otimes n$ with $a \in I$, $n \in N$; this is IN. Thus, the second sequence identifies $N/(IN)$
with $(R/I) \otimes_R N$, as needed.


Corollary 2.9. For all ideals I, J of R,

$$\frac{R}{I} \otimes_R \frac{R}{J} \cong \frac{R}{I + J}.$$

This follows immediately from Corollary 2.8 and the ‘third isomorphism theorem’,
Proposition III.5.17. Indeed, $I \cdot (R/J) = (I + J)/J$.

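Example 2.10. $\mathbb Z/m\mathbb Z \otimes_{\mathbb Z} \mathbb Z/n\mathbb Z \cong \mathbb Z/\gcd(m, n)\mathbb Z$.

Indeed, $(m) + (n) = (\gcd(m, n))$ in $\mathbb Z$. For instance,

$$\frac{\mathbb Z}{2\mathbb Z} \otimes_{\mathbb Z} \frac{\mathbb Z}{3\mathbb Z} \cong 0,$$

a favorite on qualifying exams (cf. Exercise 2.2).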
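
Example 2.10 can be tested against the universal property by brute force. The sketch below rests on one observation that is not spelled out in the text: since $\mathbb Z/m\mathbb Z$ and $\mathbb Z/n\mathbb Z$ are generated by the class of 1, a $\mathbb Z$-bilinear map into $\mathbb Z/k\mathbb Z$ is determined by $c = \varphi(1, 1)$, and $\varphi(a, b) = abc$ is well defined exactly when $mc = nc = 0$ in $\mathbb Z/k\mathbb Z$. The function names are hypothetical.

```python
from math import gcd

def bilinear_maps(m, n, k):
    """Values c = phi(1, 1) defining a bilinear map Z/m x Z/n -> Z/k.

    phi(a, b) = a*b*c is well defined exactly when m*c = n*c = 0 in Z/k.
    """
    return [c for c in range(k) if (m * c) % k == 0 and (n * c) % k == 0]

def homomorphisms(d, k):
    """Images c of 1 under homomorphisms Z/d -> Z/k: need d*c = 0 in Z/k."""
    return [c for c in range(k) if (d * c) % k == 0]

# Bilinear maps out of Z/m x Z/n match homomorphisms out of Z/gcd(m, n).
for m, n, k in [(4, 6, 12), (2, 3, 6), (8, 12, 16)]:
    assert bilinear_maps(m, n, k) == homomorphisms(gcd(m, n), k)

print(bilinear_maps(2, 3, 6))   # [0]
```

The printed list says that the only bilinear map $\mathbb Z/2\mathbb Z \times \mathbb Z/3\mathbb Z \to \mathbb Z/6\mathbb Z$ is zero, consistent with the tensor product being 0.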


Corollary 2.8 is a template example for a basic application of $\otimes$: tensor
products may be used to transfer constructions involving R (such as quotienting by an
ideal I) to constructions involving R-modules (such as quotienting by a corresponding
submodule). There are several instances of this operation; the reader will take
a look at localization in Exercise 2.5.

2.3. Exactness properties of tensor; flatness. It is important to remember
that the tensor product is not an exact functor: left-exactness may very well fail.
This can already be observed in the sequence appearing in the discussion following
Corollary 2.8: for an ideal I of R and an R-module N, the map

$$I \otimes_R N \to N$$

induced by the inclusion $I \subseteq R$ after tensoring by N may not be injective.

Example 2.11. Multiplication by 2 gives an inclusion

$$\mathbb Z \xrightarrow{\ \cdot 2\ } \mathbb Z$$

identifying the first copy of $\mathbb Z$ with the ideal (2) in the second copy. Tensoring by
$\mathbb Z/2\mathbb Z$ over $\mathbb Z$ (and keeping in mind that $R \otimes_R N \cong N$), we get the homomorphism

$$\frac{\mathbb Z}{2\mathbb Z} \xrightarrow{\ \cdot 2\ } \frac{\mathbb Z}{2\mathbb Z},$$

which sends both [0] and [1] to zero. This is the zero-morphism, and in particular
it is not injective.
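
The computation in Example 2.11 is small enough to tabulate. The sketch below assumes the identification $\mathbb Z \otimes_{\mathbb Z} \mathbb Z/n\mathbb Z \cong \mathbb Z/n\mathbb Z$ recalled in the example, under which the map induced by multiplication by k on $\mathbb Z$ becomes multiplication by k on $\mathbb Z/n\mathbb Z$; the function names are hypothetical.

```python
from math import gcd

def induced_map(k, n):
    """Multiplication by k on Z, after tensoring with Z/nZ: [x] -> [k*x]."""
    return {x: (k * x) % n for x in range(n)}

def is_injective(mapping):
    """A map on a finite set is injective when no two inputs share an image."""
    return len(set(mapping.values())) == len(mapping)

print(induced_map(2, 2))                 # {0: 0, 1: 0}
assert not is_injective(induced_map(2, 2))
assert is_injective(induced_map(3, 2))

# More generally, injectivity survives tensoring with Z/nZ iff gcd(k, n) = 1.
for n in range(1, 13):
    for k in range(1, 13):
        assert is_injective(induced_map(k, n)) == (gcd(k, n) == 1)
```

Multiplication by k is injective on $\mathbb Z$ for every nonzero k, so each pair with gcd(k, n) > 1 is another witness that tensoring by $\mathbb Z/n\mathbb Z$ does not preserve injectivity.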

On the other hand, if $N \cong R^{\oplus A}$ is free, then $\_ \otimes_R N$ is exact. Indeed, every
inclusion

$$M_1 \subseteq M_2$$

is mapped to $M_1 \otimes_R R^{\oplus A} \to M_2 \otimes_R R^{\oplus A}$, which is identified (via Corollary 2.6)
with the inclusion

$$M_1^{\oplus A} \subseteq M_2^{\oplus A}.$$

Example 2.12. Since vector spaces are free (Proposition VI.1.7), tensoring is exact
in k-Vect: if

$$0 \to V_1 \to V_2 \to V_3 \to 0$$

is an exact sequence of vector spaces and W is a k-vector space, then the induced
sequence

$$0 \to V_1 \otimes_k W \to V_2 \otimes_k W \to V_3 \otimes_k W \to 0$$

is exact on both sides.

The reader should now wonder whether it is useful to study a condition on
an R-module N, guaranteeing that the functor $\_ \otimes_R N$ is left-exact as well as
right-exact.

Definition 2.13. An R-module N is flat if the functor $\_ \otimes_R N$ is exact. ⌟
In the exercises the reader will explore easy properties of this notion and useful
equivalent formulations in particular cases.

We have already checked that $\mathbb{Z}/2\mathbb{Z}$ is not a flat $\mathbb{Z}$-module, while free modules
are flat. Flat modules are hugely important: in algebraic geometry, ‘flatness’ is
the condition expressing the fact that the objects in a family vary ‘continuously’,
preserving certain key invariants.

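Example 2.14. Consider the affine algebraic set V(xy) in the plane $\mathbb{A}^2$ (over a
fixed field k) and the ‘projection on the first coordinate’ $V(xy) \to \mathbb{A}^1$, $(x,y) \mapsto x$:

[Figure: the set V(xy) projecting onto the line $\mathbb{A}^1$, with the point 0 marked.]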

In terms of coordinate rings (cf. §VII.2.3), this map corresponds to the homomorphism
of k-algebras:

\[ k[x] \to \frac{k[x,y]}{(xy)} \]

defined by mapping x to the coset x + (xy) (this will be completely clear to the
reader who has worked out Exercise VII.2.12!). This homomorphism defines a
k[x]-module structure on k[x,y]/(xy), and we can wonder whether the latter is flat in
the sense of Definition 2.13. From the geometric point of view, clearly something
‘not flat’ is going on over the point x = 0, so we consider the inclusion of the ideal
(x) in k[x]:

\[ k[x] \xrightarrow{\;\cdot x\;} k[x]. \]

Tensoring by k[x,y]/(xy), we obtain

\[ \frac{k[x,y]}{(xy)} \xrightarrow{\;\cdot x\;} \frac{k[x,y]}{(xy)}, \]

which is not injective, because it sends to zero the nonzero coset y + (xy). Therefore
k[x,y]/(xy) is not flat as a k[x]-module.

The term flat was inspired precisely by such ‘geometric’ examples. ⌟
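For the reader who wishes to see the failure of injectivity in Example 2.14 spelled out, here is the one-line verification, written with the shorthand $N := k[x,y]/(xy)$ (our notation, introduced only for this display):

```latex
% Under the identifications (x) \cong k[x] and k[x] \otimes_{k[x]} N \cong N,
% the map induced by the inclusion (x) \subseteq k[x] is multiplication by x:
\[
  N \xrightarrow{\;\cdot x\;} N, \qquad
  y + (xy) \;\longmapsto\; xy + (xy) = 0 .
\]
% The coset y + (xy) is nonzero in N: every nonzero element of the ideal (xy)
% has degree at least 2, so y \notin (xy).
```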
2.4. The Tor functors. The ‘failure of exactness’ of the functor $\_ \otimes_R N$ is measured
by another functor R-Mod → R-Mod, called $\mathrm{Tor}_1^R(\_, N)$: if N is flat (for
example, if it is free), then $\mathrm{Tor}_1^R(M, N) = 0$ for all modules M. In fact (amazingly)
if

\[ 0 \to A \to B \to C \to 0 \]

is an exact sequence of R-modules, one obtains a new exact sequence after tensoring
by any N:

\[ \mathrm{Tor}_1^R(C, N) \to A \otimes_R N \to B \otimes_R N \to C \otimes_R N \to 0, \]

so if $\mathrm{Tor}_1^R(C, N) = 0$, then the module on the left vanishes; thus every short exact
sequence ending in C remains exact after tensoring by N in this case. In fact
(astonishingly) for all N one can continue this sequence with more Tor-modules,
obtaining a longer exact complex:

\[ \mathrm{Tor}_1^R(A, N) \to \mathrm{Tor}_1^R(B, N) \to \mathrm{Tor}_1^R(C, N) \to A \otimes_R N \to B \otimes_R N \to C \otimes_R N \to 0. \]


This is not the end of the story: the complex may be continued even further by
invoking new functors $\mathrm{Tor}_2^R(\_, N)$, $\mathrm{Tor}_3^R(\_, N)$, etc. These are the derived functors
of tensor. To ‘compute’ these functors, one may apply the following procedure:
given an R-module M, find a free resolution (§VI.4.2)

\[ \cdots \to R^{\oplus S_2} \to R^{\oplus S_1} \to R^{\oplus S_0} \to M \to 0; \]

throw M away, and tensor the free part by N, obtaining a complex $M_\bullet \otimes_R N$:

\[ \cdots \to N^{\oplus S_2} \to N^{\oplus S_1} \to N^{\oplus S_0} \to 0 \]

(recall again that tensor commutes with colimits, hence with direct sums, therefore
$R^{\oplus m} \otimes_R N \cong N^{\oplus m}$); then take the homology of this complex (cf. §III.7.3).
Astoundingly, this will not depend (up to isomorphism) on the chosen free resolution,
so we can define

\[ \mathrm{Tor}_i^R(M, N) := H_i(M_\bullet \otimes N). \]

For example, according to this definition $\mathrm{Tor}_0^R(M, N) = M \otimes_R N$ (Exercise 2.14),
and $\mathrm{Tor}_i^R(M, N) = 0$ for all i > 0 and all M if N is flat (because then tensoring by
N is an exact functor, so tensoring the resolution of M returns an exact sequence,
thus with no homology). In fact, this proves a remarkable property of the Tor
functors: if $\mathrm{Tor}_1^R(M, N) = 0$ for all M, then $\mathrm{Tor}_i^R(M, N) = 0$ for all i > 0 for all
modules M. Indeed, N is then flat.
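The procedure just described can be tried out in the simplest case. The following Python sketch is an illustration under our own assumptions (the function name `tor_of_cyclic_groups` is ours): it takes R = Z, M = Z/mZ, N = Z/nZ with m, n positive, uses the free resolution 0 → Z → Z → Z/mZ → 0 given by multiplication by m, tensors the free part by Z/nZ, and computes the homology of the resulting two-term complex by brute force.

```python
from math import gcd


def tor_of_cyclic_groups(m: int, n: int) -> tuple:
    """Return (|Tor_0|, |Tor_1|) for Z/mZ and Z/nZ over Z, with m, n >= 1.

    Tensoring the free part of 0 -> Z --m--> Z -> Z/mZ -> 0 by Z/nZ gives
    the complex 0 -> Z/nZ --m--> Z/nZ -> 0, whose homology is
    H_0 = cokernel and H_1 = kernel of multiplication by m on Z/nZ.
    """
    elements = range(n)
    kernel = [x for x in elements if (m * x) % n == 0]
    image = {(m * x) % n for x in elements}
    cokernel_order = n // len(image)
    return cokernel_order, len(kernel)


# Both orders agree with gcd(m, n), as Example 2.10 predicts for Tor_0.
print(tor_of_cyclic_groups(4, 6), gcd(4, 6))
```

Only the orders of the homology groups are computed here; identifying them as cyclic groups is left to the reader (compare Exercise 2.15).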

At this point you may feel that something is a little out of balance: why
focus on the functor $\_ \otimes_R N$, rather than $M \otimes_R \_$? Since $M \otimes_R N$ is canonically
isomorphic to $N \otimes_R M$ (in the commutative case; cf. Example 2.2), we could expect
the same to apply to every $\mathrm{Tor}_i^R$: $\mathrm{Tor}_i^R(M, N)$ ought to be canonically isomorphic
to $\mathrm{Tor}_i^R(N, M)$ for all i. Equivalently, we should be able to compute $\mathrm{Tor}_i^R(M, N)$
as the homology of $M \otimes_R N_\bullet$, where $N_\bullet$ is a free resolution of N. This is indeed
the case.

In due time (§§IX.7 and 8) we will prove this and all the other wonderful facts
we have stated in this subsection. For now, we are asking the reader to believe that
the Tor functors can be defined as we have indicated, and the facts reviewed here
will suffice for simple computations (see for example Exercises 2.15 and 2.17) and
applications.

In fact, we know enough about finitely generated modules over PIDs to get a
preliminary sense of what is involved in proving such general facts. Recall that we
have been able to establish that every finitely generated module M over a PID R
has a free resolution of length 1:

\[ 0 \to R^{\oplus m_1} \to R^{\oplus m_0} \to M \to 0. \]

This property characterizes PIDs (Proposition VI.5.4). If

\[ 0 \to A \to B \to C \to 0 \]

is an exact sequence of R-modules, it is not hard to see that one can produce 
‘compatible’ resolutions, in the sense that the rows of the following diagram will be 
exact as well as the columns: 

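\[
\begin{array}{ccccccccc}
 & & 0 & & 0 & & 0 & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \to & R^{\oplus a_1} & \to & R^{\oplus b_1} & \to & R^{\oplus c_1} & \to & 0 \\
 & & \downarrow{\scriptstyle\alpha} & & \downarrow{\scriptstyle\beta} & & \downarrow{\scriptstyle\gamma} & & \\
0 & \to & R^{\oplus a_0} & \to & R^{\oplus b_0} & \to & R^{\oplus c_0} & \to & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \to & A & \to & B & \to & C & \to & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & 0 & & 0 & & 0 & &
\end{array}
\]

(This will be proven in gory detail in §IX.7.) Tensor the two ‘free’ rows by N; they
remain exact (tensoring commutes with direct sums):

\[
\begin{array}{ccccccccc}
0 & \to & N^{\oplus a_1} & \to & N^{\oplus b_1} & \to & N^{\oplus c_1} & \to & 0 \\
 & & \downarrow{\scriptstyle\alpha \otimes N} & & \downarrow{\scriptstyle\beta \otimes N} & & \downarrow{\scriptstyle\gamma \otimes N} & & \\
0 & \to & N^{\oplus a_0} & \to & N^{\oplus b_0} & \to & N^{\oplus c_0} & \to & 0
\end{array}
\]

Now the columns (preceded and followed by 0) are precisely the complexes $A_\bullet \otimes_R N$,
$B_\bullet \otimes_R N$, $C_\bullet \otimes_R N$ whose homology ‘computes’ the Tor modules. Applying the
snake lemma (Lemma III.7.8; cf. Remark III.7.10) gives the exact sequence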


\[
\begin{aligned}
0 \to H_1(A_\bullet \otimes_R N) \to H_1(B_\bullet \otimes_R N) &\to H_1(C_\bullet \otimes_R N) \\
\to H_0(A_\bullet \otimes_R N) \to H_0(B_\bullet \otimes_R N) &\to H_0(C_\bullet \otimes_R N) \to 0,
\end{aligned}
\]

which is precisely the sequence of Tor modules conjured up above,

\[
\begin{aligned}
0 \to \mathrm{Tor}_1^R(A, N) \to \mathrm{Tor}_1^R(B, N) &\to \mathrm{Tor}_1^R(C, N) \\
\to A \otimes_R N \to B \otimes_R N &\to C \otimes_R N \to 0,
\end{aligned}
\]

with a 0 on the left for good measure (due to the fact that $\mathrm{Tor}_2^R$ vanishes if R is a
PID; cf. Exercise 2.17).
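As a sanity check, the sequence of Example 2.11 fits this pattern. The display below is a worked instance under the following assumptions: $R = \mathbb{Z}$, the short exact sequence is $0 \to \mathbb{Z} \to \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0$ (multiplication by 2 followed by the projection), and $N = \mathbb{Z}/2\mathbb{Z}$. The value of $\mathrm{Tor}_1$ on the right is the one predicted by Exercise 2.15.

```latex
% Tor_1^Z(Z, N) = 0 because Z is free; Tor_1^Z(Z/2Z, Z/2Z) is the 2-torsion
% of Z/2Z, that is, Z/2Z itself. The six-term sequence therefore reads
\[
  0 \to 0 \to 0 \to \frac{\mathbb{Z}}{2\mathbb{Z}}
    \xrightarrow{\;\cong\;} \frac{\mathbb{Z}}{2\mathbb{Z}}
    \xrightarrow{\;0\;} \frac{\mathbb{Z}}{2\mathbb{Z}}
    \xrightarrow{\;\cong\;} \frac{\mathbb{Z}}{2\mathbb{Z}} \to 0 .
\]
% The zero map in the middle is the homomorphism of Example 2.11; its kernel
% is accounted for exactly by the Tor_1 term to its left.
```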

Note that $\mathrm{Tor}_i^k$ vanishes for i > 0 if k is a field, as vector spaces are flat,
and $\mathrm{Tor}_i^R$ vanishes for i > 1 if R is a PID (Exercise 2.17). These facts are not
surprising, in view of the procedure described above for computing Tor and of the
considerations at the end of §VI.5.2: a bound on the length of free resolutions for
modules over a ring R will imply a bound on nonzero Tor’s. For particularly nice
rings (such as the rings corresponding to ‘smooth’ points in algebraic geometry)
this bound agrees with the Krull dimension; but precise results of this sort are
beyond the scope of this book.


Exercises 

R denotes a fixed commutative ring. 

2.1. ▷ Let M, N be R-modules, and assume that N is cyclic. Prove that every
element of $M \otimes_R N$ may be written as a pure tensor. [§2.1]

2.2. ▷ Prove ‘by hand’ (that is, without appealing to the right-exactness of tensor)
that $\mathbb{Z}/n\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/m\mathbb{Z} \cong 0$ if m, n are relatively prime integers. [§2.2]

2.3. Prove that $R[x_1, \dots, x_n] \otimes_R R[y_1, \dots, y_m] \cong R[x_1, \dots, x_n, y_1, \dots, y_m]$.

2.4. ¬ Let S, T be commutative R-algebras. Verify the following:

• The tensor product $S \otimes_R T$ has an operation of multiplication, defined on pure
tensors by $(s_1 \otimes t_1) \cdot (s_2 \otimes t_2) := s_1 s_2 \otimes t_1 t_2$ and making it into a commutative
R-algebra.

• With respect to this structure, there are R-algebra homomorphisms $i_S : S \to S \otimes T$,
resp., $i_T : T \to S \otimes T$, defined by $i_S(s) := s \otimes 1$, $i_T(t) := 1 \otimes t$.

• The R-algebra $S \otimes_R T$, with these two structure homomorphisms, is a coproduct
of S and T in the category of commutative R-algebras: if U is a commutative
R-algebra and $f_S : S \to U$, $f_T : T \to U$ are R-algebra homomorphisms, then
there exists a unique R-algebra homomorphism $f_S \otimes f_T$ making the following
diagram commute:
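\[
\begin{array}{ccccc}
S & \xrightarrow{\;i_S\;} & S \otimes_R T & \xleftarrow{\;i_T\;} & T \\
 & {\scriptstyle f_S}\searrow & \downarrow{\scriptstyle f_S \otimes f_T} & \swarrow{\scriptstyle f_T} & \\
 & & U & &
\end{array}
\]

In particular, if S and T are simply commutative rings, then $S \otimes_{\mathbb{Z}} T$ is a coproduct
of S and T in the category of commutative rings. This settles an issue left open at
the end of §III.2.4. [2.10]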

2.5. ▷ (Cf. Exercises V.4.7 and V.4.8.) Let S be a multiplicative subset of R, and
let M be an R-module. Prove that $S^{-1}M \cong M \otimes_R S^{-1}R$ as R-modules. (Use the
universal property of the tensor product.)

Through this isomorphism, $M \otimes_R S^{-1}R$ inherits an $S^{-1}R$-module structure.
[§2.2, 2.8, 2.12, 3.4]

2.6. ¬ (Cf. Exercises V.4.7 and V.4.8.) Let S be a multiplicative subset of R, and
let M be an R-module.

• Let N be an $S^{-1}R$-module. Prove that $(S^{-1}M) \otimes_{S^{-1}R} N \cong M \otimes_R N$.

• Let A be an R-module. Prove that $(S^{-1}A) \otimes_R M \cong S^{-1}(A \otimes_R M)$.

(Both can be done ‘by hand’, by analyzing the construction in Lemma 2.3. For
example, there is a homomorphism $M \otimes_R N \to (S^{-1}M) \otimes_{S^{-1}R} N$ which is surjective
because, with evident notation, $\frac{m}{s} \otimes n = m \otimes \frac{n}{s}$ in $(S^{-1}M) \otimes_{S^{-1}R} N$; checking that
it is injective amounts to easy manipulation of the relations defining the two tensor
products.

Both isomorphisms will be easy consequences of the associativity of tensor
products: cf. Exercise 3.4.) [2.21, 3.4]

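2.7. Changing the base ring in a tensor may or may not make a difference:

• Prove that $\mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Q} \cong \mathbb{Q} \otimes_{\mathbb{Q}} \mathbb{Q}$.

• Prove that $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C} \not\cong \mathbb{C} \otimes_{\mathbb{C}} \mathbb{C}$.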

2.8. Let R be an integral domain, with field of fractions K, and let M be a finitely
generated R-module. The tensor product $V := M \otimes_R K$ is a K-vector space
(Exercise 2.5). Prove that $\dim_K V$ equals the rank of M as an R-module, in the
sense of Definition VI.5.5.

2.9. Let G be a finitely generated abelian group of rank r. Prove that $G \otimes_{\mathbb{Z}} \mathbb{Q} \cong \mathbb{Q}^r$.
Prove that for infinitely many primes p, $G \otimes_{\mathbb{Z}} (\mathbb{Z}/p\mathbb{Z}) \cong (\mathbb{Z}/p\mathbb{Z})^r$.

2.10. Let $k \subseteq k(\alpha) = F$ be a finite simple field extension. Note that $F \otimes_k F$ has a
natural ring structure: cf. Exercise 2.4.

• Prove that $\alpha$ is separable over k if and only if $F \otimes_k F$ is reduced as a ring.

• Prove that $k \subseteq F$ is Galois if and only if $F \otimes_k F$ is isomorphic to $F^{[F:k]}$ as a
ring.

(Use Corollary 2.8 to ‘compute’ the tensor. The CRT from §V.6.1 will likely be
helpful.)

2.11. ▷ Complete the proof of Lemma 2.4. [§2.2]

2.12. Let S be a multiplicative subset of R (cf. Exercise V.4.7). Prove that $S^{-1}R$
is flat over R. (Hint: Exercises 2.5 and 1.25.)

2.13. Prove that direct sums of flat modules are flat. 






2.14. ▷ Prove that, according to the definition given in §2.4, $\mathrm{Tor}_0^R(M, N)$ is isomorphic
to $M \otimes_R N$. [§2.4]

2.15. ▷ Prove that for $r \in R$ a non-zero-divisor and N an R-module, the module
$\mathrm{Tor}_1^R(R/(r), N)$ is isomorphic to the r-torsion of N, that is, the submodule of
elements $n \in N$ such that rn = 0 (cf. §VI.4.1). (This is the reason why Tor is
called Tor.) [§2.4, 6.21]

2.16. Let I, J be ideals of R. Prove that $\mathrm{Tor}_1^R(R/I, R/J) \cong (I \cap J)/IJ$. (For example,
this $\mathrm{Tor}_1^R$ vanishes if I + J = R, by Lemma V.6.2.) Prove that $\mathrm{Tor}_i^R(R/I, R/J)$
is isomorphic to $\mathrm{Tor}_{i-1}^R(I, R/J)$ for i > 1.

2.17. ▷ Let M, N be modules over a PID R. Prove that $\mathrm{Tor}_i^R(M, N) = 0$ for $i \ge 2$.
(Assume M, N are finitely generated, for simplicity.) [§2.4]

2.18. Let R be an integral domain. Prove that a cyclic R-module is flat if and only
if it is free.

2.19. ¬ The following criterion is quite useful.

• Prove that an R-module M is flat if and only if every monomorphism of R-modules
$A \hookrightarrow B$ induces a monomorphism of R-modules $A \otimes_R M \hookrightarrow B \otimes_R M$.

• Prove that it suffices to verify this condition for all finitely generated modules B.
(Hint: For once, refer back to the construction of tensor products given in
Lemma 2.3. An element $\sum_i a_i \otimes m_i \in A \otimes_R M$ goes to zero in $B \otimes_R M$ if the
corresponding element $\sum_i (a_i, m_i)$ equals a combination of the relations defining
$B \otimes_R M$ in the free R-module $F^R(B \times M)$. This will be an identity involving
only finitely many elements of B; hence ...)

• Prove that it suffices to verify this condition when B = R and A = I is an ideal
of R. (Hint: We may now assume that B is finitely generated. Find submodules
$B_j$ such that $A = B_0 \subseteq B_1 \subseteq \cdots \subseteq B_r = B$, with each $B_j/B_{j-1}$ cyclic. Reduce
to verifying that $A \otimes_R M$ injects in $B \otimes_R M$ when B/A is cyclic, hence $\cong R/I$
for some ideal I. Conclude by a $\mathrm{Tor}_1^R$ argument or (but this requires a little
more stamina) by judicious use of the snake lemma.)

• Deduce that an R-module M is flat if and only if the natural homomorphism
$I \otimes_R M \to IM$ is an isomorphism for every ideal I of R.

If you believe in Tor’s, now you can also show that an R-module M is flat if and
only if $\mathrm{Tor}_1^R(R/I, M) = 0$ for all ideals I of R. [2.20]

2.20. Let R be a PID. Prove that an R-module M is flat if and only if it is torsion-free.
(If M is finitely generated, the classification theorem of §VI.5.3 makes this
particularly easy. Otherwise, use Exercise 2.19.)

Geometrically, this says roughly that an algebraic set fails to be ‘flat’ over a
nonsingular curve if and only if some component of the set is contracted to a point.
This phenomenon is displayed in the picture in Example 2.14.

2.21. ¬ (Cf. Exercise V.4.11.) Prove that flatness is a local property: an R-module
M is flat if and only if $M_{\mathfrak{p}}$ is a flat $R_{\mathfrak{p}}$-module for all prime ideals $\mathfrak{p}$, if and only if
$M_{\mathfrak{m}}$ is a flat $R_{\mathfrak{m}}$-module for all maximal ideals $\mathfrak{m}$. (Hint: Use Exercises 1.25 and 2.6.
The $\Longrightarrow$ direction will be straightforward. For the converse, let $A \subseteq B$ be R-modules,
and let K be the kernel of the induced homomorphism $A \otimes_R M \to B \otimes_R M$.
Prove that the kernel of the localized homomorphism $A_{\mathfrak{m}} \otimes_{R_{\mathfrak{m}}} M_{\mathfrak{m}} \to B_{\mathfrak{m}} \otimes_{R_{\mathfrak{m}}} M_{\mathfrak{m}}$
is isomorphic to $K_{\mathfrak{m}}$, and use Exercise V.4.12.) [2.22]

2.22. ¬ Let M, N be R-modules, and let S be a multiplicative subset of R. Use the
definition of Tor given in §2.4 to show $S^{-1}\mathrm{Tor}_i^R(M, N) \cong \mathrm{Tor}_i^{S^{-1}R}(S^{-1}M, S^{-1}N)$.
(Use Exercise 1.25.) Use this fact to give a leaner proof that flatness is a local
property (Exercise 2.21). [2.25]

2.23. ▷ Let

\[ 0 \to M \to N \to P \to 0 \]

be an exact sequence of R-modules, and assume that P is flat.

• Prove that M is flat if and only if N is flat.

• Prove that for all R-modules Q, the induced sequence

\[ 0 \to M \otimes_R Q \to N \otimes_R Q \to P \otimes_R Q \to 0 \]

is exact.

[2.24, §5.4]

2.24. ¬ Let R be a commutative Noetherian local ring with (single) maximal
ideal $\mathfrak{m}$, and let M be a finitely generated flat R-module.

• Choose elements $m_1, \dots, m_r \in M$ whose cosets mod $\mathfrak{m}M$ are a basis of $M/\mathfrak{m}M$
as a vector space over the field $R/\mathfrak{m}$. By Nakayama’s lemma, $M = \langle m_1, \dots, m_r \rangle$
(Exercise VI.3.10).

• Obtain an exact sequence

\[ 0 \to N \to R^{\oplus r} \to M \to 0, \]

where N is finitely generated.

• Prove that this sequence induces an exact sequence

\[ 0 \to N/\mathfrak{m}N \to (R/\mathfrak{m})^{\oplus r} \to M/\mathfrak{m}M \to 0. \]

(Use Exercise 2.23.)

• Deduce that N = 0. (Nakayama.)

• Conclude that M is free.

Thus, a finitely generated module over a (Noetherian^11) local ring is flat if and only
if it is free. Compare with Exercise VI.5.5. [2.25, 6.8, 6.12]

2.25. Let R be a commutative Noetherian ring, and let M be a finitely generated
R-module. Prove that

\[ M \text{ is flat} \iff \mathrm{Tor}_1^R(M, R/\mathfrak{m}) = 0 \text{ for every maximal ideal } \mathfrak{m} \text{ of } R. \]

11 The Noetherian hypothesis is actually unnecessary, but it simplifies the proof by allowing
the use of Nakayama’s lemma.

(Use Exercise 2.21, and refine the argument you used in Exercise 2.24; remember
that Tor localizes, by Exercise 2.22. The Noetherian hypothesis is actually
unnecessary, but the proofs are harder without it.)


3. Base change 

We have championed several times the viewpoint that deep properties of a ring R 
are encoded in the category R-Mod of R-modules; one extreme position is to simply 
replace R with R-Mod as the main object of study. The question then arises as to 
how to deal with ring homomorphisms from this point of view, or more generally 
how the categories R-Mod, S-Mod of modules over two (commutative) rings R,
S may relate to each other. The reader should expect this to happen by way of 
functors between the two categories and that the situation at the categorical level 
will be substantially richer than at the ring level. 


3.1. Balanced maps. Before we can survey the basic definitions, we must upgrade
our understanding of tensor products. It turns out that $M \otimes_R N$ satisfies a
more encompassing universal property than the one examined in §2.1. Let M, N
be modules over a commutative ring R, as in §2.1, and let G be an abelian group,
i.e., a $\mathbb{Z}$-module.

Definition 3.1. A $\mathbb{Z}$-bilinear map $\varphi : M \times N \to G$ is R-balanced if $\forall m \in M$,
$\forall n \in N$, $\forall r \in R$,

\[ \varphi(rm, n) = \varphi(m, rn). \] ⌟

If G is an R-module and $\varphi : M \times N \to G$ is R-bilinear, then it is R-balanced^12.
But in general the notion of ‘R-balanced’ appears to be quite a bit more general,
since G is not even required to be an R-module. This may lead the reader to 
suspect that a solution to the universal problem of factoring balanced maps may 
be a different gadget than the ‘ordinary’ tensor product, but we are in luck in this 
case, and the ordinary tensor product does the universal job for balanced maps as 
well. 


To understand this, recall that we constructed $M \otimes_R N$ as a quotient

\[ M \otimes_R N = \frac{R^{\oplus (M \times N)}}{K}, \]

where K is generated by the relations necessary to imply that the map

\[ M \times N \to R^{\oplus (M \times N)} \to \frac{R^{\oplus (M \times N)}}{K} \]

is R-bilinear. We have observed that every element of $M \otimes_R N$ may be written as
a linear combination of pure tensors:

\[ \sum_i r_i (m_i \otimes n_i) = \sum_i (r_i m_i) \otimes n_i; \]

12 Also note that if R is not commutative and M, resp., N, carries a right-, resp., left-, R-module
structure, then the notion of ‘balanced map’ makes sense. This leads to the definition of
tensor (as an abelian group) in the noncommutative case.
it follows that the group homomorphism

\[ \mathbb{Z}^{\oplus (M \times N)} \to M \otimes_R N \]

defined on generators by $(m, n) \mapsto m \otimes n$ is surjective; its kernel $K_B$ consists of the
combinations

\[ \sum_i (m_i, n_i) \quad \text{such that} \quad \sum_i (m_i, n_i) \in K, \]

where the sum on the right is viewed in $R^{\oplus (M \times N)}$. The reader will verify (Exercise 3.1)
that $K_B$ is generated by elements of the form

\[ (m, n_1 + n_2) - (m, n_1) - (m, n_2), \]
\[ (m_1 + m_2, n) - (m_1, n) - (m_2, n), \]
\[ (rm, n) - (m, rn) \]

(with $m, m_1, m_2 \in M$, $n, n_1, n_2 \in N$, $r \in R$). Therefore, we have an induced
isomorphism of abelian groups

\[ \frac{\mathbb{Z}^{\oplus (M \times N)}}{K_B} \cong \frac{R^{\oplus (M \times N)}}{K}, \tag{*} \]

which amounts to an alternative description of $M \otimes_R N$. The point of this observation
is that the group on the left-hand side of (*) is manifestly a solution to the
universal problem of factoring $\mathbb{Z}$-bilinear, R-balanced maps. Therefore, we have
proved

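Lemma 3.2. Let R be a commutative ring; let M, N be R-modules, and let G be
an abelian group. Then every $\mathbb{Z}$-bilinear, R-balanced map $\varphi : M \times N \to G$ factors
through $M \otimes_R N$; that is, there exists a unique group homomorphism $\overline{\varphi} : M \otimes_R N \to G$
such that the diagram

\[
\begin{array}{ccc}
M \times N & \xrightarrow{\;\varphi\;} & G \\
{\scriptstyle\otimes}\downarrow & \nearrow{\scriptstyle\overline{\varphi}} & \\
M \otimes_R N & &
\end{array}
\]

commutes.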


The universal property explored in §2.1 is recovered as the statement that if
G is an R-module and $\varphi$ is R-bilinear, then the induced group homomorphism
$M \otimes_R N \to G$ is in fact an R-linear map.

Remark 3.3. Balanced maps $\varphi : M \times N \to G$ may be defined as soon as M is a
right-R-module and N is a left-R-module, even if R is not commutative: require
$\varphi(mr, n) = \varphi(m, rn)$ for all $m \in M$, $n \in N$, $r \in R$. The abelian group defined
by the left-hand side of (*) still makes sense and is taken as the definition of the
tensor product $M \otimes_R N$; but note that this does not carry an R-module structure
in general. This structure is recovered if, e.g., M is a two-sided R-module. ⌟
3.2. Bimodules; adjunction again. The enhanced universal property for the
tensor will allow us to upgrade the adjunction formula given in Lemma 2.4. This
requires the introduction of yet another notion.

Definition 3.4. Let R, S be two commutative^13 rings. An (R, S)-bimodule is an
abelian group N endowed with compatible R-module and S-module structures, in
the sense that $\forall n \in N$, $\forall r \in R$, $\forall s \in S$,

\[ r(sn) = s(rn). \] ⌟

For example, as R is commutative, every R-module N is an (R, R)-bimodule:
$\forall r_1, r_2 \in R$ and $\forall n \in N$,

\[ r_1(r_2 n) = (r_1 r_2) n = (r_2 r_1) n = r_2 (r_1 n). \]

If M is an R-module and N is an (R, S)-bimodule, then the tensor product
$M \otimes_R N$ acquires an S-module structure: define the action of $s \in S$ on pure tensors
$m \otimes n$ by

\[ s(m \otimes n) := m \otimes (sn), \]

and extend to all tensors by linearity. In fact, this gives $M \otimes_R N$ an (R, S)-bimodule
structure.

Similarly, if N is an (R, S)-bimodule and P is an S-module, then the abelian
group $\mathrm{Hom}_S(N, P)$ is an (R, S)-bimodule: the R-module structure is defined by
setting $(r\alpha)(n) = \alpha(rn)$ for all $r \in R$, $n \in N$, and $\alpha \in \mathrm{Hom}_S(N, P)$.

This mess is needed to even make sense of the promised upgrade of adjunction. 
As with most such results, the proof is not difficult once one understands what the 
statement says. 

Lemma 3.5. Suppose M is an R-module, N is an (R, S)-bimodule, and P is an
S-module. Then there is a canonical isomorphism of abelian groups

\[ \mathrm{Hom}_R(M, \mathrm{Hom}_S(N, P)) \cong \mathrm{Hom}_S(M \otimes_R N, P). \]

Proof. Every element $\alpha \in \mathrm{Hom}_R(M, \mathrm{Hom}_S(N, P))$ determines a map

\[ \varphi : M \times N \to P, \]

via $\varphi(m, \_) := \alpha(m)$; $\varphi$ is clearly $\mathbb{Z}$-bilinear. Further, for all $r \in R$, $m \in M$, $n \in N$:

\[ \varphi(rm, n) = \alpha(rm)(n) \overset{1}{=} r\alpha(m)(n) \overset{2}{=} \alpha(m)(rn) = \varphi(m, rn), \]

where $\overset{1}{=}$ holds by the R-linearity of $\alpha$ and $\overset{2}{=}$ holds by the definition of the
R-module structure on $\mathrm{Hom}_S(N, P)$. Thus $\varphi$ is R-balanced. By Lemma 3.2, such a
map determines (and is determined by) a homomorphism of abelian groups

\[ \overline{\varphi} : M \otimes_R N \to P, \]

13 Once more, the noncommutative case would be very worthwhile pursuing, but (not without
misgivings) we have decided otherwise. In this more general case one requires N to be both a
left-R-module and a right-S-module with the compatibility expressed by (rn)s = r(ns) for all
choices of $n \in N$, $r \in R$, $s \in S$. This type of bookkeeping is precisely what is needed in order to
extend the theory to the noncommutative case.
such that $\varphi(m, n) = \overline{\varphi}(m \otimes n)$. We claim that $\overline{\varphi}$ is S-linear. Indeed, $\forall s \in S$ and
for all pure tensors $m \otimes n$,

\[ \overline{\varphi}(s(m \otimes n)) = \overline{\varphi}(m \otimes (sn)) = \varphi(m, sn) = \alpha(m)(sn) = s\alpha(m)(n) = s\varphi(m, n) = s\overline{\varphi}(m \otimes n), \]

where we have used the S-linearity of $\alpha(m)$. Thus,

\[ \overline{\varphi} \in \mathrm{Hom}_S(M \otimes_R N, P). \]

Tracing the argument backwards, every element of $\mathrm{Hom}_S(M \otimes_R N, P)$ determines
an element of $\mathrm{Hom}_R(M, \mathrm{Hom}_S(N, P))$, and these two correspondences are
clearly inverses of each other. □

If R = S, we recover the adjunction formula of Lemma 2.4; note that in this
case the isomorphism is clearly R-linear.

3.3. Restriction and extension of scalars. Coming back to the theme mentioned
at the beginning of this section, consider the case in which we have a homomorphism
f : R → S of (commutative) rings. It is natural to look for functors
between the categories R-Mod and S-Mod of modules over R, S, respectively. There
is a rather simple-minded functor from S-Mod to R-Mod (‘restriction of scalars’),
while tensor products allow us to define a functor from R-Mod to S-Mod (‘extension
of scalars’). A third important functor R-Mod → S-Mod may be defined, also
‘extending scalars’, but for which we do not know a good name.

Restriction of scalars. Let f : R → S be a ring homomorphism, and let N be
an S-module. Recall (§III.5.1) that this means that we have chosen an action of
the ring S on the abelian group N, that is, a ring homomorphism

\[ \sigma : S \to \mathrm{End}_{\mathrm{Ab}}(N). \]

Composing with f,

\[ \sigma \circ f : R \to S \to \mathrm{End}_{\mathrm{Ab}}(N) \]

defines an action of R on the abelian group N, and hence an R-module structure
on N.

(Even) more explicitly, if $r \in R$ and $n \in N$, define the action of r on n by
setting

\[ rn := f(r)n. \]

Since S is commutative, this defines in fact an (R, S)-bimodule structure on N.
Further, S-linear homomorphisms are in particular R-linear; this assignment is
(covariantly) functorial S-Mod → R-Mod.

If f is injective, so that R may be viewed as a subring of S, then all we are doing
is viewing N as a module on a ‘restricted’ range of scalars, hence the terminology.
For example, this is how we view a complex vector space as a real vector space, in
the simplest possible way.
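The complex-to-real example can be made concrete. The following Python sketch is an illustration under our own assumptions: vectors of C^n are modeled as lists of Python complex numbers, f is the inclusion of the reals in the complex numbers, and the names `restrict_scalars` and `act` are ours. A real scalar r acts on v through f(r)v, and v is recorded by 2n real coordinates.

```python
def act(r: float, v: list) -> list:
    """Action of a real scalar r on v in C^n, defined as f(r) * v,
    where f is the inclusion of the reals in the complex numbers."""
    return [complex(r, 0.0) * z for z in v]


def restrict_scalars(v: list) -> list:
    """Real coordinates of v in C^n with respect to the real basis
    e_1, i*e_1, ..., e_n, i*e_n."""
    coords = []
    for z in v:
        coords.extend([z.real, z.imag])
    return coords


v = [1 + 2j, 3 - 1j]
# The real coordinate vector has twice as many entries as v,
# and the action of r = 2 scales each real coordinate by 2.
print(restrict_scalars(v))
print(restrict_scalars(act(2.0, v)))
```

Nothing about the underlying abelian group changes in this process; only the range of available scalars does.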

We will denote by $f_*$ this functor S-Mod → R-Mod induced from f by restriction
of scalars. Note that $f_*$ is trivially exact, because the kernels and images of a
homomorphism of modules are the same regardless of the base ring. In view of the
considerations in Example 1.18, this hints that $f_*$ may have both a left-adjoint and
a right-adjoint functor, and this will be precisely the case (Proposition 3.6).

Extension of scalars is defined from R-Mod to S-Mod, by associating to an R-module
M the tensor product $f^*(M) := M \otimes_R S$, which (as we have seen in §3.2)
carries naturally an S-module structure. This association is evidently covariantly
functorial. If

\[ R^{\oplus B} \to R^{\oplus A} \to M \to 0 \]

is a presentation of M (cf. §VI.4.2), tensoring by S gives (by the right-exactness of
tensor) a presentation of $f^*(M)$:

\[ S^{\oplus B} \to S^{\oplus A} \to f^*(M) \to 0. \]

Intuitively, this says that $f^*(M)$ is the module defined by ‘the same generators and
relations’ as M, but with coefficients in S.
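A small worked instance may help fix ideas. The choices below are ours and serve only as an illustration: we take $f : \mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}$ to be the natural projection and $M = \mathbb{Z}/6\mathbb{Z}$, presented by one generator and one relation.

```latex
% Presentation of M = Z/6Z over Z: one generator, one relation (6 times the generator).
\[
  \mathbb{Z} \xrightarrow{\;\cdot 6\;} \mathbb{Z} \to \frac{\mathbb{Z}}{6\mathbb{Z}} \to 0 .
\]
% Tensoring by S = Z/4Z keeps the same generator and relation, read in Z/4Z,
% where 6 = 2:
\[
  \frac{\mathbb{Z}}{4\mathbb{Z}} \xrightarrow{\;\cdot 6 \,=\, \cdot 2\;} \frac{\mathbb{Z}}{4\mathbb{Z}}
    \to f^*(M) \to 0,
  \qquad\text{so}\qquad
  f^*(M) \cong \frac{\mathbb{Z}/4\mathbb{Z}}{(2)} \cong \frac{\mathbb{Z}}{2\mathbb{Z}} .
\]
% This agrees with Example 2.10, since gcd(6, 4) = 2.
```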

The third functor, denoted $f^!$, also acts from R-Mod to S-Mod and is yet
another natural way to combine the ingredients we have at our disposal: if M is an
R-module, we have pointed out^14 in §3.2 that

\[ f^!(M) := \mathrm{Hom}_R(S, M) \]

may be given a natural S-module structure (by setting $s\alpha(s') := \alpha(ss')$). This is
again evidently a covariantly functorial prescription.

Proposition 3.6. Let f : R → S be a homomorphism of commutative rings. Then,
with notation as above, $f_*$ is right-adjoint to $f^*$ and left-adjoint to $f^!$. In particular,
$f_*$ is exact, $f^*$ is right-exact, and $f^!$ is left-exact.

Proof. Let M, resp., N, be an R-module, resp., an S-module. Note that, trivially,
$\mathrm{Hom}_S(S, N)$ is canonically isomorphic to N (as an S-module) and to $f_*(N)$ (as an
R-module). Thus^15

\[ \mathrm{Hom}_R(M, f_*(N)) \cong \mathrm{Hom}_R(M, \mathrm{Hom}_S(S, N)) \cong \mathrm{Hom}_S(M \otimes_R S, N) = \mathrm{Hom}_S(f^*(M), N), \]

where we have used Lemma 3.5. These bijections are canonical^16, proving that $f^*$
is left-adjoint to $f_*$.

Similarly, there is a canonical isomorphism $N \cong N \otimes_S S$ (Example 2.2); thus
$N \otimes_S S \cong f_*(N)$ as R-modules, and for every R-module M

\[ \mathrm{Hom}_R(f_*(N), M) \cong \mathrm{Hom}_R(N \otimes_S S, M) \cong \mathrm{Hom}_S(N, \mathrm{Hom}_R(S, M)) = \mathrm{Hom}_S(N, f^!(M)), \]

again by Lemma 3.5. This shows that $f^!$ is right-adjoint to $f_*$, concluding the
proof. □


14 The roles of R and S were reversed in §3.2.

15 These are isomorphisms of abelian groups, and in fact isomorphisms of R-modules if one
applies $f_*$ to the $\mathrm{Hom}_S$ terms.

16 The reader should check this.
Remark 3.7 (Warning). Our choice of notation, $f^*$, etc., is somewhat nonstandard,
and the reader should not take it too literally. It is inspired by analogs in the
context of sheaf theory over schemes, but some of the properties reviewed above
require crucial adjustments in that wider context: for example $f_*$ is not exact as a
sheaf operation on schemes. ⌟


Exercises 


In the following exercises, R, S denote commutative rings.

3.1. ▷ Verify that a combination of pure tensors $\sum_i (m_i \otimes n_i)$ is zero in the tensor
product $M \otimes_R N$ if and only if $\sum_i (m_i, n_i) \in \mathbb{Z}^{\oplus (M \times N)}$ is a combination of elements
of the form

\[ (m, n_1 + n_2) - (m, n_1) - (m, n_2), \]
\[ (m_1 + m_2, n) - (m_1, n) - (m_2, n), \]
\[ (rm, n) - (m, rn), \]

with $m, m_1, m_2 \in M$, $n, n_1, n_2 \in N$, $r \in R$. [§3.1]

3.2. If f : R → S is a ring homomorphism and M, N are S-modules (hence R-modules
by restriction of scalars), prove that there is a canonical homomorphism
of R-modules $M \otimes_R N \to M \otimes_S N$.

3.3. ▷ Let R, S be commutative rings, and let M be an R-module, N an (R, S)-bimodule,
and P an S-module. Prove that there is an isomorphism of R-modules

\[ M \otimes_R (N \otimes_S P) \cong (M \otimes_R N) \otimes_S P. \]

In this sense, $\otimes$ is ‘associative’. [3.4, §4.1]

3.4. > Use the associativity of the tensor product (Exercise 3.3) to prove again the 
formulas given in Exercise 2.6. (Use Exercise 2.5.) [2.6] 

3.5. Let f : R → S be a ring homomorphism. Prove that $f^!$ commutes with limits,
$f^*$ commutes with colimits, and $f_*$ commutes with both. In particular, deduce that
these three functors all preserve finite direct sums.

3.6. Let f : R → S be a ring homomorphism, and let $\varphi : N_1 \to N_2$ be a homomorphism
of S-modules. Prove that $\varphi$ is an isomorphism if and only if $f_*(\varphi)$ is an
isomorphism. (Functors with this property are said to be conservative.) In fact,
prove that $f_*$ is faithfully exact: a sequence of S-modules

\[ 0 \to L \to M \to N \to 0 \]

is exact if and only if the sequence of R-modules

\[ 0 \to f_*(L) \to f_*(M) \to f_*(N) \to 0 \]

is exact. In particular, a sequence of R-modules is exact if and only if it is exact as
a sequence of abelian groups. (This is completely trivial but useful nonetheless.)
3.7. Let $i : k \subseteq F$ be a finite field extension, and let W be an F-vector space of
finite dimension n. Compute the dimension of $i_*(W)$ as a k-vector space (where $i_*$
is restriction of scalars; cf. §3.3).

3.8. Let $i : k \subseteq F$ be a finite field extension, and let V be a k-vector space of
dimension n. Compute the dimension of $i^*(V)$ and $i^!(V)$ as F-vector spaces.

3.9. Let f : R → S be a ring homomorphism, and let M be an R-module. Prove
that the extension $f^*(M)$ satisfies the following universal property: if N is an S-module
and $\varphi : M \to N$ is an R-linear map, then there exists a unique S-linear
map $\tilde{\varphi} : f^*(M) \to N$ making the diagram

\[
\begin{array}{ccc}
M & \xrightarrow{\;\varphi\;} & N \\
{\scriptstyle\iota}\downarrow & \nearrow{\scriptstyle\tilde{\varphi}} & \\
f^*(M) & &
\end{array}
\]

commute, where $\iota : M \to f^*(M) = M \otimes_R S$ is defined by $m \mapsto m \otimes 1$. (Thus, $f^*(M)$
is the ‘best approximation’ to the R-module M in the category of S-modules.)

3.10. Prove the following projection formula: if f : R → S is a ring homomorphism,
M is an R-module, and N is an S-module, then $f_*(f^*(M) \otimes_S N) \cong M \otimes_R f_*(N)$
as R-modules.

3.11. Let f : R → S be a ring homomorphism, and let M be a flat R-module.
Prove that $f^*(M)$ is a flat S-module.

3.12. In ‘geometric’ contexts (such as the one hinted at in Remark 3.7), one would
actually work with categories which are opposite to the category of commutative
rings; cf. Example 1.9. A ring homomorphism f : R → S corresponds to a morphism
$f^\circ : S^\circ \to R^\circ$ in the opposite category, and we can simply define $f^\circ_*$, etc., to be
$f_*$, etc.

For morphisms $f^\circ : S^\circ \to R^\circ$ and $g^\circ : T^\circ \to S^\circ$ in the opposite category, prove
that

• $(f^\circ \circ g^\circ)^* \cong g^{\circ *} \circ f^{\circ *}$,

• $(f^\circ \circ g^\circ)^! \cong g^{\circ !} \circ f^{\circ !}$,

where $\cong$ stands for ‘naturally isomorphic’. (These are the formulas suggested by
the notation: a * in the subscript invariably suggests a basic ‘covariance’ property
of the notation, while modifiers in the superscript usually suggest contravariance.
The switch to the opposite category is natural in the algebro-geometric context.)

3.13. Let p > 0 be a prime integer, and let $\pi : \mathbb{Z} \to \mathbb{Z}/p\mathbb{Z}$ be the natural projection.
Compute $\pi^*(A)$ and $\pi^!(A)$ for all finitely generated abelian groups A, as a vector
space over $\mathbb{Z}/p\mathbb{Z}$. Compute $\iota^*(A)$ and $\iota^!(A)$ for all finitely generated abelian groups
A, where $\iota : \mathbb{Z} \hookrightarrow \mathbb{Q}$ is the natural inclusion.

3.14. Let f : R → S be an onto ring homomorphism; thus, $S \cong R/I$ for some ideal
I of R.

• Prove that, for all R-modules M, $f^!(M) \cong \{m \in M \mid \forall a \in I, am = 0\}$, while
$f^*(M) \cong M/IM$. (Exercise III.7.7 may help.)

• Prove that, for all S-modules N, $f^! f_*(N) \cong N$ and $f^* f_*(N) \cong N$.

• Prove that $f_*$ is fully faithful (Definition 1.6).

• Deduce that if there is an onto homomorphism R → S, then S-Mod is equivalent
to a full subcategory of R-Mod.

3.15. Let f : R → S be a ring homomorphism, and assume that the functor
$f_*$ : S-Mod → R-Mod is an equivalence of categories.

• Prove that there is a homomorphism of rings $g : S \to \mathrm{End}_{\mathrm{Ab}}(R)$ such that the
composition $R \to S \to \mathrm{End}_{\mathrm{Ab}}(R)$ is the homomorphism realizing R as a module
over itself (that is, the homomorphism studied in Proposition III.2.7).

• Use the fact that S is commutative to deduce that g(S) is isomorphic to R.
(Refine the result of Exercise III.2.17.) Deduce that f has a left-inverse g : S → R.

• Therefore, $f_* \circ g_*$ is naturally isomorphic to the identity; in particular, $f_* \circ g_*(S) \cong S$
as an R-module. Prove that this implies that g is injective. (If
$a \in \ker g$, prove that a is in the annihilator of $f_* \circ g_*(S)$.)

• Conclude that f is an isomorphism.

Two rings are Morita equivalent if their categories of left-modules are equivalent.
The result of this exercise is a (very) particular case of the fact that two commutative
rings are Morita equivalent if and only if they are isomorphic. The commutativity
is crucial in this statement: for example, it can be shown that any ring $R$ is Morita
equivalent to the ring of matrices¹⁷ $\mathcal{M}_{n,n}(R)$, for all $n > 0$.


4. Multilinear algebra 

4.1. Multilinear, symmetric, alternating maps. Multilinear maps may be
defined similarly to bilinear maps: if $M_1, \dots, M_\ell, P$ are $R$-modules, a function

$\varphi : M_1 \times \cdots \times M_\ell \to P$

is $R$-multilinear if it is $R$-linear in each factor, that is, if the function obtained
by arbitrarily fixing all but the $i$-th component is $R$-linear in the $i$-th factor, for
$i = 1, \dots, \ell$.

Again it is natural to ask whether $R$-multilinear maps may be turned into $R$-linear
maps: whether there exists an $R$-module $M_1 \otimes_R \cdots \otimes_R M_\ell$ through which
every $R$-multilinear map must factor. Luckily, this module is already available to
us:

Claim 4.1. Every $R$-multilinear map $M_1 \times \cdots \times M_\ell \to P$ factors uniquely through

$((\cdots(M_1 \otimes_R M_2) \otimes_R \cdots) \otimes_R M_{\ell-1}) \otimes_R M_\ell$.

¹⁷ The author was once told that $\mathcal{M}_{n,n}(\mathbb{C})$ is ‘not seriously noncommutative’ since it is
Morita equivalent to $\mathbb{C}$, which is commutative.

Indeed, argue inductively: if $\varphi : M_1 \times \cdots \times M_\ell \to P$ is multilinear, then for
every $m_\ell \in M_\ell$, the function

$(m_1, \dots, m_{\ell-1}) \mapsto \varphi(m_1, \dots, m_{\ell-1}, m_\ell)$

is $R$-multilinear, so it factors through $(\cdots(M_1 \otimes_R M_2) \otimes_R \cdots) \otimes_R M_{\ell-1}$; therefore,
$\varphi$ induces a unique bilinear map

$((\cdots(M_1 \otimes_R M_2) \otimes_R \cdots) \otimes_R M_{\ell-1}) \times M_\ell \to P$,

and this induces a unique linear map

$((\cdots(M_1 \otimes_R M_2) \otimes_R \cdots) \otimes_R M_{\ell-1}) \otimes_R M_\ell \to P$

by the universal property of $\otimes_R$.

Of course the choice of pivoting on the last factor in this argument is arbitrary:
any other way to associate the factors would produce a solution to the same
universal problem. For $\ell = 3$, it follows in particular that there are canonical
isomorphisms

$(M_1 \otimes_R M_2) \otimes_R M_3 \cong M_1 \otimes_R (M_2 \otimes_R M_3)$

for all $R$-modules $M_1, M_2, M_3$. In this sense, the tensor product is associative¹⁸
in $R$-Mod, and we are indeed authorized to use the notation $M_1 \otimes_R \cdots \otimes_R M_\ell$.
Elements of this module are finite linear combinations

$\sum_i m_{1,i} \otimes \cdots \otimes m_{\ell,i}$

of pure tensors. The structure map

$M_1 \times \cdots \times M_\ell \to M_1 \otimes_R \cdots \otimes_R M_\ell$

acts as $(m_1, \dots, m_\ell) \mapsto m_1 \otimes \cdots \otimes m_\ell$. By the multilinearity of this map, we have,
e.g.,

$(r m_1) \otimes \cdots \otimes m_\ell = m_1 \otimes \cdots \otimes (r m_\ell) = r(m_1 \otimes \cdots \otimes m_\ell)$.

By convention, the tensor product over 0 factors is taken to be the base ring $R$.
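The multilinearity identities above can be checked numerically in the free case. The following is a minimal sketch, assuming $M_1 = M_2 = \mathbb{Z}^2$ with the standard bases, so that a pure tensor $m_1 \otimes m_2$ is recorded by its array of coordinates with respect to the $e_i \otimes e_j$; the helper names `pure_tensor` and `scale` are hypothetical and are not part of the text.

```python
def pure_tensor(m1, m2):
    """Coordinates of m1 (x) m2 with respect to the basis e_i (x) e_j."""
    return [[x * y for y in m2] for x in m1]


def scale(r, t):
    """Multiply every coordinate of the tensor t by the scalar r."""
    return [[r * entry for entry in row] for row in t]


m1, m2, r = [1, 2], [3, -1], 5

# (r m1) (x) m2, m1 (x) (r m2) and r (m1 (x) m2) all have the same coordinates.
left = pure_tensor([r * x for x in m1], m2)
right = pure_tensor(m1, [r * y for y in m2])
assert left == right == scale(r, pure_tensor(m1, m2))
```

The check only illustrates the identity in coordinates for one choice of vectors; it is not a substitute for the universal property.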

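Other important variations on the tensoring theme present themselves when
all the factors coincide. We will use the shorthand notation

$\mathbb{T}^\ell_R(M) := M^{\otimes \ell} = \underbrace{M \otimes_R \cdots \otimes_R M}_{\ell \text{ times}}$

for the $\ell$-fold tensor product of a module $M$ by itself; this is called the ($\ell$-th) tensor
power of $M$. For every $R$-module $P$, every $R$-multilinear map $\varphi : M^\ell \to P$ factors
uniquely through $\mathbb{T}^\ell_R(M)$:

$M^\ell \xrightarrow{\ \otimes\ } \mathbb{T}^\ell_R(M) \xrightarrow{\ \exists!\,\overline{\varphi}\ } P, \qquad \overline{\varphi} \circ \otimes = \varphi.$

We set $\mathbb{T}^1_R(M) = M$; by our convention, $\mathbb{T}^0_R(M) = R$.

Now we can impose further restrictions on $\varphi$ and again study the corresponding
universal problems. Popular possibilities are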


¹⁸ In fact, the tensor product is associative in a fancier sense: cf. Exercise 3.3.

• $\varphi$ may be required to be symmetric; that is, for all $\sigma \in S_\ell$ and all $m_1, \dots, m_\ell$,
require that

$\varphi(m_{\sigma(1)}, \dots, m_{\sigma(\ell)}) = \varphi(m_1, \dots, m_\ell)$;

• or $\varphi$ may be required to be alternating; that is,

$\varphi(m_1, \dots, m_\ell) = 0$ whenever $m_i = m_j$ for some $i \ne j$.

The reader may have expected the second prescription to read: for all $\sigma \in S_\ell$
and all $m_1, \dots, m_\ell$, require that

$\varphi(m_{\sigma(1)}, \dots, m_{\sigma(\ell)}) = (-1)^\sigma \varphi(m_1, \dots, m_\ell)$.

The problem with this version is that if the base ring has characteristic 2, then
there would be no difference between symmetric and alternating maps. But behold
the following:

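Lemma 4.2. Let $\varphi : M^\ell \to P$ be an $R$-multilinear function.

If $\varphi$ is alternating, then for all $\sigma \in S_\ell$, and all $m_1, \dots, m_\ell$,

$\varphi(m_{\sigma(1)}, \dots, m_{\sigma(\ell)}) = (-1)^\sigma \varphi(m_1, \dots, m_\ell)$.

If 2 is a unit in $R$, the converse holds as well.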

Proof. For the first statement, it suffices to show that interchanging any two factors
switches the sign of an alternating function (since transpositions generate the
symmetric group). Since the other factors have no effect on this operation, this
reduces the question to the case $\ell = 2$. Therefore, we only have to show that if
$\varphi(m, m) = 0$ for all $m \in M$, then

$\varphi(m_2, m_1) = -\varphi(m_1, m_2)$

for all $m_1, m_2 \in M$. For this, use bilinearity to expand $0 = \varphi(m_1 + m_2, m_1 + m_2)$,
and again use the alternating condition.

For the second statement, again we can reduce to the $\ell = 2$ case. For $m_1 =
m_2 = m$, $\varphi(m_2, m_1) = -\varphi(m_1, m_2)$ says $\varphi(m, m) = -\varphi(m, m)$; therefore

$2\varphi(m, m) = 0$.

If 2 is a unit in $R$, this implies $\varphi(m, m) = 0$, as needed. □
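The role of the hypothesis on 2 in Lemma 4.2 can be seen in a small brute-force experiment. The sketch below assumes bilinear maps on $\mathbb{Z}/n$ and $(\mathbb{Z}/n)^2$ with values in $\mathbb{Z}/n$; the function `check` and the two sample maps are hypothetical illustrations, not constructions from the text.

```python
from itertools import product


def check(phi, elements, modulus):
    """Return (is_alternating, satisfies_sign_rule) for a bilinear map phi."""
    elements = list(elements)
    alternating = all(phi(m, m) % modulus == 0 for m in elements)
    sign_rule = all((phi(a, b) + phi(b, a)) % modulus == 0
                    for a, b in product(elements, repeat=2))
    return alternating, sign_rule


def mult(a, b):
    """Multiplication, a symmetric bilinear map Z/n x Z/n -> Z/n."""
    return a * b


def det2(u, v):
    """The 2x2 determinant, a bilinear map on pairs of column vectors."""
    return u[0] * v[1] - u[1] * v[0]


# Over Z/2, multiplication obeys phi(b, a) = -phi(a, b) but is not alternating.
assert check(mult, range(2), 2) == (False, True)
# Over Z/3, where 2 is a unit, it fails both conditions.
assert check(mult, range(3), 3) == (False, False)
# The determinant is alternating, hence also obeys the sign rule.
assert check(det2, product(range(2), repeat=2), 2) == (True, True)
```

The first assertion is the characteristic 2 phenomenon described above: the sign rule alone does not force $\varphi(m, m) = 0$.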

Thus, alternating maps do satisfy the (perhaps) more natural prescription, and 
this in fact characterizes alternating maps in most cases. 

4.2. Symmetric and exterior powers. The symmetric, resp., alternating, conditions
come (of course) with companion universal objects: modules $\mathbb{S}^\ell_R(M)$, $\Lambda^\ell_R(M)$
(symmetric and exterior power¹⁹, respectively). The construction of these modules
follows patterns with which the reader is hopefully thoroughly familiar by now.
For example, let $W \subseteq \mathbb{T}^\ell_R(M)$ be the submodule generated by all pure tensors
$m_1 \otimes \cdots \otimes m_\ell$ such that $m_i = m_j$ for some $i \ne j$; then

$\Lambda^\ell_R(M) := \dfrac{\mathbb{T}^\ell_R(M)}{W}$

satisfies the expected universal property for alternating maps, i.e., every multilinear,
alternating map $\varphi : M^\ell \to P$ induces a unique $R$-linear map $\overline{\varphi}$:

$M^\ell \xrightarrow{\ \wedge\ } \Lambda^\ell_R(M) \xrightarrow{\ \exists!\,\overline{\varphi}\ } P, \qquad \overline{\varphi} \circ \wedge = \varphi.$

Here, the map $\wedge$ is of course the composition

$M \times \cdots \times M \to M \otimes_R \cdots \otimes_R M \to \dfrac{M \otimes_R \cdots \otimes_R M}{W} = \Lambda^\ell_R(M)$,

and $W$ is the smallest submodule making the multilinear map $\wedge$ alternating. The
image of a pure tensor is denoted by using $\wedge$ rather than $\otimes$:

$m_1 \otimes \cdots \otimes m_\ell \mapsto m_1 \wedge \cdots \wedge m_\ell$.

The reader will verify (Exercise 4.1) that $\Lambda^\ell_R(M)$ may indeed be constructed
as we have just claimed, and the reader will produce an analogous construction for
the symmetric power $\mathbb{S}^\ell_R(M)$.

¹⁹ The module $\Lambda^\ell_R(M)$ is also called the $\ell$-th wedge power of $M$.

The module $\Lambda^\ell_R(M)$ is generated by the pure ‘alternating’ tensors. By Lemma 4.2,

$m_2 \wedge m_1 \wedge m_3 = -m_1 \wedge m_2 \wedge m_3$

(for example), and, of course, $m_1 \wedge \cdots \wedge m_\ell = 0$ if two of the $m_i$’s coincide. It
follows that if $e_1, \dots, e_r$ generate $M$, then $\Lambda^\ell_R(M)$ is generated by all

$e_{i_1} \wedge \cdots \wedge e_{i_\ell}$

with $1 \le i_1 < \cdots < i_\ell \le r$. In particular, if $M$ is finitely generated, then $\Lambda^\ell_R(M) = 0$
for $\ell \gg 0$. If $M$ is free, we can be very precise:

Lemma 4.3. Let $R$ be a commutative ring, and let $M$ be a free $R$-module of rank $r$.
Then $\Lambda^\ell_R(M)$ is a free $R$-module of rank $\binom{r}{\ell}$.


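Proof. There are $\binom{r}{\ell}$ sequences of indices $i_1, \dots, i_\ell$ satisfying $1 \le i_1 < \cdots < i_\ell \le r$,
so we just need to show that the generators $e_{i_1} \wedge \cdots \wedge e_{i_\ell}$ are linearly independent.
For a fixed $I = (i_1, \dots, i_\ell)$ with $1 \le i_1 < \cdots < i_\ell \le r$, consider the map

$\varphi_I : M^\ell \to R$

obtained by setting

$\varphi_I(e_{j_1}, \dots, e_{j_\ell}) = \begin{cases} (-1)^\sigma & \text{if } \exists \text{ a permutation } \sigma \text{ such that } \sigma(j_k) = i_k, \text{ all } k,\\ 0 & \text{otherwise} \end{cases}$

and extending by multilinearity. Note that $\sigma$ is unique if it exists, so the prescribed
value of $\varphi_I$ is well-defined. Also note that since $M$ is free, every element of $M$
is expressed uniquely as a combination of the $e_i$’s; hence $\varphi_I$ is well-defined for all
elements of $M^\ell$. Further, $\varphi_I$ is evidently alternating (Exercise 4.5). Thus, it factors
uniquely through an $R$-linear map

$\overline{\varphi}_I : \Lambda^\ell_R(M) \to R$,

with the property that

$\overline{\varphi}_I(e_{j_1} \wedge \cdots \wedge e_{j_\ell}) = \begin{cases} (-1)^\sigma & \text{if } \exists \text{ a permutation } \sigma \text{ such that } \sigma(j_k) = i_k, \text{ all } k,\\ 0 & \text{otherwise.} \end{cases}$

Now assume that

$\sum_{1 \le i_1 < \cdots < i_\ell \le r} \lambda_{i_1 \dots i_\ell}\, e_{i_1} \wedge \cdots \wedge e_{i_\ell} = 0$;

then with $I = (i_1, \dots, i_\ell)$

$\lambda_{i_1 \dots i_\ell} = \overline{\varphi}_I\Big( \sum_{1 \le j_1 < \cdots < j_\ell \le r} \lambda_{j_1 \dots j_\ell}\, e_{j_1} \wedge \cdots \wedge e_{j_\ell} \Big) = \overline{\varphi}_I(0) = 0.$

This shows that there are no nontrivial linear dependence relations among the
generators $e_{i_1} \wedge \cdots \wedge e_{i_\ell}$, as claimed. □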
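The counting and the ‘dual’ functions used in this proof can be made concrete. The following is a minimal sketch, assuming basis vectors are recorded only by their indices $1, \dots, r$; the names `wedge_basis` and `phi` are hypothetical stand-ins for the index sequences $I$ and for the values of $\varphi_I$ on tuples of basis vectors.

```python
from itertools import combinations
from math import comb


def wedge_basis(r, l):
    """All index sequences 1 <= i_1 < ... < i_l <= r."""
    return list(combinations(range(1, r + 1), l))


def phi(I, J):
    """Value of phi_I on (e_{j_1}, ..., e_{j_l}), where J = (j_1, ..., j_l)."""
    if sorted(J) != list(I):
        return 0  # J is not a rearrangement of I (this covers repeated indices)
    inversions = sum(1 for a in range(len(J)) for b in range(a + 1, len(J))
                     if J[a] > J[b])
    return -1 if inversions % 2 else 1  # sign of the sorting permutation


basis = wedge_basis(4, 2)
assert len(basis) == comb(4, 2)

# phi_I is 1 on its own generator and 0 on every other generator,
# which is exactly what the independence argument uses.
table = [[phi(I, J) for J in basis] for I in basis]
assert all(table[a][b] == (1 if a == b else 0)
           for a in range(len(basis)) for b in range(len(basis)))
```

The table of values being an identity matrix is the computational shadow of the statement that the generators admit no nontrivial linear relation.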


Note that, in particular, it follows that if $M$ is a free $R$-module of rank $r$, then

$\Lambda^r_R M \cong R$;

an isomorphism is induced by the map

$\varphi_{1 \dots r} : M^r \to R$

obtained by setting

$\varphi_{1 \dots r}(e_{i_1}, \dots, e_{i_r}) = \begin{cases} \pm 1 & \text{if the indices } i_1, \dots, i_r \text{ are distinct,}\\ 0 & \text{otherwise,} \end{cases}$

where the sign is determined by the permutation ordering the indices. (This is a
particular case of the maps $\varphi_I$ considered in the proof of Lemma 4.3.)

Claim 4.4. Let $A$ be an $r \times r$ matrix with entries in a field $k$, and let $a_i \in k^r$ be
the $i$-th column of $A$. Then with notation as above

$\det(A) = \varphi_{1 \dots r}(a_1, \dots, a_r)$.

This is immediate, since the standard properties of the determinant (obtained
in §VI.3.2) prove that it is a multilinear, alternating map of the columns and the
determinant of the identity matrix is 1. For this reason, the top exterior power of
a free module $F$ (that is, $\Lambda^r_R(F)$ if $F$ has rank $r$) is called the determinant of $F$,
denoted $\det F$.
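Expanding $\varphi_{1 \dots r}(a_1, \dots, a_r)$ by multilinearity in the standard basis leaves one term for each permutation of the indices, which is the familiar permutation expansion of the determinant. Here is a minimal sketch of that expansion, assuming integer entries and 0-based indices; `sign` and `phi_top` are hypothetical names.

```python
from itertools import permutations


def sign(perm):
    """Sign of a permutation given as a tuple of 0-based indices."""
    inversions = sum(1 for a in range(len(perm)) for b in range(a + 1, len(perm))
                     if perm[a] > perm[b])
    return -1 if inversions % 2 else 1


def phi_top(columns):
    """Multilinear expansion of phi_{1...r} on the given list of columns."""
    r = len(columns)
    total = 0
    for perm in permutations(range(r)):
        term = sign(perm)  # value of phi_{1...r} on (e_{perm[0]}, ..., e_{perm[r-1]})
        for k in range(r):
            term *= columns[k][perm[k]]  # coefficient of e_{perm[k]} in the k-th column
        total += term
    return total


# Columns (1, 3) and (2, 4) form the matrix with rows (1, 2) and (3, 4).
assert phi_top([[1, 3], [2, 4]]) == 1 * 4 - 2 * 3
# Normalization and the alternating property:
assert phi_top([[1, 0, 0], [0, 1, 0], [0, 0, 1]]) == 1
assert phi_top([[1, 2, 3], [1, 2, 3], [0, 5, 7]]) == 0
```

Tuples of basis vectors with a repeated index contribute 0, which is why only permutations appear in the sum.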

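More generally, the functions $\varphi_I$ give an explicit isomorphism between $\Lambda^\ell_R R^r$
and $R^{\binom{r}{\ell}}$.

Example 4.5. For $V = k^4$, $\Lambda^2_k V$ has dimension $\binom{4}{2} = 6$. On ‘pure wedges’ $a_1 \wedge a_2$,
the isomorphism $\Lambda^2_k V \to k^6$ works as follows. View the vectors $a_1, a_2$ as the two
columns of a $4 \times 2$ matrix $A$; then send $a_1 \wedge a_2$ to the collection of the six $2 \times 2$
minors of $A$ (writing $a_j^i$ for the $i$-th entry of $a_j$):

$\begin{pmatrix} a_1^1 \\ a_1^2 \\ a_1^3 \\ a_1^4 \end{pmatrix} \wedge \begin{pmatrix} a_2^1 \\ a_2^2 \\ a_2^3 \\ a_2^4 \end{pmatrix} \mapsto \begin{pmatrix} a_1^1 a_2^2 - a_1^2 a_2^1 \\ a_1^1 a_2^3 - a_1^3 a_2^1 \\ a_1^1 a_2^4 - a_1^4 a_2^1 \\ a_1^2 a_2^3 - a_1^3 a_2^2 \\ a_1^2 a_2^4 - a_1^4 a_2^2 \\ a_1^3 a_2^4 - a_1^4 a_2^3 \end{pmatrix}$

This definition extends by linearity to the whole of $\Lambda^2_k V$. ⌟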
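The map of Example 4.5 is easy to compute on pure wedges. The sketch below assumes integer vectors of length 4 and lists the minors in the order of the row pairs $(1,2), (1,3), (1,4), (2,3), (2,4), (3,4)$; the function name `minors` is hypothetical, and the code uses 0-based indices.

```python
from itertools import combinations


def minors(a1, a2):
    """The six 2x2 minors of the 4x2 matrix with columns a1 and a2."""
    return [a1[i] * a2[j] - a1[j] * a2[i] for i, j in combinations(range(4), 2)]


a1, a2 = [1, 2, 3, 4], [0, 1, -1, 2]

# The assignment is alternating in (a1, a2) ...
assert minors(a1, a1) == [0] * 6
assert minors(a2, a1) == [-m for m in minors(a1, a2)]

# ... and additive in the first argument (the second is analogous).
b = [2, 0, 1, 1]
total = [x + y for x, y in zip(a1, b)]
assert minors(total, a2) == [m + n for m, n in zip(minors(a1, a2), minors(b, a2))]
```

These are the properties that let the assignment descend from pairs of vectors to the exterior power.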

Similarly to the alternating case, if $e_1, \dots, e_r$ generate $M$, then the symmetric
power $\mathbb{S}^\ell_R(M)$ is generated by the (images²⁰ of) all tensors

$e_{i_1} \otimes \cdots \otimes e_{i_\ell}$

with $1 \le i_1 \le \cdots \le i_\ell \le r$. In this case all $\mathbb{S}^\ell_R(M)$ are, in general, nonzero.

4.3. Very small detour: Graded algebra. We have obtained tensor, symmetric,
exterior powers of a module $M$ for every nonnegative integer $\ell$. As it happens,
in each case it is useful to consider ‘all $\ell$ at once’. The structures we obtain by
doing this are particular cases of a very important notion, which would deserve
much more space than we can allow.

A graded ring is a (not necessarily commutative) ring $S$ endowed with a
decomposition

$S = \bigoplus_{i \ge 0} S_i$

of the abelian group $(S, +)$ into a direct sum of abelian groups $S_i$, for nonnegative
integers²¹ $i$, such that

$S_i \cdot S_j \subseteq S_{i+j}$.

This prescription is parsed as follows: identify $S_i$ with its image in $S$ by the natural
map; nonzero elements of $S_i$ are called homogeneous, of degree $i$; the condition
prescribes that the product of two homogeneous elements of degree $i$, $j$ is homogeneous,
of degree $i + j$ (provided it is not 0).

Example 4.6. The polynomial ring $R[x_1, \dots, x_n]$ over any ring $R$ carries a natural
grading, given by the (ordinary) degree of polynomials. We may write

$R[x_1, \dots, x_n] = R \oplus \langle x_1, \dots, x_n \rangle \oplus \langle x_1^2, x_1 x_2, \dots, x_n^2 \rangle \oplus \cdots$. ⌟

The reader should have no difficulty providing the notions of graded modules
and algebras. For example, a graded ring $S = \bigoplus_i S_i$ is a graded algebra over a
graded ring $R = \bigoplus_i R_i$ if it carries an action of $R$ (making it into a ‘conventional’
$R$-algebra) and further $R_i \cdot S_j \subseteq S_{i+j}$. In particular, if $R = R_0$ (so that $R$ is a
commutative ring viewed as a graded ring by ‘concentrating’ it in degree 0), we are
just requiring each graded piece of $S$ to be an $R$-module.

²⁰ We are not aware of a universally accepted notation to denote ‘symmetric’ tensors; a
sensible choice would be a simple $\cdot$, as in the special case of polynomial rings.

²¹ Of course, much more general gradings may be considered, but this is the case that will
occur in the applications in this section.

For a concrete example, the commutative ring $R[x_1, \dots, x_n]$ is a graded $R$-algebra,
generated (as an algebra over $R$) by the piece of degree 1. This is an
important template for many interesting situations.

We cannot resist the temptation to mention another example from commutative 
algebra: 

Example 4.7. Let $R$ be a commutative ring, and let $I \subseteq R$ be an ideal. Then the
direct sum of $R$-modules

$\bigoplus_{j \ge 0} I^j = R \oplus I \oplus I^2 \oplus I^3 \oplus \cdots$

has a natural ring structure, since $I^j \cdot I^k \subseteq I^{j+k}$. The resulting graded $R$-algebra is
called the Rees algebra of $I$, $\mathrm{Rees}_R(I)$, and is of fundamental importance in algebraic
geometry (as it translates into the notion of blow-up, a particularly important class
of regular maps). The brave reader will try to explore the Rees algebra of the ideal
$I = (x, y)$ in the polynomial ring $k[x, y]$ over a field $k$ (Exercise 4.21), which is the
simplest interesting instance of this construction (well, the second simplest: the
simplest example may be Exercise III.5.17). ⌟

The additional information carried by the grading on a ring is very substantial,
and the theory of graded rings and algebras is extremely rich. For example,
projective algebraic geometry may be developed by using graded rings along the
same lines we sketched in §VII.2.3 for the affine case. As we mentioned at the
end of §VII.2.3, this provides a ‘globalization’ procedure which leads to important
technical advantages.

For the (very modest) purpose of this section, the following general remarks 
are all we need. 

Graded rings form a category under ring homomorphisms, but they form a
more interesting one if we require the homomorphisms to preserve the grading;
that is, if $S = \bigoplus_i S_i$ and $T = \bigoplus_i T_i$ are graded rings, consider ring homomorphisms
$\varphi : S \to T$ such that $\varphi(S_i) \subseteq T_i$. Such homomorphisms are called graded
homomorphisms. Of course, singling out a special class of homomorphisms will
then single out a special class of ideals, that is, those ideals which appear as kernels
of graded homomorphisms.

Definition 4.8. An ideal $I$ of a graded ring $S = \bigoplus_i S_i$ is homogeneous if
$I = \bigoplus_i (I \cap S_i)$.

Lemma 4.9. Let $S = \bigoplus_i S_i$ be a graded ring and let $I \subseteq S$ be an ideal of $S$. Then
the following are equivalent:

(i) $I$ is homogeneous;

(ii) if $s \in S$ and $s = \sum_i s_i$ is the decomposition of $s$ into homogeneous elements
$s_i \in S_i$, then $s \in I \iff s_i \in I$ for all $i$;

(iii) $I$ admits a generating set consisting of homogeneous elements;

(iv) $I$ is the kernel of a graded homomorphism.

Proof. (i) $\iff$ (ii) is the very definition of homogeneous ideal; (ii) $\iff$ (iii) is left
to the reader (Exercise 4.10).

(ii) $\implies$ (iv): Assuming (ii) holds, define a grading on $S/I$ by letting the piece
of degree $i$ consist of (0 and) the cosets of the elements of degree $i$ in $S$. This is
well-defined by (ii), and it is immediately checked that it induces a graded ring
structure on $S/I$. The ideal $I$ is then the kernel of the graded homomorphism
$S \to S/I$, verifying (iv).

(iv) $\implies$ (ii): Let $\varphi : S \to T$ be a graded homomorphism, and assume $I = \ker \varphi$.
Let $s = \sum_i s_i \in \ker \varphi$, with $s_i \in S_i$. Then $\sum_i \varphi(s_i) = 0$ in $T$, and $\varphi(s_i)$ is
homogeneous of degree $i$ for all $i$. It follows that $\varphi(s_i) = 0$ for all $i$, that is, each
$s_i \in \ker \varphi = I$, implying (ii). □

Note that a graded homomorphism $\varphi : S \to T$ induces a homomorphism of
abelian groups $\varphi_i : S_i \to T_i$ for each $i$. The ideal $\ker \varphi$ is then the direct sum of all
$\ker \varphi_i$.

Example 4.10. The ideal $I = (y - x^2)$ is not homogeneous in the ring $k[x, y]$,
if this is given the grading by the usual degree: indeed, $y - x^2 \in I$ while $y \notin I$,
contradicting condition (ii) of Lemma 4.9. The ideal $(y, y - x^2)$ is homogeneous,
since it equals $(y, x^2)$ and both $y$, $x^2$ are homogeneous.

The conventional grading of a polynomial ring is not the only option: we could
decide to grade $k[x, y]$ by placing constants in degree 0 and assigning degree 1 to
the indeterminate $x$ and degree 2 to $y$. With such a grading, the ideal $(y - x^2)$ is
homogeneous. ⌟
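Whether a given polynomial is homogeneous for a weighted grading is a mechanical check. The sketch below assumes polynomials in $x, y$ stored as dictionaries from exponent pairs (exponent of $x$, exponent of $y$) to nonzero coefficients; the names `degrees` and `is_homogeneous` are hypothetical. It tests only the generator $y - x^2$: by Lemma 4.9 (iii) a homogeneous generator makes the principal ideal homogeneous, while a non-homogeneous generator does not by itself settle the question (as the ideal $(y, y - x^2)$ shows).

```python
def degrees(poly, weights):
    """Set of weighted degrees of the monomials appearing in poly."""
    return {sum(w * e for w, e in zip(weights, exps))
            for exps, coeff in poly.items() if coeff != 0}


def is_homogeneous(poly, weights):
    """True if all monomials of poly have the same weighted degree."""
    return len(degrees(poly, weights)) <= 1


f = {(0, 1): 1, (2, 0): -1}  # y - x^2

# Usual grading: deg x = deg y = 1, so y and x^2 sit in degrees 1 and 2.
assert degrees(f, (1, 1)) == {1, 2}
assert not is_homogeneous(f, (1, 1))

# Weighted grading with deg x = 1 and deg y = 2: both monomials have degree 2.
assert is_homogeneous(f, (1, 2))
```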

Projective algebraic geometry (almost) follows the blueprint of affine algebraic
geometry reviewed in §VII.2.3, but using only homogeneous ideals in $k[x_1, \dots, x_n]$
(taken with the usual grading).

4.4. Tensor algebras. Going back to the context of multilinear algebra, consider
again a module $M$ over a commutative ring $R$. We define the tensor algebra of $M$
as the graded $R$-algebra

$\mathbb{T}^*_R(M) := \bigoplus_{\ell \ge 0} \mathbb{T}^\ell_R(M)$:

the multiplication is defined on pure tensors by

$(m_1 \otimes \cdots \otimes m_i) \cdot (n_1 \otimes \cdots \otimes n_j) := m_1 \otimes \cdots \otimes m_i \otimes n_1 \otimes \cdots \otimes n_j$

and is extended by linearity. Similarly, we can define the symmetric algebra and
the exterior algebra of $M$:

$\mathbb{S}^*_R(M) := \bigoplus_{\ell \ge 0} \mathbb{S}^\ell_R(M), \qquad \Lambda^*_R(M) := \bigoplus_{\ell \ge 0} \Lambda^\ell_R(M)$.

Remark 4.11. The tensor algebra is not commutative; the symmetric algebra is
commutative; and the exterior algebra is ‘skew-commutative’ in the sense that if
$\alpha \in \Lambda^i_R(M)$ and $\beta \in \Lambda^j_R(M)$, then

$\alpha \wedge \beta = (-1)^{ij} \beta \wedge \alpha$

(Exercise 4.16), where we use $\wedge$ to denote the operation in $\Lambda^*_R(M)$. The reader
should recognize this formula from the theory of differential forms; differential forms
provide perhaps the most important example of an exterior algebra. ⌟
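The sign rule of Remark 4.11 can be tested on small examples in the free case. The following is a minimal sketch, assuming $M$ is free with basis $e_1, e_2, \dots$ and recording an element of the exterior algebra as a dictionary from strictly increasing index tuples to integer coefficients; `wedge` and `scale` are hypothetical helper names.

```python
def wedge(alpha, beta):
    """Product of two elements of the exterior algebra of a free module."""
    result = {}
    for I, a in alpha.items():
        for J, b in beta.items():
            K = I + J
            if len(set(K)) < len(K):
                continue  # a repeated basis vector kills the product
            inversions = sum(1 for p in range(len(K)) for q in range(p + 1, len(K))
                             if K[p] > K[q])
            key = tuple(sorted(K))
            result[key] = result.get(key, 0) + (-1) ** inversions * a * b
    return {k: v for k, v in result.items() if v != 0}


def scale(c, alpha):
    """Multiply every coefficient of alpha by the scalar c."""
    return {k: c * v for k, v in alpha.items()}


e1, e2 = {(1,): 1}, {(2,): 1}
alpha = {(1,): 2, (3,): 1}     # degree 1: 2 e_1 + e_3
beta = {(2, 4): 1, (1, 2): 5}  # degree 2: e_2 ^ e_4 + 5 e_1 ^ e_2

assert wedge(e1, e2) == scale(-1, wedge(e2, e1))     # (-1)^(1*1) = -1
assert wedge(alpha, beta) == wedge(beta, alpha)      # (-1)^(1*2) = +1
assert wedge(alpha, alpha) == {}                     # alternating in degree 1
```

This only spot-checks the formula; the general statement is the content of Exercise 4.16.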


The definitions of symmetric and exterior powers were concocted so as to yield
surjective graded homomorphisms of algebras

$\mathbb{T}^*_R(M) \twoheadrightarrow \mathbb{S}^*_R(M), \qquad \mathbb{T}^*_R(M) \twoheadrightarrow \Lambda^*_R(M)$.

As we have learned in Lemma 4.9, the kernels of these homomorphisms are certain
homogeneous ideals of the tensor algebra; these ideals must then be generated by
homogeneous elements.

Lemma 4.12. Let $I_S, I_\Lambda \subseteq \mathbb{T}^*_R(M)$ be the ideals respectively generated by all
elements of the form $m \otimes n - n \otimes m$ as $m, n \in M$ and by all elements of the form
$m \otimes m$ as $m \in M$. Then

$\mathbb{S}^*_R(M) \cong \dfrac{\mathbb{T}^*_R(M)}{I_S}, \qquad \Lambda^*_R(M) \cong \dfrac{\mathbb{T}^*_R(M)}{I_\Lambda}$

as graded $R$-algebras.

Proof. We have observed in §4.3 that the kernel of a graded homomorphism is
the direct sum of the kernels of the induced homomorphisms in each degree; the
statement of the lemma then follows easily from the explicit descriptions of the
kernels of the canonical projections from the tensor powers to the symmetric and
exterior powers, given in §4.2. □

Remark 4.13. In some cases it is also possible to construct subalgebras of $\mathbb{T}^*_R(M)$
isomorphic to $\mathbb{S}^*_R(M)$ and $\Lambda^*_R(M)$. For example, tensors that are invariant under
the action of the symmetric group may be used to give a copy of $\mathbb{S}^*_R(M)$ inside
$\mathbb{T}^*_R(M)$: $\frac{1}{2}(m_1 \otimes m_2 + m_2 \otimes m_1)$ would be such a ‘symmetric tensor’. However, the
introduction of denominators poses distasteful restrictions on the base ring $R$, and
for this reason we prefer the ‘quotient’ constructions of Lemma 4.12. ⌟

It is now straightforward to establish universal properties satisfied by the algebras
defined above. Note that by construction there are canonical isomorphisms of
$M$ with $\mathbb{T}^1_R(M)$, $\mathbb{S}^1_R(M)$, $\Lambda^1_R(M)$, and in particular there are canonical maps

$M \to \mathbb{T}^*_R(M), \qquad M \to \mathbb{S}^*_R(M), \qquad M \to \Lambda^*_R(M)$.

Proposition 4.14. Let $R$ be a commutative ring, and let $M$ be an $R$-module. Then
for every $R$-algebra $T$ and every $R$-module homomorphism $\varphi : M \to T$ there exists a
unique homomorphism of $R$-algebras $\overline{\varphi} : \mathbb{T}^*_R(M) \to T$ making the following diagram
commute:

$M \longrightarrow \mathbb{T}^*_R(M) \xrightarrow{\ \exists!\,\overline{\varphi}\ } T, \qquad \text{with composition } \varphi.$

Proposition 4.15. Let $R$ be a commutative ring, and let $M$ be an $R$-module. Then
for every commutative $R$-algebra $S$ and every $R$-module homomorphism $\psi : M \to S$
there exists a unique homomorphism of $R$-algebras $\overline{\psi} : \mathbb{S}^*_R(M) \to S$ making the
following diagram commute:

$M \longrightarrow \mathbb{S}^*_R(M) \xrightarrow{\ \exists!\,\overline{\psi}\ } S, \qquad \text{with composition } \psi.$

Proposition 4.16. Let $R$ be a commutative ring, and let $M$ be an $R$-module.
Then for every $R$-algebra $A$ and every $R$-module homomorphism $\lambda : M \to A$ such
that $\lambda(m)^2 = 0$ $\forall m \in M$, there exists a unique homomorphism of $R$-algebras
$\overline{\lambda} : \Lambda^*_R(M) \to A$ making the following diagram commute:

$M \longrightarrow \Lambda^*_R(M) \xrightarrow{\ \exists!\,\overline{\lambda}\ } A, \qquad \text{with composition } \lambda.$


Details may now safely be left to the reader (who may for example establish 
the first proposition from the universal property of tensor powers and then deduce 
the second and third from Lemma 4.12). 

Example 4.17. The free case is particularly easy to understand. For instance,

$\mathbb{S}^*_R(R^{\oplus r}) \cong R[x_1, \dots, x_r]$.

Indeed, the polynomial ring satisfies the appropriate universal property with respect
to mapping to commutative rings (cf. §III.2.2 and §III.6.4). Likewise, $\mathbb{T}^*_R(R^{\oplus r})$
should be thought of as a ‘noncommutative’ polynomial ring, in which the $r$
indeterminates do not commute with each other (cf. §III.6.3). ⌟

Of course the constructions $\mathbb{T}^*_R$, $\mathbb{S}^*_R$, $\Lambda^*_R$ are all functorial, and their behavior
with respect to exact sequences is interesting and important. For example, suppose

$0 \to L \to M \to N \to 0$

is an exact sequence of $R$-modules. Then $L$ may be identified with a subset of
$M = \mathbb{S}^1_R(M)$; hence it defines an ideal $L \cdot \mathbb{S}^*_R(M)$ of the algebra $\mathbb{S}^*_R(M)$. It is not
hard to show that the sequence²²

$0 \to L \cdot \mathbb{S}^{*-1}_R(M) \to \mathbb{S}^*_R(M) \to \mathbb{S}^*_R(N) \to 0$

is exact. This is often useful in computations (cf. Exercise 4.21). We will leave any
further such explorations to the more motivated readers.

²² The shift $\mathbb{S}^{*-1}$ is introduced in order to preserve degrees in this sequence.






VIII. Linear algebra, reprise 


Exercises 


In the following exercises, $R$ denotes a commutative ring and $k$ denotes a field.

4.1. ▷ Verify that the module $\Lambda^\ell_R(M)$ constructed in §4.2 does satisfy the universal
property for alternating multilinear maps. Construct a module $\mathbb{S}^\ell_R(M)$ satisfying
the universal property for symmetric multilinear maps. [§4.2]

4.2. Define the action of $\mathbb{T}^\ell_R$, $\mathbb{S}^\ell_R$, $\Lambda^\ell_R$ on $R$-linear maps, making covariant functors
out of these notions.

4.3. Let $I$ be the ideal $(x, y)$ in $k[x, y]$, so every element of $I$ may be written (of
course, not uniquely) in the form $fx + gy$ for some polynomials $f, g \in k[x, y]$. Define
a function $\varphi : I \times I \to k$ by prescribing

$\varphi(f_1 x + g_1 y, f_2 x + g_2 y) := f_1(0,0)\, g_2(0,0) - f_2(0,0)\, g_1(0,0)$.

• Prove that $\varphi$ is well-defined.

• Prove that $\varphi$ is $k[x, y]$-bilinear and alternating.

• Prove that $\Lambda^2_{k[x,y]}(I) \ne 0$.

Note that $I$ has rank 1 as a $k[x, y]$-module; if it were free, its wedge with itself
would have to vanish.

4.4. Let $F_1$ and $F_2$ be free $R$-modules of finite rank.

• Construct a ‘meaningful’ isomorphism $\det(F_1) \otimes \det(F_2) \cong \det(F_1 \oplus F_2)$.

• More generally, prove that

$\Lambda^r_R(F_1 \oplus F_2) \cong \bigoplus_{i+j=r} (\Lambda^i_R F_1) \otimes_R (\Lambda^j_R F_2)$.

4.5. ▷ Verify that the multilinear map $\varphi_I$ defined in the proof of Lemma 4.3 is
alternating. [§4.2]

4.6. Let $V$ be a vector space, and let $v_1, \dots, v_\ell \in V$. Prove that $v_1, \dots, v_\ell$ are
linearly independent if and only if $v_1 \wedge \cdots \wedge v_\ell \ne 0$.

4.7. ¬ Let $V$ be a $k$-vector space, and let $\{v_1, \dots, v_\ell\}$, $\{w_1, \dots, w_\ell\}$ be two sets of
linearly independent vectors in $V$. Prove that $\{v_1, \dots, v_\ell\}$, $\{w_1, \dots, w_\ell\}$ span the
same subspace of $V$ if and only if $v_1 \wedge \cdots \wedge v_\ell$ and $w_1 \wedge \cdots \wedge w_\ell$ are nonzero multiples
of each other in $\Lambda^\ell_k V$. (For the interesting direction, if $\langle v_1, \dots, v_\ell \rangle \ne \langle w_1, \dots, w_\ell \rangle$,
there must be a vector $u$ belonging to the first subspace but not to the second.
What can you say about $(v_1 \wedge \cdots \wedge v_\ell) \wedge u$ and $(w_1 \wedge \cdots \wedge w_\ell) \wedge u$ in $\Lambda^{\ell+1}_k V$?)

Deduce that there is an injective function from the Grassmannian of $\ell$-dimensional
subspaces of $V$ (Exercise VI.2.13) to the projective space $\mathbb{P}(\Lambda^\ell_k V)$: the
Grassmannian is identified with the set of ‘pure wedges’ in the projectivization of
the exterior power $\Lambda^\ell_k V$. This is called the Plücker embedding of the Grassmannian.
[4.8]

4.8. The Plücker embedding described in Exercise 4.7 realizes the Grassmannian
$\mathrm{Gr}_k(2, 4)$ of 2-dimensional subspaces of $k^4$ as a subset of the projectivization of
$\Lambda^2_k k^4 \cong k^6$: $\mathrm{Gr}_k(2, 4) \subseteq \mathbb{P}^5_k$. Choose projective coordinates (Exercise VII.2.20)
$(x_{12} : x_{13} : x_{14} : x_{23} : x_{24} : x_{34})$ in $\mathbb{P}(\Lambda^2_k k^4)$, listed according to the corresponding
minors, as in Example 4.5. Prove that $\mathrm{Gr}_k(2, 4)$ is the locus of zeros

$\mathscr{V}(x_{12} x_{34} - x_{13} x_{24} + x_{14} x_{23})$

(notation as in Exercise VII.2.21). (Remember that you know how to write every
point of $\mathrm{Gr}_k(2, 4)$: look back at Exercise VI.2.14.)

Thus, $\mathrm{Gr}_k(2, 4)$ may be viewed as a projective algebraic set. In fact, all
Grassmannians may be similarly realized as projective algebraic sets (in the sense of
Exercise 4.11) via the corresponding Plücker embeddings.

Prove that $\mathrm{Gr}_k(2, 4)$ may be covered with six copies of $\mathbb{A}^4_k$. (Recall from
Exercise VII.2.20 that $\mathbb{P}^5_k$ may be covered with six copies of $\mathbb{A}^5_k$; prove that the
intersection of each with $\mathrm{Gr}_k(2, 4)$ may be identified with $\mathbb{A}^4_k$ in a natural way.)

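4.9. Assume 2 is a unit in $R$, and let $F$ be a free $R$-module of finite rank.

• Define a function $\lambda : \Lambda^2_R(F) \to \mathbb{T}^2_R(F)$ on a basis $e_i \wedge e_j$, $i < j$, by setting
$\lambda(e_i \wedge e_j) = \frac{1}{2}(e_i \otimes e_j - e_j \otimes e_i)$ and extending by linearity. Prove that $\lambda$ is an
injective homomorphism of $R$-modules and $\lambda(f_1 \wedge f_2) = \frac{1}{2}(f_1 \otimes f_2 - f_2 \otimes f_1)$ for
all $f_1, f_2 \in F$.

• Define a function $\sigma : \mathbb{S}^2_R(F) \to \mathbb{T}^2_R(F)$ on a basis $e_i \otimes e_j$, $i \le j$, by setting
$\sigma(e_i \otimes e_j) = \frac{1}{2}(e_i \otimes e_j + e_j \otimes e_i)$ and extending by linearity. Prove that $\sigma$ is an
injective homomorphism of $R$-modules and $\sigma(f_1 \otimes f_2) = \frac{1}{2}(f_1 \otimes f_2 + f_2 \otimes f_1)$ for
all $f_1, f_2 \in F$.

• Prove that $\lambda$ identifies $\Lambda^2_R(F)$ with the kernel of the map $\mathbb{T}^2_R(F) \to \mathbb{S}^2_R(F)$ and
$\sigma$ identifies $\mathbb{S}^2_R(F)$ with the kernel of the map $\mathbb{T}^2_R(F) \to \Lambda^2_R(F)$.

In particular, there is a ‘meaningful’ isomorphism $F \otimes_R F \cong \Lambda^2_R(F) \oplus \mathbb{S}^2_R(F)$.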

4.10. ▷ Prove the equivalence (ii) $\iff$ (iii) in Lemma 4.9. [§4.3]

4.11. ¬ Let $I \subseteq k[x_0, \dots, x_n]$ be a homogeneous ideal. Prove that the condition
‘$F(c_0, \dots, c_n) = 0$ for all $F \in I$’ for a point $(c_0 : \dots : c_n) \in \mathbb{P}^n_k$ is well-defined: it
does not depend on the representative $(c_0, \dots, c_n)$ chosen for the point $(c_0 : \dots : c_n)$.
(Cf. Exercise VII.2.21, but note that not all $F$ in $I$ are homogeneous.)

Thus, every homogeneous ideal $I \subseteq k[x_0, \dots, x_n]$ determines a ‘projective algebraic
set’

$\mathscr{V}(I) := \{(c_0 : \dots : c_n) \in \mathbb{P}^n_k \mid (\forall F \in I),\ F(c_0, \dots, c_n) = 0\}$.

Note that $\mathscr{V}((x_0, \dots, x_n)) = \emptyset$. The ideal $(x_0, \dots, x_n) = \bigoplus_{i > 0} k[x_0, \dots, x_n]_i$ is
irreverently called the irrelevant ideal. [4.8, 4.12]


4.12. (Cf. Exercise 4.11.) Prove the ‘weak homogeneous Nullstellensatz’: if $k$ is
algebraically closed and $I \subseteq k[x_0, \dots, x_n]$ is a homogeneous ideal, then $\mathscr{V}(I) = \emptyset$ if and only if
$\sqrt{I}$ is either $k[x_0, \dots, x_n]$ or the irrelevant ideal $(x_0, \dots, x_n)$. (Translate this into a
question about $\mathbb{A}^{n+1}_k$.)

4.13. Let $k$ be a field of characteristic zero. A differential operator in one variable,
with polynomial coefficients, is a linear combination

(*) $a_0(x) + a_1(x)\partial_x + \cdots + a_r(x)\partial_x^r$

where $a_i(x) \in k[x]$. Here $x$ acts on a polynomial $f(x) \in k[x]$ by multiplication
by $x$, while $\partial_x$ acts by taking a formal derivative (as in §VII.4.2). With the evident
operations, differential operators form a ring, called the (first) Weyl algebra. Note
that the Weyl algebra is noncommutative: for example,

$(x\partial_x) f(x) = x f'(x)$,

while

$(\partial_x x) f(x) = \partial_x (x f(x)) = f(x) + x f'(x)$.

Therefore, $(\partial_x x - x\partial_x) f(x) = f(x)$, or put otherwise

(**) $\partial_x x - x\partial_x = 1$

in the Weyl algebra. Prove that the Weyl algebra is isomorphic to the quotient
$\mathbb{T}^*_k(\langle x, y \rangle)/(yx - xy - 1)$. (Use (**) to write any element of the Weyl algebra in the
form given in (*). Show that this representation is unique, and deduce that (**)
generates all relations among $x$ and $\partial_x$. Then use the first isomorphism theorem.)

The relation (**) expresses (up to a factor) the basic fact that in quantum
mechanics the position and momentum operators do not commute. The Weyl
algebra was introduced to study this phenomenon. Modules over rings of differential
operators are called $D$-modules.
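The relation (**) derived in the statement of Exercise 4.13 can be checked directly on polynomials. The sketch below assumes integer coefficients and stores a polynomial as its list of coefficients in increasing degree; the names `times_x`, `d_dx`, and `subtract` are hypothetical. It verifies the relation on sample polynomials only and does not address the exercise itself.

```python
def times_x(f):
    """Multiply a polynomial (coefficient list, lowest degree first) by x."""
    return [0] + list(f)


def d_dx(f):
    """Formal derivative of a polynomial given by its coefficient list."""
    return [k * c for k, c in enumerate(f)][1:]


def subtract(f, g):
    """Coefficientwise difference, padding the shorter list with zeros."""
    n = max(len(f), len(g))
    f = list(f) + [0] * (n - len(f))
    g = list(g) + [0] * (n - len(g))
    return [a - b for a, b in zip(f, g)]


for f in ([5], [1, 2, 0, 3], [0, 0, 7, -1, 4]):
    lhs = subtract(d_dx(times_x(f)), times_x(d_dx(f)))  # (d x - x d) applied to f
    assert lhs == f
```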

4.14. Let $F$ be a free $R$-module of rank $r$. Prove that $\mathbb{S}^\ell_R(F)$ is free, and compute
its rank.

4.15. Let $F_1$, $F_2$ be free $R$-modules of finite rank. Prove that
$\mathbb{S}^*_R(F_1 \oplus F_2) \cong \mathbb{S}^*_R(F_1) \otimes_R \mathbb{S}^*_R(F_2)$.

4.16. ▷ Verify the skew commutativity of the exterior algebra, stated in
Remark 4.11. [§4.4]

4.17. Let $V$ be a $k$-vector space of dimension $r$. Prove that, as a vector space, the
exterior algebra $\Lambda^*_k(V)$ has dimension $2^r$.

4.18. ¬ Prove that the Rees algebra $\mathrm{Rees}_R(I)$ of an ideal $I$ is Noetherian if $R$ is
Noetherian. [4.19]

4.19. ¬ Let $R$ be a Noetherian ring, $I$ an ideal of $R$, and consider the Rees algebra
$\mathrm{Rees}_R(I) = \bigoplus_{\ell \ge 0} I^\ell$. By Exercise 4.18, $\mathrm{Rees}_R(I)$ is Noetherian.

• For $\ell \ge 0$, let $J_\ell \subseteq I^\ell$ be ideals of $R$, and view $J := \bigoplus_{\ell \ge 0} J_\ell$ as a sub-$R$-module
of $\mathrm{Rees}_R(I)$. Prove that $J$ is an ideal of $\mathrm{Rees}_R(I)$ if and only if $I^n J_\ell \subseteq J_{\ell+n}$ for
all $\ell, n \ge 0$.

• Assume $J := \bigoplus_{\ell \ge 0} J_\ell$ is an ideal of $\mathrm{Rees}_R(I)$. Prove that $J$ admits a finite set
of homogeneous generators.

• Choose a finite set of homogeneous generators for $J$, and let $s$ be the largest
degree of an element in this set. Prove that $J_{s+1} \subseteq I^{s+1} J_0 + I^s J_1 + \cdots + I J_s$
(as ideals of $R$).

• Prove that $J_{s+\ell} = I^\ell J_s$ for all $\ell \ge 0$.

This is a particular case of the ‘Artin-Rees’ lemma. [4.20]

4.20. ¬ Let $R$ be a Noetherian ring, and let $I$ be an ideal of $R$. Prove that
$I \cdot \bigcap_{n \ge 0} I^n = \bigcap_{n \ge 0} I^n$. (Hint: Exercise 4.19.) [1.18]

4.21. ▷ Let $k$ be a field, and consider the ideal $I = (x, y)$ in the ring $R = k[x, y]$.
Prove that the Rees algebra $\bigoplus_{j \ge 0} I^j$ is isomorphic to the quotient of a polynomial
ring $k[x, y, s, t]$ by the ideal $(tx - sy)$. Deduce that, in this case, the Rees algebra
of $I$ is isomorphic to the symmetric algebra $\mathbb{S}^*_{k[x,y]}(I)$.

(For the last point, use the exact sequence mentioned at the end of §4.4. It will
be helpful to have an explicit presentation of $I$ as a $k[x, y]$-module; for this, looking
back at Exercise VI.4.15 may help.) [§4.3, §4.4]

4.22. ¬ Let $a$ denote a list $a_1, \dots, a_n$ of elements of $R$, and let $F = R^n$. Rename
$\Lambda^r_R(F)$ by $K_r(a)$. Define $R$-module homomorphisms $d_r : K_r(a) \to K_{r-1}(a)$ on
bases by setting

$d_r(e_{i_1} \wedge \cdots \wedge e_{i_r}) = \sum_{j=1}^{r} (-1)^{j-1} a_{i_j}\, e_{i_1} \wedge \cdots \wedge \widehat{e_{i_j}} \wedge \cdots \wedge e_{i_r}$,

where the hat denotes that the hatted element is omitted.

• Prove that $d_{r-1} \circ d_r = 0$.

Thus, a collection $a_1, \dots, a_n$ of elements of $R$ determines a complex of $R$-modules

$0 \to K_n(a) \xrightarrow{d_n} \cdots \xrightarrow{d_2} K_1(a) \xrightarrow{d_1} K_0(a) = R \to R/I \to 0$,

where $I = (a_1, \dots, a_n)$. This is called the Koszul complex of $a$.

• Check that the complexes constructed in Exercises VI.4.13 and VI.4.14 are
Koszul complexes.

As proven for $n = 2, 3$ in Exercises VI.4.13 and VI.4.14, the Koszul complex is
exact if $(a_1, \dots, a_n)$ is a regular sequence in $R$, providing a free resolution for $R/I$
in that case. Try to prove this in general. [VI.4.14]


5. Hom and duals

Our rapid overview of tensor products has occasionally brought us into close
proximity with the Hom functors, and we will close this chapter by devoting some
attention to these functors. As in the case of tensors, we will be preoccupied with
adjunction and exactness properties, since concrete computations depend heavily
on these properties. We will deal with these properties more carefully in the next
(and last) section; in this section we will concentrate on one important special case
of Hom, that is, the duality functor.

As in the rest of the chapter, R denotes a fixed commutative ring.

5.1. Adjunction again. The careful reader will agree that almost everything
we know about the tensor product follows from the fact that it is a left-adjoint
functor. This fact is spelled out in Lemma 2.4, whose flip side gives us just as much
information concerning the covariant flavor of the Hom functor:

$$L \mapsto \mathrm{Hom}_{R\text{-}\mathrm{Mod}}(N, L).$$

Explicitly, the following is an exact restatement of Corollary 2.5:

Corollary 5.1. For every R-module N, the functor $\mathrm{Hom}_R(N, \_)$ is right-adjoint
to the functor $\_ \otimes_R N$.

What about the ‘contravariant’ flavor of Hom:

$$M \mapsto h_N(M) := \mathrm{Hom}_{R\text{-}\mathrm{Mod}}(M, N)$$

(this functor was discussed in general terms in §1.2)?

Proposition 5.2. For every R-module N, the functor $\mathrm{Hom}_R(\_, N)$ is right-adjoint
to itself.

This statement should be parsed carefully, because $\mathrm{Hom}_R(\_, N)$ is a contravariant
functor. The statement of Proposition 5.2 may lead us astray into thinking that
$\mathrm{Hom}_R(\_, N)$ must also be its own left-adjoint, and this is not the case (indeed this
would make it a right-exact functor, and we will soon see that $\mathrm{Hom}_R(\_, N)$ is not
right-exact in general).

The point is that, by definition of contravariant functor, $h_N = \mathrm{Hom}_R(\_, N)$
should be viewed as a covariant functor from the opposite category:

$$h_N : R\text{-}\mathrm{Mod}^{op} \to R\text{-}\mathrm{Mod};$$

it can also be viewed as a covariant functor to the opposite category,

$$h_N^{op} : R\text{-}\mathrm{Mod} \to R\text{-}\mathrm{Mod}^{op},$$

by simply reversing arrows after the fact rather than before. A more precise
statement of Proposition 5.2 is that

$$h_N \text{ is right-adjoint to } h_N^{op}.$$

Proof. Let L, M, N denote R-modules. Recall (cf. the considerations preceding
Lemma 2.4) that R-bilinear maps

$$\varphi : L \times M \to N$$

may be identified with R-linear maps

$$L \to \mathrm{Hom}_R(M, N).$$

By the same token, they may be identified with R-linear maps

$$M \to \mathrm{Hom}_R(L, N):$$

for fixed $m \in M$, the function $\varphi(\_, m)$ is an R-linear map $L \to N$. Tracing these
two identifications gives a canonical bijection

$$(*) \qquad \mathrm{Hom}_R(L, \mathrm{Hom}_R(M, N)) \cong \mathrm{Hom}_R(M, \mathrm{Hom}_R(L, N)).$$

We are adhering to the convention that $\mathrm{Hom}_R$ stands for $\mathrm{Hom}_{R\text{-}\mathrm{Mod}}$. We may view
it just as well as $\mathrm{Hom}_{R\text{-}\mathrm{Mod}^{op}}$, provided that we reverse arrows; thus, the canonical
bijection in (*) may be rewritten as

$$\mathrm{Hom}_{R\text{-}\mathrm{Mod}}(L, \mathrm{Hom}_{R\text{-}\mathrm{Mod}}(M, N)) \cong \mathrm{Hom}_{R\text{-}\mathrm{Mod}^{op}}(\mathrm{Hom}_{R\text{-}\mathrm{Mod}}(L, N), M)$$

or, using the notation introduced before the proof, as

$$\mathrm{Hom}_{R\text{-}\mathrm{Mod}}(L, h_N(M)) \cong \mathrm{Hom}_{R\text{-}\mathrm{Mod}^{op}}(h_N^{op}(L), M).$$

This says that $h_N$ is right-adjoint to $h_N^{op}$ (the naturality requirement is also
immediate, and as usual it is left to the reader). □

As both $\mathrm{Hom}_R(N, \_)$ (by Corollary 5.1) and $\mathrm{Hom}_R(\_, N)$ (by Proposition 5.2)
are right-adjoint functors, Lemma 1.17 tells us they commute with limits. This fact
must also be parsed carefully, because of the contravariant nature of $\mathrm{Hom}_R(\_, N)$:
for example, products are limits in R-Mod, but direct sums are colimits in R-Mod,
hence limits in $R\text{-}\mathrm{Mod}^{op}$. Thus,

Corollary 5.3. For every R-module N and every family $\{M_i\}_{i \in I}$ of R-modules,

$$\mathrm{Hom}_R\Big(N, \prod_{i \in I} M_i\Big) \cong \prod_{i \in I} \mathrm{Hom}_R(N, M_i),$$

$$\mathrm{Hom}_R\Big(\bigoplus_{i \in I} M_i, N\Big) \cong \prod_{i \in I} \mathrm{Hom}_R(M_i, N).$$

Also, as pointed out in Claim 1.19, the fact that both the covariant and
contravariant flavors of Hom are right-adjoints has immediate implications for their
exactness, adding yet another Pavlovian statement to the list:

$\mathrm{Hom}_R(M, \_)$ and $\mathrm{Hom}_R(\_, N)$ are left-exact functors.

By the contravariant nature of $\mathrm{Hom}_R(\_, N)$, the left-exactness of the latter
means that if

$$A \to B \to C \to 0$$

is an exact sequence of R-modules, then the induced sequence

$$0 \to \mathrm{Hom}_R(C, N) \to \mathrm{Hom}_R(B, N) \to \mathrm{Hom}_R(A, N)$$

is also exact. The diligent reader has verified this already ‘by hand’ in the distant
past (Exercise III.7.7), without appealing to adjunction. Direct proofs of the
left-exactness of both Hom functors are straightforward and instructive.

5.2. Dual modules. We will explore further the exactness (and lack of exactness)
of Hom in §6. Before doing that, however, we want to examine one important
particular case of the contravariant aspect of the Hom functor.

Definition 5.4. Let M be an R-module. The dual $M^\vee$ of M is the R-module
$\mathrm{Hom}_R(M, R)$.

One use of the dual module is to translate Hom computations into ⊗ computations,
at least in special cases.

Proposition 5.5. Let M be any R-module, and let F be a free R-module of finite
rank. Then

$$\mathrm{Hom}_R(M, F) \cong M^\vee \otimes_R F.$$

Proof. By hypothesis $F \cong R^{\oplus n}$; hence

$$\mathrm{Hom}_R(M, F) \cong \mathrm{Hom}_R(M, R^{\oplus n}) \cong \mathrm{Hom}_R(M, R)^{\oplus n} \cong \mathrm{Hom}_R(M, R) \otimes_R R^{\oplus n} \cong M^\vee \otimes_R F,$$

by Corollary 5.3. □

In fact, note that for all R-modules M, N there is a natural ‘evaluation’ map

$$\epsilon : M^\vee \otimes_R N \to \mathrm{Hom}_R(M, N)$$

defined on pure tensors by mapping $f \otimes n$ (for $f \in \mathrm{Hom}_R(M, R)$ and $n \in N$) to the
R-module homomorphism $M \to N$ given by

$$m \mapsto f(m)\, n.$$

The reader will check (Exercise 5.5) that $\epsilon$ is an isomorphism if N is free of finite
rank; this gives a more precise version of Proposition 5.5.
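
The evaluation map can be made concrete in coordinates. The following is a minimal sketch, not a general implementation: it assumes R = Q and takes both M = Q^k and N = Q^l free of finite rank, so that a functional f is a list of k coefficients and a homomorphism M → N is an l × k matrix. The function names (`evaluation`, `decompose`, `sum_of_pure_tensors`) are hypothetical, chosen only for this illustration.

```python
from fractions import Fraction

def evaluation(f, n):
    """Matrix of the map m -> f(m) * n.

    f: coefficients of a functional on Q^k; n: a vector in Q^l.
    The result is the l x k matrix with entry (i, j) equal to n[i] * f[j].
    """
    return [[n_i * f_j for f_j in f] for n_i in n]

def decompose(A):
    """Write an l x k matrix A as a list of pure tensors (f_i, e_i).

    Row i of A is the functional f_i (the i-th coordinate of A*m),
    and e_i is the i-th standard basis vector of Q^l.
    """
    rows = len(A)
    terms = []
    for i, row in enumerate(A):
        e_i = [Fraction(int(i == p)) for p in range(rows)]
        terms.append((row, e_i))
    return terms

def sum_of_pure_tensors(terms):
    """Apply the evaluation map to a sum of pure tensors f ⊗ n."""
    rows, cols = len(terms[0][1]), len(terms[0][0])
    total = [[Fraction(0)] * cols for _ in range(rows)]
    for f, n in terms:
        E = evaluation(f, n)
        total = [[t + e for t, e in zip(tr, er)] for tr, er in zip(total, E)]
    return total

A = [[Fraction(1), Fraction(2), Fraction(3)],
     [Fraction(4), Fraction(5), Fraction(6)]]
recovered = sum_of_pure_tensors(decompose(A))
```

Here `recovered` equals `A`: every matrix is hit by the evaluation map, which is the surjectivity half of the isomorphism in this special case.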

5.3. Duals of free modules. By definition, the ‘duality’ functor $M \mapsto M^\vee$ is a
particular case of the contravariant flavor of Hom; hence it is itself contravariant
and commutes with limits. Here is a first, immediate consequence:

Lemma 5.6. For every family $\{M_i\}$ of R-modules, $\big(\bigoplus_i M_i\big)^\vee \cong \prod_i M_i^\vee$.

Specializing to the case in which $M_i = R$ for all i determines the dual of a free
module:

Corollary 5.7. The dual of a free module is isomorphic to a product of copies
of R:

$$(R^{\oplus S})^\vee \cong R^S.$$

In particular, $(R^n)^\vee \cong R^n$: if F is a free R-module of finite rank, then $F^\vee \cong F$.

Proof. This follows from Lemma 5.6 and the fact that $R^\vee = \mathrm{Hom}_R(R, R)$ is
isomorphic to R. □

Carefully note the magically disappearing ⊕ from the left-hand side to the
right-hand side in the statement of Corollary 5.7: direct sums become direct
products through the contravariant Hom (Corollary 5.3). However, a direct product of
finitely many modules happens to be isomorphic to their direct sum (as we have
known for a long time: Proposition III.6.1), hence the second part of the statement.

Remark 5.8. If S is infinite, $R^S$ is ‘much larger’ than $R^{\oplus S}$: the first module
consists of all functions $S \to R$, while the second only retains those that are zero
for all but finitely many $s \in S$.

Remark 5.9. Note that the set S is not determined by the isomorphism class
of a free module $F \cong R^{\oplus S}$. We proved in Corollary VI.1.11 (if R is an integral
domain) that the cardinality of S is determined by the isomorphism class of F, but
of course S itself, say as a subset of F, is not. In fact, recall (§VI.1.2) that the
choice of a specific isomorphism $F \cong R^{\oplus S}$ is equivalent to the choice of a basis
of F. Now, the isomorphism appearing in Corollary 5.7 requires knowledge of S;
indeed, we will verify in a moment (Example 5.11) that this isomorphism, even in
the finite case, does depend on the choice of a basis. It is not canonical! The reader
should endeavor to remember this slogan: a finite-rank free module (for example,
a finite-dimensional vector space) is isomorphic to its dual, but not canonically.


This remark can be clarified by means of the following notion. 


Definition 5.10. Consider the standard basis $(e_1, \dots, e_n)$ of $R^n$. The dual basis
of $(R^n)^\vee \cong R^n$ consists of $(\check e_1, \dots, \check e_n)$, where $\check e_i \in (R^n)^\vee = \mathrm{Hom}_R(R^n, R)$ is
determined by

$$\check e_i(e_j) := \begin{cases} 1 & \text{if } j = i, \\ 0 & \text{otherwise.} \end{cases}$$

The isomorphism $(R^n)^\vee \cong R^n$ of Corollary 5.7 is obtained precisely by matching
$\check e_i$ and $e_i$: this will be crystal clear to the reader who works out Exercise 5.7.
In particular, the vectors $\check e_i$ do form a basis of $(R^n)^\vee$.

Example 5.11. To see that these isomorphisms do depend on the choice of the
basis, consider the standard basis $(e_1, e_2)$ of $R^2$ and the corresponding dual basis
$(\check e_1, \check e_2)$ of $(R^2)^\vee$, and then choose a different basis $(e'_1, e'_2)$ for $R^2$, where $e'_1 = e_1$
and $e'_2 = e_1 + e_2$, and let $(\check e'_1, \check e'_2)$ be the corresponding dual basis. By definition

$$\check e'_1(e'_2) = 0,$$

while

$$\check e_1(e'_2) = \check e_1(e_1 + e_2) = 1 + 0 = 1.$$

Therefore, $\check e'_1 \neq \check e_1$, even if $e'_1 = e_1$: the two isomorphisms $R^2 \cong (R^2)^\vee$ determined
by the two bases are different.
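
The computation in Example 5.11 can be replayed numerically. This is a small sketch under the assumption R = Q: a basis is stored as a list of coordinate vectors, and the dual basis is read off as the rows of the inverse of the matrix whose columns are the basis vectors. The helper names (`inverse_2x2`, `dual_basis`, `apply`) are hypothetical and only serve this illustration.

```python
from fractions import Fraction

def inverse_2x2(B):
    """Inverse of a 2 x 2 matrix with rational entries."""
    (a, b), (c, d) = B
    det = a * d - b * c
    if det == 0:
        raise ValueError("the given vectors do not form a basis")
    return [[d / det, -b / det], [-c / det, a / det]]

def dual_basis(basis):
    """Dual basis of a basis (v_1, v_2) of Q^2.

    Each dual functional is returned as its list of coefficients with
    respect to the standard dual basis; functional i is row i of B^{-1},
    where B has v_1, v_2 as columns.
    """
    B = [[Fraction(basis[0][0]), Fraction(basis[1][0])],
         [Fraction(basis[0][1]), Fraction(basis[1][1])]]
    return inverse_2x2(B)

def apply(functional, v):
    """Evaluate a functional (list of coefficients) on a vector."""
    return sum(c * x for c, x in zip(functional, v))

std = [(1, 0), (0, 1)]       # e_1, e_2
primed = [(1, 0), (1, 1)]    # e'_1 = e_1, e'_2 = e_1 + e_2

dual_std = dual_basis(std)
dual_primed = dual_basis(primed)
```

With these values, `apply(dual_primed[0], primed[1])` is 0 while `apply(dual_std[0], primed[1])` is 1, so the first dual vectors differ even though the first basis vectors agree.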


The fact that duality leads to noncanonical isomorphisms is somewhat unpleasant,
but something magical will happen soon (§5.5): we will recover a canonical
(that is, independent of any choice) isomorphism (on finite-rank free modules) by
applying the duality functor twice. Whatever twist is introduced by the duality
functor will be untwisted by applying duality again.


5.4. Duality and exactness. Before seeing this, let us contemplate another
immediate consequence of the fact that duality is a particular case of the contravariant
Hom:

Lemma 5.12. The duality functor is left-exact: every exact sequence

$$L \to M \to N \to 0$$

of R-modules induces an exact sequence

$$0 \to N^\vee \to M^\vee \to L^\vee.$$

Proof. This is an immediate consequence of the left-exactness of Hom. □

The duality functor is not exact in general: for example, taking duals in the
exact sequence of abelian groups (a.k.a. $\mathbb{Z}$-modules)

$$0 \to \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0$$

gives the sequence

$$(*) \qquad 0 \to 0 \to \mathbb{Z}^\vee \xrightarrow{\ \gamma\ } \mathbb{Z}^\vee \to 0$$

since the dual of $\mathbb{Z}/2\mathbb{Z}$ is zero (Exercise 5.6). The map $\gamma$ in this sequence is the
dual of the multiplication by 2. If $f : \mathbb{Z} \to \mathbb{Z}$ is an element of $\mathbb{Z}^\vee$, then $\gamma(f)$ is
obtained by composition:

$$\gamma(f) : \ \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \xrightarrow{\ f\ } \mathbb{Z}.$$

By linearity, $\gamma(f)(n) = f(2n) = 2f(n)$ is even for all $n \in \mathbb{Z}$. In particular, the
identity $\mathbb{Z} \to \mathbb{Z}$ (as an element of $\mathbb{Z}^\vee$) is not in the image of $\gamma$. Thus $\gamma$ is not
surjective, and the sequence (*) is not exact.
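
The failure of surjectivity can be checked mechanically in a small range. The sketch below is only an illustration, not a proof: it uses the fact that a homomorphism Z → Z is determined by the image c of 1, models it as the function n ↦ c·n, and searches a finite (arbitrarily chosen) window of values of c. The names `functional` and `gamma` are hypothetical.

```python
def functional(c):
    """The element of the dual of Z sending 1 to c, that is, n -> c * n."""
    return lambda n: c * n

def gamma(f):
    """Dual of multiplication by 2: precompose f with n -> 2 * n."""
    return lambda n: f(2 * n)

identity = functional(1)

# A homomorphism Z -> Z is determined by its value at 1, so it suffices
# to compare values at 1. The window -50..50 is an arbitrary choice.
identity_in_image = any(
    gamma(functional(c))(1) == identity(1) for c in range(-50, 51)
)
```

In this window `identity_in_image` is `False`, in agreement with the observation that every value of γ(f) is even.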

There are situations in which duality is exact; we will understand this better 
after digesting §6, but the following observation will suffice for now. 

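Proposition 5.13. Let

$$0 \to M \xrightarrow{\ \mu\ } N \xrightarrow{\ \nu\ } P \to 0$$

be an exact sequence of R-modules, with P free. Then the induced sequence

$$0 \to P^\vee \xrightarrow{\ \nu^\vee\ } N^\vee \xrightarrow{\ \mu^\vee\ } M^\vee \to 0$$

is exact.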

This is also not new to the diligent readers, as it is a particular case of
Exercise III.7.8 and, further, it is a consequence of Exercise 2.23. But here is a direct
argument:

Proof. Lemma 5.12 takes care of all but the surjectivity of the map $N^\vee \to M^\vee$
induced from $\mu : M \to N$:

$$M \xrightarrow{\ \mu\ } N \xrightarrow{\ g\ } R, \qquad g \circ \mu = f.$$

The question is whether every R-linear $f : M \to R$ can be extended to an R-linear
map $g : N \to R$ so that $f = g \circ \mu$, that is, $f = \mu^\vee(g)$.

As P is free, $P \cong F^R(S) = R^{\oplus S}$ for some set S. Choosing (arbitrarily!)
preimages in N of the standard basis vectors $e_s \in R^{\oplus S}$ gives a set-function $S \to N$,
extending to an R-linear map $\rho : P \to N$ by the universal property of free modules.
By construction, $\nu \circ \rho = \mathrm{id}_P$. Now let $n \in N$. Then $\nu(\rho(\nu(n))) = \nu(n)$, giving
$\nu(n - \rho(\nu(n))) = 0$. By the exactness of the given sequence, there exists $m \in M$ such that

$$n - \rho(\nu(n)) = \mu(m).$$

The element m is unique, since $\mu$ is injective. Setting

$$g(n) := f(m)$$

gives the necessary extension. Indeed, g is immediately checked to be R-linear, and
$g(\mu(m)) = f(m)$ by definition. □


In particular, 

Corollary 5.14. The duality functor is exact on vector spaces. 

5.5. Duals and matrices; biduality. Even without the benefit of full exactness,
Lemma 5.12 and Corollary 5.7 reduce (in principle) the computation of the dual of
any finitely presented module to matrix calculus. If

$$R^n \xrightarrow{\ \alpha\ } R^m \to M \to 0$$

is a presentation of an R-module M, then the dual $M^\vee$ is identified with the kernel
of the dual of $\alpha$:

$$0 \to M^\vee \to R^m \xrightarrow{\ \alpha^\vee\ } R^n.$$

Here we are applying isomorphisms $(R^m)^\vee \cong R^m$, $(R^n)^\vee \cong R^n$ from Corollary 5.7.
The map $\alpha^\vee$ is obtained by applying the functor $\mathrm{Hom}_R(\_, R)$ to the map $\alpha$. Once
bases are chosen, $\alpha$ is determined by an m × n matrix; likewise, $\alpha^\vee$ corresponds
to an n × m matrix. What is the relation between these two matrices? Of course
there is only one sensible answer to this question, and it is correct:

Lemma 5.15. Let A be the matrix representing a linear map $\alpha : R^n \to R^m$
with respect to the standard bases. Then the dual map $\alpha^\vee : (R^m)^\vee \to (R^n)^\vee$ is
represented by the transpose of A with respect to the corresponding dual bases.

The (easy) verification of this fact is left to the reader (Exercise 5.10). The
upshot is that the dual of the cokernel of a matrix A is the kernel of the transpose
of A.
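
As a sanity check of Lemma 5.15 (not a substitute for Exercise 5.10), the matrix of the dual map can be computed straight from the definitions and compared with the transpose. The sketch assumes integer or rational entries and represents matrices as lists of rows; `mat_vec`, `std_basis` and `dual_map_matrix` are hypothetical helper names.

```python
def mat_vec(A, v):
    """Product of the matrix A (list of rows) with the vector v."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def std_basis(n):
    """Standard basis e_1, ..., e_n of R^n as coordinate lists."""
    return [[int(i == j) for i in range(n)] for j in range(n)]

def dual_map_matrix(A):
    """Matrix of the dual map with respect to the dual bases.

    The dual map sends the i-th dual basis vector of (R^m)^v to the
    functional v -> (A v)[i] on R^n. The coordinates of a functional g
    in the dual basis of (R^n)^v are g(e_1), ..., g(e_n), so column i of
    the result is the list of values (A e_j)[i] for j = 1, ..., n.
    """
    m, n = len(A), len(A[0])
    columns = []
    for i in range(m):
        columns.append([mat_vec(A, e_j)[i] for e_j in std_basis(n)])
    # Assemble the n x m matrix whose i-th column is columns[i].
    return [[columns[i][j] for i in range(m)] for j in range(n)]

A = [[1, 2, 3],
     [4, 5, 6]]
A_dual = dual_map_matrix(A)
```

For this 2 × 3 matrix, `A_dual` is the 3 × 2 matrix `[[1, 4], [2, 5], [3, 6]]`, that is, the transpose of `A`.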

Left-exactness of duality implies restrictions on which modules may appear as
duals of other modules:

Proposition 5.16. Let R be an integral domain, and let M be an R-module. Then
$M^\vee$ is torsion-free.

Proof. There is a surjection $R^{\oplus S} \to M$, thus, an exact sequence

$$R^{\oplus T} \to R^{\oplus S} \to M \to 0.$$

Dualizing, $M^\vee$ is realized as the kernel of the induced map $R^S \to R^T$: hence $M^\vee$
may be identified with a submodule of a product $R^S$. The latter has no nonzero
torsion, so neither does $M^\vee$. □






The diligent reader has proved this already in Exercise VI.4.2 and probably
by a more direct way than what we did just now, but never mind. In any case,
remember that dualizing kills torsion. Practice this by working out Exercise 5.11.

Even if not every module is the dual of a module, we may wonder whether an
arbitrarily given module M has some relation with a suitably chosen dual, and this
is indeed the case: there is a canonical map to the ‘double-dual’

$$\omega : M \to M^{\vee\vee}.$$

To define this, consider the R-bilinear map

$$M^\vee \times M \to R$$

sending $(f, m)$ to $f(m)$. For every $m \in M$, we get²³ an R-linear map $M^\vee \to R$,
$f \mapsto f(m)$, in other words, an element $\omega(m)$ of $M^{\vee\vee}$. It is immediately checked
that $\omega$ is R-linear.

By Proposition 5.16, $M^{\vee\vee}$ is torsion-free. In fact, the double-dual construction
is a standard way to ‘clean up’ a module, removing its torsion.

The map $\omega$ is interesting even when torsion is not an issue. As we have seen
in Corollary 5.7, if F is a finite-rank free R-module, then a choice of basis for F
determines an isomorphism $F \cong F^\vee$; but this does indeed depend on the basis, so
it is not a canonical isomorphism. Doing it twice,

$$F \xrightarrow{\ \sim\ } F^\vee \xrightarrow{\ \sim\ } F^{\vee\vee},$$

may seem a priori bound to make the situation even worse: surely, if we compose
two noncanonical maps, then we cannot expect to get something canonical,
right? Well, we do in this case, since this composition is nothing but the map $\omega$
(Exercise 5.13) and $\omega$ knows nothing about choices of bases or other such ambiguities.
Therefore, our slogan admits the following addendum: finite-rank free modules
are noncanonically isomorphic to their duals and canonically isomorphic to their
double-duals.

In terms of matrices (and keeping in mind that duality is exact on free modules) 
this is perhaps not too surprising. After all, taking the transpose of a matrix twice 
simply gives the matrix back; thus, even if making sense of the homomorphism 
corresponding to the transpose of a matrix may depend on the choice of a basis, 
surely such a choice is inessential if the transpose is taken twice. 

Incidentally, a module M is reflexive if the corresponding map $\omega$ is an
isomorphism. Thus, reflexive modules are necessarily torsion-free and finite-rank free
modules are reflexive.

5.6. Duality on vector spaces. In all these considerations, the case of vector 
spaces over a field k is only special in that vector spaces are free. Collecting the 
foregoing observations (including Exercise 5.1) for this particular case, we have the 
following: 

• Duality is a contravariant, exact functor on k-Vect.

• If V, W are vector spaces, then $(V \oplus W)^\vee \cong V^\vee \oplus W^\vee$.

• If V, W are vector spaces and dim V < ∞ or dim W < ∞, then $\mathrm{Hom}_k(V, W)$
is isomorphic to $V^\vee \otimes_k W$.

• If $\dim_k V < \infty$, a choice of a basis on V determines an isomorphism $V \cong V^\vee$.

• Finite-dimensional vector spaces are reflexive: if $\dim_k V < \infty$, then the
canonical map $\omega : V \to V^{\vee\vee}$ is an isomorphism.

²³ The reader with a categorical frame of mind can view this a little more formally by working
out Exercise 5.12.


Exercises 


As usual, R is a fixed commutative ring unless stated otherwise. 

5.1. ▷ Prove that if F is a free R-module of finite rank and N is any R-module,
then $\mathrm{Hom}_R(F, N) \cong F^\vee \otimes_R N$. [§5.6]

5.2. ¬ Let $\alpha : A \to B$ be a homomorphism of R-modules. Prove that $\alpha$ is an
epimorphism if the induced map²⁴ $\alpha^* : \mathrm{Hom}_R(B, N) \to \mathrm{Hom}_R(A, N)$ is injective
for all R-modules N and $\alpha$ is a monomorphism if $\alpha^*$ is surjective for all N.

Prove that the converse to the first statement holds and the converse to the
second statement does not hold. However, show that if $\alpha$ admits a left-inverse, then
$\alpha^*$ is surjective for all N. [5.3, 6.3]

5.3. Prove that a sequence

$$0 \to A \xrightarrow{\ \alpha\ } B \xrightarrow{\ \beta\ } C \to 0$$

of R-modules is exact if the induced sequence

$$(*) \qquad 0 \to \mathrm{Hom}_R(C, N) \to \mathrm{Hom}_R(B, N) \to \mathrm{Hom}_R(A, N) \to 0$$

is exact for all R-modules N. (You have done most of this already, in Exercise 5.2.
To show $\ker \beta \subseteq \mathrm{im}\, \alpha$, choose $N = B/\mathrm{im}(\alpha)$.) Remember that the converse does
not hold, since in general $\mathrm{Hom}_R(\_, N)$ is not exact. What extra hypothesis on $\alpha$
would guarantee the exactness of (*) for all N?

5.4. ¬ Let I be an ideal of R. As $I^2 \subseteq I$, there is a natural restriction map
$\mathrm{Hom}_R(I, R/I) \to \mathrm{Hom}_R(I^2, R/I)$. Prove that the image of this map is 0. Prove
that $\mathrm{Hom}_R(I/I^2, R/I) \cong \mathrm{Hom}_R(I, R/I)$. (This module is important in algebraic
geometry, as it carries the information of a ‘normal bundle’ in good situations.)
[6.20]

5.5. ▷ Prove that the evaluation map $M^\vee \otimes_R F \to \mathrm{Hom}_R(M, F)$ is an isomorphism
if F is free of finite rank, providing an alternative proof of Proposition 5.5. [§5.2]

5.6. ▷ Show that $(\mathbb{Z}/2\mathbb{Z})^\vee = \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}/2\mathbb{Z}, \mathbb{Z}) = 0$. [§5.4]

5.7. ▷ Prove ‘directly’ that $(R^{\oplus S})^\vee \cong R^S$: how does an R-linear map $R^{\oplus S} \to R$
determine a function $S \to R$, and what is the inverse of this correspondence? [§5.3]

²⁴ Note that the ‘set-theoretic’ quality of $\alpha^*$ is all that is needed here; in this problem, the
R-module structure of $\mathrm{Hom}_R$ is irrelevant. Thus, the result also holds for noncommutative R.

5.8. Prove that the datum of an R-linear map $M \to M^\vee$ is equivalent to the datum
of an R-bilinear map $M \times M \to R$, and explain why this equivalence can be set up
in two ways. If $F = R^n$ is a free R-module of finite rank, determine the bilinear
map $F \times F \to R$ corresponding to the isomorphism $F \cong F^\vee$ given in Corollary 5.7.

5.9. An R-bilinear map $\varphi : M \times M \to R$ is nondegenerate if the induced maps
$M \to M^\vee$ are injective, and it is nonsingular if they are isomorphisms. The notions
coincide if M is a finite-dimensional vector space. Prove that the ‘standard inner
product’ in $\mathbb{R}^n$ (defined in Exercise VI.6.18) is nondegenerate.

If M is free of rank n, let $(e_1, \dots, e_n)$ be a basis of M, and let $A = (a_{ij})$ be
the matrix with entries $a_{ij} = \varphi(e_i, e_j)$. Prove that $\varphi$ is nondegenerate if and only
if det A is nonzero, and it is nonsingular if and only if det A is a unit.
(Cf. Proposition VI.6.5.)

5.10. Prove Lemma 5.15. 

5.11. ▷ Let M be a finitely generated module over a PID, of rank r. ‘Compute’
the dual $M^\vee$. [§5.5]

5.12. ▷ Let M, N be R-modules. Show that there is a canonical bijection

$$\mathrm{Hom}_R(N, M^\vee) \cong \mathrm{Hom}_R(M, N^\vee).$$

Choosing $N = M^\vee$, the left-hand side has a distinguished element, namely the
identity $M^\vee \to M^\vee$. Prove that the corresponding element on the right is the map
$\omega : M \to M^{\vee\vee}$ defined in §5.5. [§5.5]

5.13. ▷ Let $F \cong R^n$ be a finite-rank free R-module. Verify that the composition
of the (noncanonical) isomorphisms $F \cong F^\vee \cong F^{\vee\vee}$ from Corollary 5.7 is the
(canonical) isomorphism $\omega$ defined in §5.5. [§5.5]

5.14. Let F be a free R-module (of any rank). Prove that the canonical map
$F \to F^{\vee\vee}$ is injective. What is the simplest example you know of a module M such
that $M \to M^{\vee\vee}$ is not injective?

5.15. ¬ Let V be a vector space, and let $W \subseteq V$ be a subspace. The annihilator
of W is²⁵

$$W^\perp := \{ v \in V^\vee \mid (\forall w \in W),\ v(w) = 0 \}.$$

Prove that $W^\perp$ is a subspace of $V^\vee$. If dim V = n and dim W = r, prove that
$\dim W^\perp = n - r$.

Assuming V is finite dimensional, prove that, under the canonical isomorphism
$V^{\vee\vee} \cong V$, $W^{\perp\perp}$ maps isomorphically to W. [5.16, 5.17]

5.16. Let

$$0 \to F_1 \to F_2 \to F_3 \to 0$$

be an exact sequence of free R-modules. Viewing $F_1$ as a submodule of $F_2$ and
extrapolating the notation introduced for vector spaces in Exercise 5.15, prove that
$F_1^\perp \cong F_3^\vee$.

²⁵ The notation is inspired by Exercise VI.6.17, which is a particular case of this problem.
Do you see why?






5.17. ¬ Let V be a vector space of dimension n. Prove that there is a natural
bijection between the Grassmannian $\mathrm{Gr}(r, V)$ of r-dimensional subspaces of V
(cf. Exercise VI.2.13) and the Grassmannian $\mathrm{Gr}(n - r, V^\vee)$ of (n − r)-dimensional
subspaces of the dual of V. (Use Exercise 5.15.)

In particular, the Grassmannian $\mathrm{Gr}_k(n - 1, n)$ has the same structure as the
projective space $\mathbb{P}V = \mathrm{Gr}_k(1, n)$. We could in fact define the projective space
associated to a vector space V of dimension n to be the set of subspaces of
‘codimension 1’ (that is, dimension n − 1) in V. There are reasons why this would be
preferable²⁶, but established conventions are what they are. [VI.2.13]

5.18. Let F be a free R-module of finite rank. For any r ≥ 1, define a multilinear
map

$$\delta : F \times \cdots \times F \times F^\vee \times \cdots \times F^\vee \to R$$

by

$$\delta(v_1, \dots, v_r, w_1, \dots, w_r) = \det\big(w_i(v_j)\big)_{i = 1, \dots, r;\ j = 1, \dots, r}.$$

• Prove that $\delta$ is multilinear and alternating in the first r and in the last r entries.

• Deduce that $\delta$ induces a bilinear map $\delta : \Lambda^r_R(F) \times \Lambda^r_R(F^\vee) \to R$.

• Prove that $\delta$ induces an isomorphism $(\Lambda^r_R(F))^\vee \cong \Lambda^r_R(F^\vee)$.


5.19. Let R be a ring, not necessarily commutative, and let M be a left-R-module.
Prove that $M^\vee$ carries a natural right-R-module structure. Prove that $R^\vee$ is
isomorphic as a ring to the opposite ring R°. (Cf. Exercise III.5.1.)


6. Projective and injective modules and the Ext functors

We have seen in §5.1 that the Hom functors are left-exact, and not exact in general.
Studying this matter further leads us along a path parallel to the one traveled in
the case of tensor products. In that case, we singled out a class of special (‘flat’)
modules for which ⊗ is exact, and we discovered that the lack of exactness of tensor
products may be precisely measured in terms of a whole collection of new (‘Tor’)
functors. Precisely the same situation occurs for the covariant and contravariant
flavors of Hom, and we will close the chapter by summarizing this story.

This brief overview should suffice for standard applications the reader may
encounter. We are also hoping that it will serve as motivation for the next chapter:
the wonderful facts that we will present here without proof (on Ext functors,
analogous to the facts about Tor that were left without proof in §2.4) could be proven
in detail by ad hoc methods, without too much effort; but they find their most
natural setting in the wider apparatus of homological algebra and were in fact at
the root of its development. The reader who bears with us throughout Chapter IX
will see complete proofs of all the facts advertised here and substantially more.

²⁶ The matter is not moot: it is true that there is a bijection between $\mathbb{P}V$ and $\mathrm{Gr}(n - 1, V)$,
but this bijection depends on the choice of a basis. Thus we cannot simply identify $\mathbb{P}V$ and
$\mathrm{Gr}(n - 1, V)$. We can identify $\mathbb{P}V$ and $\mathrm{Gr}(n - 1, V^\vee)$, and it is sometimes useful to do so.

As in the rest of the chapter, R will be a commutative ring. As elsewhere, the
hypothesis of commutativity is essentially unnecessary; it has the convenient
consequence that the objects defined below (such as $\mathrm{Ext}^i_R(M, N)$ for two R-modules M
and N) are naturally endowed with an R-module structure. If R is not necessarily
commutative, they are ‘just’ abelian groups.

6.1. Projectives and injectives. 

Definition 6.1. Let R be a (commutative) ring.

• An R-module P is projective if the functor $\mathrm{Hom}_R(P, \_)$ is exact.

• An R-module Q is injective if the functor $\mathrm{Hom}_R(\_, Q)$ is exact.

Since Hom is left-exact in any case, these definitions admit the following
straightforward translations:

Lemma 6.2. An R-module P is projective if and only if for all epimorphisms of
R-modules $\mu : M \to N$, every R-linear map $p : P \to N$ lifts to an R-linear map
$\hat p : P \to M$, that is, $\mu \circ \hat p = p$.

An R-module Q is injective if and only if for all monomorphisms of R-modules
$\lambda : L \to M$, every R-linear map $q : L \to Q$ extends to an R-linear map $\hat q : M \to Q$,
that is, $\hat q \circ \lambda = q$.

Proof. This is straightforward. Since $\mathrm{Hom}_R(\_, Q)$ is left-exact for all Q, Q is
injective if and only if whenever a sequence

$$0 \to L \to M$$

is exact, then so is the induced sequence

$$\mathrm{Hom}_R(M, Q) \to \mathrm{Hom}_R(L, Q) \to 0.$$

This translates precisely into the given condition. The argument for projective
modules is entirely analogous. □

Example 6.3. Taken as a module over itself, R is projective (because $\mathrm{Hom}_R(R, M)$
is canonically isomorphic to M, for all modules M) but not injective in general
(since the duality functor $\mathrm{Hom}_R(\_, R)$ is not exact in general; cf. §5.4).

A variation on the theme of Lemma 6.2 rephrases the projective/injective
conditions in terms of the splitting of sequences (cf. the end of §III.7.1). For example,
assume that P is projective; then we claim that every exact sequence

$$0 \to L \xrightarrow{\ \lambda\ } M \xrightarrow{\ \mu\ } P \to 0$$

splits, in the sense that there is a submodule P' of M such that $\mu$ restricts to an
isomorphism $P' \to P$ and $M = \lambda(L) \oplus P'$. Indeed, since P is projective, then the
identity $P \to P$ lifts to a homomorphism $\rho : P \to M$, and the reader can then
verify that $P' = \rho(P)$ fits the requirement. Loosely speaking, in this situation we
can simply replace M by $L \oplus P$.

Similarly, if

$$0 \to Q \to M \to N \to 0$$

is exact and Q is injective, then the sequence necessarily splits. For example, $\mathbb{Z}$ is
not an injective $\mathbb{Z}$-module, since the exact sequence

$$0 \to \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0$$

manifestly does not split.

It is not difficult to show that these properties in fact characterize projective
and injective modules. With the terminology introduced in §III.7.1, a module P
is projective if and only if every epimorphism $M \to P$ is a split epimorphism,
and a module Q is injective if and only if every monomorphism $Q \to M$ is a split
monomorphism. Dotting all the i’s and crossing all the t’s is left to the enterprising
reader (Exercise 6.1).

6.2. Projective modules. Lemma 6.2 does not necessarily help in acquiring a 
feeling for projective or injective modules. We are reasonably happy with projective 
modules, because of the following characterization: 

Proposition 6.4. An R-module P is projective if and only if it is a direct summand
of a free module, that is, if and only if there exists a free R-module F, an
R-module K, and an isomorphism $K \oplus P \cong F$.

Proof. Any set S of generators of P determines a surjection of the free module
$F = R^{\oplus S}$ onto P and hence an exact sequence

$$0 \to K \to F \to P \to 0.$$

As observed above, such a sequence necessarily splits if P is projective; thus
$F \cong K \oplus P$ in this case, as needed.

For the converse, assume that $K \oplus P \cong F$ for some free R-module F. It is clear
that F satisfies the lifting property of Lemma 6.2: given an epimorphism $\mu : M \to N$
and an R-linear map $f : F \cong K \oplus P \to N$, there is an R-linear map $\hat f : F \to M$ with

$$\mu \circ \hat f = f$$

(if $F = F^R(S)$, define $\hat f(s)$ to be an arbitrary inverse image of $f(s)$, for all $s \in S$,
and extend to F by the universal property of free modules). It follows easily that the
lifting property holds for P, by judicious use of the natural maps $P \hookrightarrow K \oplus P \twoheadrightarrow P$.
Details are left to the reader. □

For example, free modules are projective. Proposition 6.4 streamlines the
verification of simple properties of projective modules:

Corollary 6.5. Let $P_1$, $P_2$ be projective R-modules. Then $P_1 \oplus P_2$ and $P_1 \otimes_R P_2$
are projective. Projective modules are flat.

Proof. These statements follow easily from Proposition 6.4, the fact that ⊗ is
distributive with respect to ⊕, and the fact that free modules are flat. □

The categorically minded reader should look for alternate proofs of these facts.
For example, if $P_1$ and $P_2$ are both projective, the functor $\mathrm{Hom}_R(P_1, \mathrm{Hom}_R(P_2, \_))$
is exact; by adjunction, this is the functor $\mathrm{Hom}_R(P_1 \otimes_R P_2, \_)$; since this is exact,
$P_1 \otimes_R P_2$ is projective.

A proof via adjunction of the fact that projective modules are flat is
straightforward (Exercise 6.7), using the (nontrivial) fact that R-Mod has enough injectives,
discussed below. As it happens, finitely generated flat modules over, e.g.,
Noetherian rings, are projective; the diligent reader has already done all the work necessary
to prove this, in the exercises.

The fact that free modules are projective implies that R-Mod has enough
projectives: this means that every R-module M admits a projective resolution, that is,
an exact sequence

$$\cdots \to P_3 \to P_2 \to P_1 \to P_0 \to M \to 0.$$

Indeed, a free resolution of M is in particular a projective resolution. Since there
are in general ‘more’ projective modules than free modules, it can in principle be
‘easier’ to construct a projective resolution. The magic of homological algebra shows
that projective resolutions may be used in place of free resolutions in (for example)
computing Tor-modules. This will be clear once we go through some homological
algebra machinery, in Chapter IX, especially §IX.8.

6.3. Injective modules. We may have inadvertently communicated to the reader 
the message that every categorical notion leads in a straightforward way to a mirror 
notion, by ‘just reversing arrows’. This would seem to be the case for injective 
vs. projective modules: the definition of one kind is indeed obtained from the 
definition of the other by simply reversing arrows in the key diagrams. 

However, this is as far as this naive perception can go. A seemingly unavoidable
asymmetry of nature manifests itself here, making injective modules look more
mysterious than projectives. For example, there is no ‘easy’ description of injective
modules in the style of Proposition 6.4 (however, cf. Remark 6.13), and while it is
trivially the case that an arbitrary module is surjected upon by a projective (e.g.,
free) module, the ‘mirror’ statement for injectives is certainly not trivial.

The problem is that it is a little hard to picture an injective module: while
R itself is a trivial example of a projective R-module, at this point the reader
would likely find it hard to name a single nonzero injective module over any ring
whatsoever. Well, say over any ring that is not a field: if R is a field, then R itself
is injective. (Why?) But if R is not a field...? Suppose r is a non-zero-divisor in
R, and consider the inclusion of the ideal $(r) \cong R$ in R:

$$0 \to R \xrightarrow{\ \cdot r\ } R;$$

extending the identity on R through the multiplication by r, that is, finding an
R-linear map $R \to R$ whose composition with $\cdot r$ is the identity on R,
requires the existence of $s \in R$ such that rs = 1. It cannot be done unless r is a
unit in R.

This tells us that in general R is not injective as an R-module and puts the
finger on the ‘problem’: for Q to be injective, we must in particular be able to
extend any map $I \to Q$ from an ideal I of R to a map $R \to Q$. A nice application
of Zorn’s lemma shows that this property characterizes injective modules:

Theorem 6.6. An R-module Q is injective if and only if every R-linear map
$f : I \to Q$, with I an ideal of R, extends to an R-linear map $\hat f : R \to Q$.

This criterion is attributed to Reinhold Baer, who first introduced and studied
injective modules (ca. 1940).

Proof. The ‘only if’ part of the statement is immediate from the definition of 
injective. To verify the ‘if’ part, assume Q satisfies the stated extension condition, 
let L C M be any inclusion of R-modules, and let q : L — » Q be a given /^linear 
map: 

0 >L * M 



We want to construct an extension q : M — » Q of q. 

Consider the set E of pairs $(\tilde L, \tilde q)$, where $\tilde L$ ranges over the submodules of M
containing L and $\tilde q : \tilde L \to Q$ extends q: $\tilde q|_L = q$. Then E is nonempty, since
$(L, q) \in E$, and we can define a partial order on E by prescribing that
$(L', q') \preceq (L'', q'')$ if $L' \subseteq L''$ and $q''$ extends $q'$. It is clear that every chain has an
upper bound (take the union of the corresponding $\tilde L$), so by Zorn’s lemma there exists
a maximal extension $(\tilde L, \tilde q)$, and we are done if we can prove that $\tilde L = M$.

Arguing by contradiction, assume there exists $m \in M$, $m \notin \tilde L$, and consider the
proper ideal $I = \{r \in R \mid rm \in \tilde L\}$ of R. We can define an R-linear map $f : I \to Q$
by putting

\[ f(r) := \tilde q(rm); \]

by hypothesis, this map extends to an R-linear map $\tilde f : R \to Q$. But then we can
use this map to extend $\tilde q$ beyond $\tilde L$: let $\hat L = \tilde L + \langle m \rangle \supsetneq \tilde L$, and define
$\hat q : \hat L \to Q$ by

\[ \hat q(\ell + rm) := \tilde q(\ell) + \tilde f(r) \]

for $\ell \in \tilde L$, $r \in R$. This is immediately checked to be well-defined and clearly
extends $\tilde q$. But then $(\hat L, \hat q) \in E$ and $(\tilde L, \tilde q) \prec (\hat L, \hat q)$, contradicting the maximality
of $(\tilde L, \tilde q)$ and concluding the proof. □
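
The well-definedness claim in the last step of the proof can be checked as follows. This is a sketch; the notation is chosen here for readability: $(\tilde L, \tilde q)$ is the maximal pair, $\tilde f$ is the extension of f to R, and $\hat q$ is the proposed extension to $\tilde L + \langle m \rangle$.

```latex
% Suppose \ell + rm = \ell' + r'm with \ell, \ell' \in \tilde L and r, r' \in R.
\[
  (r - r')\, m = \ell' - \ell \in \tilde L
  \;\Longrightarrow\; r - r' \in I,
\]
\[
  \tilde f(r) - \tilde f(r') = \tilde f(r - r') = f(r - r')
  = \tilde q\bigl((r - r')\, m\bigr) = \tilde q(\ell') - \tilde q(\ell),
\]
% hence \tilde q(\ell) + \tilde f(r) = \tilde q(\ell') + \tilde f(r'):
% the value of \hat q does not depend on the chosen decomposition.
```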

Theorem 6.6 completely clarifies the notion of injective modules in the case of
PIDs. An R-module D is divisible if rD = D for every non-zero-divisor $r \in R$, that
is, if we can ‘divide’ every element of D by every non-zero-divisor of R.

Corollary 6.7. Let R be a PID. Then an R-module Q is injective if and only if it
is divisible.

Proof. Exercise 6.14. □ 

Example 6.8. Viewed as abelian groups (i.e., ℤ-modules), ℚ and ℚ/ℤ are injective.
More generally, if D is any divisible abelian group and $K \subseteq D$, then D/K is
injective. (Indeed, it is trivially divisible!)
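
For instance, divisibility of ℚ/ℤ can be verified by hand. The following sketch divides an arbitrary class by a nonzero integer n; the letters a, b, n are introduced here for illustration only.

```latex
% Every element of Q/Z is a class a/b + Z with a, b integers, b \neq 0.
\[
  n \cdot \Bigl( \tfrac{a}{nb} + \mathbb Z \Bigr) = \tfrac{a}{b} + \mathbb Z
  \qquad (n \in \mathbb Z,\ n \neq 0),
\]
% so n(Q/Z) = Q/Z for every nonzero n; since Z is a PID, the divisible
% group Q/Z is injective as a Z-module.
```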

Going from friendly PIDs like ℤ to arbitrary rings may still seem challenging,
but here is where some of our previous work pays off. Recall that for every ring
homomorphism $f : S \to R$ we have defined in §3.3 a functor²⁷ $f^!$ : S-Mod → R-Mod,
by setting $f^!(M) = \mathrm{Hom}_S(R, M)$. This functor preserves injectives:

Lemma 6.9. Let $f : S \to R$ be a homomorphism of commutative rings, and let Q
be an injective S-module. Then $f^!(Q)$ is an injective R-module.

Remark 6.10. Similarly, $f^*$ preserves projectives (Exercise 6.6).

Proof. By adjunction (Lemma 3.5),

\[ \mathrm{Hom}_R(\_, f^!(Q)) \cong \mathrm{Hom}_S(f_*(\_), Q) \]

as functors R-Mod → S-Mod. Since $f_*$ is exact (Proposition 3.6) and $\mathrm{Hom}_S(\_, Q)$ is
exact by hypothesis, $\mathrm{Hom}_R(\_, f^!(Q))$ must be exact. This is precisely the statement. □

Since every ring R admits a (unique) map $\iota : \mathbb Z \to R$, we have a good source of
injective objects in R-Mod for every R:

Corollary 6.11. Let R be a commutative ring. If D is any divisible abelian group,
then $\iota^!(D) = \mathrm{Hom}_{\mathbb Z}(R, D)$ is injective in R-Mod.

This result is as close as the author can get to an ‘intuitive’ feeling for the
notion of injective module. For example, at this point we clearly see why ‘R-Mod
has enough injectives’:

27 Watch out: in §3.3 the homomorphism goes from R to S, so the definition of $f^!$ may look
backwards here.
Corollary 6.12. Let M be an R-module. Then M can be identified with a submodule
of an injective R-module.

Proof. We claim that it suffices to show that ℤ-Mod has enough injectives. Indeed,
this will show that there exists a divisible abelian group D such that $M \subseteq D$ (M
is in particular an abelian group); since R-linear maps are in particular ℤ-linear,

\[ M \cong \mathrm{Hom}_R(R, M) \subseteq \mathrm{Hom}_{\mathbb Z}(R, M) \subseteq \mathrm{Hom}_{\mathbb Z}(R, D), \]

and the rightmost module is injective by Corollary 6.11.

Thus, we are reduced to the case R = ℤ and M = A an abelian group. There
is a surjective homomorphism from $\mathbb Z^{\oplus A}$ to A and hence an identification

\[ A \cong \mathbb Z^{\oplus A}/K \]

for some $K \subseteq \mathbb Z^{\oplus A}$. Embedding $\mathbb Z^{\oplus A}$ in $\mathbb Q^{\oplus A}$, we obtain an injective homomorphism

\[ A \cong \mathbb Z^{\oplus A}/K \hookrightarrow \mathbb Q^{\oplus A}/K, \]

and we are done since $\mathbb Q^{\oplus A}/K$ is divisible (Example 6.8). □

As an immediate consequence, every R-module admits an injective resolution:
an exact sequence

\[ 0 \to M \to Q_0 \to Q_1 \to Q_2 \to Q_3 \to \cdots \]

in which every $Q_i$ is injective. Indeed, this is obtained by applying Corollary 6.12
to M to construct $Q_0$ and then applying it again to the cokernel of $M \to Q_0$ to
construct $Q_1$, and so on.
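
As a small illustration over ℤ (the module ℤ/nℤ with n > 0 is chosen here only as an example), this recipe terminates after one step: ℤ/nℤ embeds in the divisible group ℚ/ℤ as the classes of the fractions k/n, and the cokernel of this embedding is again divisible.

```latex
\[
  0 \longrightarrow \mathbb Z/n\mathbb Z
    \xrightarrow{\; 1 \,\mapsto\, \frac{1}{n} + \mathbb Z \;} \mathbb Q/\mathbb Z
    \xrightarrow{\;\cdot n\;} \mathbb Q/\mathbb Z
    \longrightarrow 0
\]
% The kernel of multiplication by n on Q/Z is (1/n)Z/Z, a copy of Z/nZ,
% and multiplication by n is surjective because Q/Z is divisible.
% Both copies of Q/Z are injective Z-modules, so this is an injective resolution.
```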

The fact that R-Mod has enough injectives has a generalization to sheaves,
which is an essential ingredient in the definition of sheaf cohomology, an extremely 
important tool in modern algebraic geometry. 

Remark 6.13. Corollary 6.12 may be refined, proving that every module can in
fact be realized as a submodule of a product $\mathrm{Hom}_{\mathbb Z}(R, \mathbb Q/\mathbb Z)^S$ (this is injective, by
Corollary 6.11 and Exercise 6.15). That is, products of $\iota^!(\mathbb Q/\mathbb Z) = \mathrm{Hom}_{\mathbb Z}(R, \mathbb Q/\mathbb Z)$
play for injectives the role played by free modules (= coproducts of $R = \iota^*(\mathbb Z)$)
for projectives. (Not surprisingly, they are said to be cofree.) It follows that every
injective R-module is a direct summand of $\mathrm{Hom}_{\mathbb Z}(R, \mathbb Q/\mathbb Z)^S$ for some S.

6.4. The Ext functors. The next step to take now is the construction of tools to
‘quantify’ the lack of exactness of $\mathrm{Hom}_R$, in the same sense that the functors $\mathrm{Tor}^R_i$
quantify the lack of exactness of $\otimes_R$. The corresponding functors are called $\mathrm{Ext}^i_R$.
Since there are two functors associated with Hom (the covariant $\mathrm{Hom}_R(M, \_)$ and
the contravariant $\mathrm{Hom}_R(\_, N)$), it would be natural to expect two corresponding
sequences of functors. Amazingly, the same ‘bifunctors’ $\mathrm{Ext}^i_R(\_, \_)$ work for both!
For all R-modules M and N, we have the following:

• For all $i \ge 0$,

\[ \mathrm{Ext}^i_R(M, \_), \qquad \mathrm{Ext}^i_R(\_, N) \]

is a covariant, resp., contravariant, functor R-Mod → R-Mod.
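• For i = 0,

\[ \mathrm{Ext}^0_R(M, N) = \mathrm{Hom}_R(M, N). \]

• For every exact sequence

\[ 0 \to A \to B \to C \to 0 \]

of R-modules, there are long exact sequences

\[
\begin{aligned}
0 &\to \mathrm{Hom}_R(M, A) \to \mathrm{Hom}_R(M, B) \to \mathrm{Hom}_R(M, C) \\
  &\xrightarrow{\;\delta_0\;} \mathrm{Ext}^1_R(M, A) \to \mathrm{Ext}^1_R(M, B) \to \mathrm{Ext}^1_R(M, C) \\
  &\xrightarrow{\;\delta_1\;} \mathrm{Ext}^2_R(M, A) \to \mathrm{Ext}^2_R(M, B) \to \mathrm{Ext}^2_R(M, C) \to \cdots
\end{aligned}
\]

and

\[
\begin{aligned}
0 &\to \mathrm{Hom}_R(C, N) \to \mathrm{Hom}_R(B, N) \to \mathrm{Hom}_R(A, N) \\
  &\xrightarrow{\;\delta'_0\;} \mathrm{Ext}^1_R(C, N) \to \mathrm{Ext}^1_R(B, N) \to \mathrm{Ext}^1_R(A, N) \\
  &\xrightarrow{\;\delta'_1\;} \mathrm{Ext}^2_R(C, N) \to \mathrm{Ext}^2_R(B, N) \to \mathrm{Ext}^2_R(A, N) \to \cdots
\end{aligned}
\]

for suitable natural connecting morphisms $\delta_i$, $\delta'_i$.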


This will be proven in Chapter IX. In any case, just knowing that such wonderful 
things exist gives us enough information to be able to play with them a little. For 
example, 

— if $\mathrm{Ext}^1_R(P, \_) = 0$, then P is projective;

— if $\mathrm{Ext}^1_R(\_, Q) = 0$, then Q is injective.

Indeed, the exactness of the corresponding $\mathrm{Hom}_R(P, \_)$, resp., $\mathrm{Hom}_R(\_, Q)$, follows
immediately from the exact sequences displayed above.
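
Spelled out for the first claim (a sketch; the short exact sequence 0 → A → B → C → 0 is arbitrary), the relevant piece of the first long exact sequence reads:

```latex
\[
  0 \to \mathrm{Hom}_R(P, A) \to \mathrm{Hom}_R(P, B) \to \mathrm{Hom}_R(P, C)
    \xrightarrow{\;\delta_0\;} \mathrm{Ext}^1_R(P, A) = 0 ,
\]
% so Hom_R(P, B) -> Hom_R(P, C) is surjective for every epimorphism B -> C:
% Hom_R(P, _) is exact, i.e., P is projective.
```

The claim for Q is obtained in the same way from the second long exact sequence, with $\mathrm{Ext}^1_R(C, Q) = 0$ in place of $\mathrm{Ext}^1_R(P, A) = 0$.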

In fact, pushing the envelope a little will tell us much more. Recall the procedure
that defines $\mathrm{Tor}^R_i(\_, N)$ from the functor $\_ \otimes_R N$: to compute the value of
these functors at a module M,

• find a free resolution of M:

\[ \cdots \to R^{\oplus S_2} \to R^{\oplus S_1} \to R^{\oplus S_0} \to M \to 0; \]

• apply the functor $\_ \otimes_R N$ to the free part, obtaining a complex $M_\bullet \otimes N$:

\[ \cdots \to N^{\oplus S_2} \to N^{\oplus S_1} \to N^{\oplus S_0} \to 0; \]

• define Tor by taking the homology of this complex:

\[ \mathrm{Tor}^R_i(M, N) := H_i(M_\bullet \otimes N). \]

We also pointed out that projective resolutions may be used in place of free
resolutions²⁸. This strategy is an example of the general procedure used to ‘derive’
functors. Can you dream up how the same strategy may be applied in order to
compute Ext-modules? Don’t read ahead until you have thought a little about
this!

28 In fact, resolutions by flat modules suffice; this will be proven in Chapter IX.

Welcome back. In the case of $\mathrm{Ext}^i_R(M, N)$, there would seem to be two natural
and possibly different ways to go:

— One could start from a projective resolution of M:

\[ \cdots \to P_2 \to P_1 \to P_0 \to M \to 0; \]

apply the contravariant $\mathrm{Hom}_R(\_, N)$ to the projective part, obtaining a new complex
$\mathrm{Hom}_R(M_\bullet, N)$, i.e.,

\[ 0 \to \mathrm{Hom}_R(P_0, N) \to \mathrm{Hom}_R(P_1, N) \to \mathrm{Hom}_R(P_2, N) \to \cdots, \]

and take cohomology²⁹, proposing the definition

\[ \mathrm{Ext}^i_R(M, N) := H^i(\mathrm{Hom}_R(M_\bullet, N)). \]

— Or one could start from an injective resolution of N:

\[ 0 \to N \to Q_0 \to Q_1 \to Q_2 \to \cdots; \]

apply the covariant $\mathrm{Hom}_R(M, \_)$ to the injective part, obtaining a new complex
$\mathrm{Hom}_R(M, N_\bullet)$, i.e.,

\[ 0 \to \mathrm{Hom}_R(M, Q_0) \to \mathrm{Hom}_R(M, Q_1) \to \mathrm{Hom}_R(M, Q_2) \to \cdots, \]

and again take cohomology, leading to

\[ \mathrm{Ext}^i_R(M, N) := H^i(\mathrm{Hom}_R(M, N_\bullet)). \]

These definitions are independent of all choices and agree with each other! 

Again, the reader will have to wait for Chapter IX to see a proof that this is 
the case and that the resulting modules do satisfy the requirements spelled out at 
the beginning of this subsection. As in the case of Tor, the impatient reader can get 
a good feel for the needed arguments by working out the case of finitely generated 
modules over PIDs, making good use of the snake lemma. 

The reader who is willing to take our word for now and believe all these beautiful
results is already in a position to perform many computations and to understand 
injective/projective modules (even) better. For example, 

Proposition 6.14. An R-module P is projective if and only if $\mathrm{Ext}^1_R(P, \_) = 0$, if
and only if $\mathrm{Ext}^i_R(P, \_) = 0$ for all i > 0.

An R-module Q is injective if and only if $\mathrm{Ext}^1_R(\_, Q) = 0$, if and only if
$\mathrm{Ext}^i_R(\_, Q) = 0$ for all i > 0.

Proof. The second assertion: we have already noted that Q is injective if
$\mathrm{Ext}^1_R(\_, Q) = 0$, and this is trivially the case if $\mathrm{Ext}^i_R(\_, Q) = 0$ for all i > 0. So we
just have to prove that $\mathrm{Ext}^i_R(\_, Q) = 0$ for all i > 0 if Q is injective. This is immediate
from the second definition given above: if Q is injective, then

\[ 0 \to Q \xrightarrow{\ \mathrm{id}\ } Q \to 0 \to 0 \to \cdots \]

is an injective resolution of Q; thus, for any R-module M, $\mathrm{Ext}^*_R(M, Q)$ is the cohomology
of the complex

\[ 0 \to \mathrm{Hom}_R(M, Q) \to 0 \to 0 \to \cdots, \]

giving $\mathrm{Ext}^0_R(M, Q) = \mathrm{Hom}_R(M, Q)$ and $\mathrm{Ext}^i_R(M, Q) = 0$ for i > 0 and all M.

The first assertion is proven similarly, using the first definition given above
for Ext. □

29 Recall from §III.7.1 that the homology of complexes with increasing (rather than
decreasing) indices is called cohomology; this is usually denoted $H^\bullet$, with ‘upper’ indices.
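
For comparison, the first assertion can be sketched in the same way, assuming (as stated above) that Ext may be computed from any projective resolution of the first argument: a projective module P is its own projective resolution.

```latex
% Projective resolution of P, with the projective part concentrated in degree 0:
\[ \cdots \to 0 \to 0 \to P \xrightarrow{\ \mathrm{id}\ } P \to 0 . \]
% Applying Hom_R(_, N) to the projective part gives the complex
\[ 0 \to \mathrm{Hom}_R(P, N) \to 0 \to 0 \to \cdots , \]
% so Ext^0_R(P, N) = Hom_R(P, N) and Ext^i_R(P, N) = 0 for all i > 0 and all N.
```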

6.5. $\mathrm{Ext}^*_{\mathbb Z}(G, \mathbb Z)$. We will attempt to convince the reader that computing Ext
modules is not too unreasonable, by discussing the computation of $\mathrm{Ext}^*_{\mathbb Z}(G, \mathbb Z)$ for
an arbitrary finitely generated abelian group G.

— Since Hom commutes with finite direct sums (Corollary 5.3), so does Ext;
this is an easy consequence of the definitions given above.

— By the classification theorem for finitely generated abelian groups, we are
reduced to computing $\mathrm{Ext}^i_{\mathbb Z}(\mathbb Z, \mathbb Z)$ and $\mathrm{Ext}^i_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Z)$, for all m > 0 and $i \ge 0$.

— The first module is $\mathrm{Hom}_{\mathbb Z}(\mathbb Z, \mathbb Z) = \mathbb Z$ for i = 0, and it is 0 for i > 0: indeed,
this follows from Proposition 6.14, since ℤ is projective.

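— The second module: use the projective resolution

\[ 0 \to \mathbb Z \xrightarrow{\;\cdot m\;} \mathbb Z \to \mathbb Z/m\mathbb Z \to 0; \]

this yields that $\mathrm{Ext}^*_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Z)$ is the cohomology of the complex

\[ 0 \to \mathrm{Hom}_{\mathbb Z}(\mathbb Z, \mathbb Z) \xrightarrow{\;\cdot m\;} \mathrm{Hom}_{\mathbb Z}(\mathbb Z, \mathbb Z) \to 0 \to 0 \to \cdots, \]

that is,

\[ 0 \to \mathbb Z \xrightarrow{\;\cdot m\;} \mathbb Z \to 0 \to 0 \to \cdots, \]

confirming that $\mathrm{Ext}^0_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Z) = \mathrm{Hom}_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Z) = 0$ (ℤ has no torsion) and
computing $\mathrm{Ext}^1_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Z) \cong \mathbb Z/m\mathbb Z$, with vanishing $\mathrm{Ext}^2$ and higher.

— The conclusion is that $\mathrm{Ext}^0_{\mathbb Z}(G, \mathbb Z)$ picks up the free part of G, while $\mathrm{Ext}^1_{\mathbb Z}(G, \mathbb Z)$
is isomorphic to the torsion part, and $\mathrm{Ext}^i_{\mathbb Z}(G, \mathbb Z) = 0$ for $i \ge 2$ (as should have
been expected; cf. Exercise 6.18).

In particular, if G is a finite abelian group, then

\[ G \cong \mathrm{Ext}^1_{\mathbb Z}(G, \mathbb Z); \]

as it happens, this isomorphism is not canonical.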
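
As a worked instance of the steps above (the group G below is chosen here purely as an example), additivity of Ext over finite direct sums reduces everything to the cyclic summands:

```latex
\[
  G = \mathbb Z^{2} \oplus \mathbb Z/4\mathbb Z \oplus \mathbb Z/6\mathbb Z :
\]
\[
  \mathrm{Ext}^0_{\mathbb Z}(G, \mathbb Z) \cong \mathbb Z^{2},
  \qquad
  \mathrm{Ext}^1_{\mathbb Z}(G, \mathbb Z) \cong \mathbb Z/4\mathbb Z \oplus \mathbb Z/6\mathbb Z,
  \qquad
  \mathrm{Ext}^i_{\mathbb Z}(G, \mathbb Z) = 0 \quad (i \ge 2).
\]
% Each copy of Z contributes Z in degree 0 and nothing else;
% each Z/mZ contributes Z/mZ in degree 1 and nothing else.
```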

Another Ext computation gives a different interpretation of this group. The
exact sequence

\[ 0 \to \mathbb Z \to \mathbb Q \to \mathbb Q/\mathbb Z \to 0 \]

is an injective resolution of ℤ; thus, $\mathrm{Ext}^*_{\mathbb Z}(G, \mathbb Z)$ is the cohomology of

\[ 0 \to \mathrm{Hom}_{\mathbb Z}(G, \mathbb Q) \to \mathrm{Hom}_{\mathbb Z}(G, \mathbb Q/\mathbb Z) \to 0 \to \cdots . \]

If G is torsion (for example, if G is finite), it follows that $\mathrm{Ext}^1_{\mathbb Z}(G, \mathbb Z)$ is isomorphic
to $\mathrm{Hom}_{\mathbb Z}(G, \mathbb Q/\mathbb Z)$.

This group is the Pontryagin dual of G, denoted $\widehat G$. Thus, we have verified
that if G is a finite abelian group, then G is isomorphic (not canonically) to its
Pontryagin dual $\widehat G = \mathrm{Hom}_{\mathbb Z}(G, \mathbb Q/\mathbb Z)$. Of course this fact is not difficult to check
directly, but it is immediate from the point of view of Ext.
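
The direct check is short for a cyclic group. Here is a sketch for G = ℤ/mℤ with m > 0; the general finite case then follows from the classification theorem, since Hom commutes with finite direct sums.

```latex
% A homomorphism Z/mZ -> Q/Z is determined by the image x of the class of 1,
% subject only to the condition m x = 0 in Q/Z.
\[
  \mathrm{Hom}_{\mathbb Z}(\mathbb Z/m\mathbb Z, \mathbb Q/\mathbb Z)
  \;\cong\; \{\, x \in \mathbb Q/\mathbb Z \mid m x = 0 \,\}
  \;=\; \tfrac{1}{m}\mathbb Z/\mathbb Z
  \;\cong\; \mathbb Z/m\mathbb Z .
\]
```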

Finally, the notation Ext is due to the fact that one can construct a bijection
between the group $\mathrm{Ext}^1_R(M, N)$ and the set of equivalence classes of extensions
of M by N, that is, of exact sequences

\[ 0 \to N \to E \to M \to 0 \]

modulo a suitable isomorphism relation (cf. §IV.5.2). For example, the direct sum
$E = M \oplus N$ corresponds to the 0 element of this Ext group. The diligent reader
will construct this bijection in the exercises.
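
A standard small example over ℤ may help fix ideas. It is not taken from the text, but it is consistent with the computations of §6.5: resolving ℤ/2ℤ by 0 → ℤ → ℤ → ℤ/2ℤ → 0 and applying $\mathrm{Hom}_{\mathbb Z}(\_, \mathbb Z/2\mathbb Z)$ gives a complex whose only map is multiplication by 2, that is, zero.

```latex
\[
  0 \to \mathbb Z/2\mathbb Z \xrightarrow{\;\cdot 2 \,=\, 0\;} \mathbb Z/2\mathbb Z \to 0
  \quad\Longrightarrow\quad
  \mathrm{Ext}^1_{\mathbb Z}(\mathbb Z/2\mathbb Z, \mathbb Z/2\mathbb Z) \cong \mathbb Z/2\mathbb Z .
\]
% The two elements match the two extensions of Z/2Z by Z/2Z up to isomorphism:
%   0 -> Z/2Z -> Z/2Z (+) Z/2Z -> Z/2Z -> 0   (trivial; the 0 element),
%   0 -> Z/2Z -> Z/4Z          -> Z/2Z -> 0   (nontrivial).
```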


Exercises 


As usual, R denotes a fixed commutative ring. 

6.1. ▷ Prove that an R-module P is projective if and only if every epimorphism
$M \to P$ splits and Q is injective if and only if every monomorphism $Q \to M$ splits.
[§6.1]

6.2. Prove that the result of Proposition 5.13 holds more generally whenever P is
a projective module.

6.3. ¬ Prove that an R-linear map $A \to B$ is injective if and only if the induced
map $\mathrm{Hom}_R(B, Q) \to \mathrm{Hom}_R(A, Q)$ is surjective for all injective modules Q. (Hint:
R-Mod has enough injectives.) This strengthens the result of Exercise 5.2. [6.7]

6.4. ¬ Let $\{P_i\}_{i \in I}$ be a family of R-modules. Prove that $\bigoplus_{i \in I} P_i$ is projective if and
only if each $P_i$ is projective. [IX.5.4]

6.5. Prove that the dual of a projective module is projective. Prove that finitely
generated projective modules are reflexive (that is, isomorphic to their biduals).

6.6. ¬ If $f : S \to R$ is a ring homomorphism and P is a projective S-module, then
$f^*(P)$ is a projective R-module. [§6.3]

6.7. ▷ Give a proof of the fact that projective modules are flat, using adjunction.
(Hint: R-Mod has enough injectives, and Exercise 6.3.) [§6.2, 6.8]

6.8. Prove that Exercises 2.24 and 6.7 together imply the result of Exercise VI.5.5.

6.9. Prove that vector spaces are flat, injective, and projective. (Try to do this
directly, without invoking Baer’s criterion.)

6.10. Prove that a finitely generated module over a PID is free if and only if it is
projective if and only if it is flat. (Do this ‘by hand’, without invoking Exercise 6.12.)
6.11. ¬ Prove that a finitely generated module over a local ring is projective if and
only if it is free. (You have already done this, in Exercise VI.5.5.)

Besides PIDs and local rings, there are other classes of rings for which projective
and free modules coincide. Topological considerations suggested to Serre that
projective modules over a polynomial ring over a field should necessarily be free,
but it took two decades to prove that this is indeed the case. [6.12]

6.12. ¬ An R-module M is ‘locally free’ if $M_{\mathfrak p}$ is free as an $R_{\mathfrak p}$-module for every
prime ideal $\mathfrak p$ of R.

Prove that a finitely generated module over a Noetherian ring is locally free if
and only if it is projective if and only if it is flat. (Use the results of Exercises 2.24
and 6.11.)

The hypothesis of finite generation is necessary (cf. Exercise 6.13). [6.10]

6.13. ¬ Prove that ℚ is flat, but not projective, as a ℤ-module. [6.12]

6.14. ▷ Prove that a module over a PID is injective if and only if it is divisible.
(Use Baer’s criterion.) [§6.3]

6.15. ▷ Prove that $Q_1 \oplus Q_2$ is injective if and only if $Q_1$, $Q_2$ are both injective. More
generally, prove that if $\{Q_i\}_{i \in I}$ is a family of R-modules, then $\prod_{i \in I} Q_i$ is injective if
and only if each $Q_i$ is injective. [§6.3, IX.5.4]

6.16. Prove that ℤ ⊕ ℚ is flat as a ℤ-module but neither projective nor injective.

6.17. Prove that $\mathrm{Hom}_R(M, N) \cong \mathrm{Ext}^0_R(M, N)$, using any of the definitions provided
for $\mathrm{Ext}^0_R(M, N)$.

6.18. ▷ Prove that if R is a PID, then $\mathrm{Ext}^i_R(M, N) = 0$ for all $i \ge 2$ and all
R-modules M, N. [§6.5]

6.19. Let r be a non-zero-divisor in R, and let M be an R-module. Compute all
$\mathrm{Ext}^i_R(R/(r), M)$.

6.20. Let I be an ideal of R. Prove that $\mathrm{Ext}^1_R(R/I, R/I) \cong \mathrm{Hom}_R(I/I^2, R/I)$.
(Cf. Exercise 5.4. This says that this Ext module essentially computes the normal
bundle of an embedding.)

6.21. ¬ In Exercise 2.15 we have seen why Tor is called Tor. Why is Ext called Ext?
Let M, N, E be R-modules. An extension of M by N is an exact sequence

\[ (*) \qquad 0 \to N \to E \to M \to 0. \]

Two extensions are ‘isomorphic’ if there is a commutative diagram

\[
\begin{array}{ccccccccc}
0 & \to & N & \to & E & \to & M & \to & 0 \\
 & & \| & & \downarrow & & \| & & \\
0 & \to & N & \to & E' & \to & M & \to & 0
\end{array}
\]

linking them. (Note that, by the snake lemma, the middle arrow must then be an
isomorphism: cf. Exercise III.7.10.) Extensions that are isomorphic to the standard
sequence $0 \to N \to N \oplus M \to M \to 0$ are ‘trivial’.
Every extension $\mathcal E$ as above determines an element $e = e(\mathcal E) \in \mathrm{Ext}^1_R(M, N)$ as
follows. The sequence (*) induces a long exact sequence of Ext, i.e.,

\[ \cdots \to \mathrm{Hom}_R(E, N) \to \mathrm{Hom}_R(N, N) \to \mathrm{Ext}^1_R(M, N) \to \cdots, \]

and we let $e(\mathcal E)$ be the image in $\mathrm{Ext}^1_R(M, N)$ of the distinguished element
$\mathrm{id}_N \in \mathrm{Hom}_R(N, N)$.

Prove that if $\mathcal E$ and $\mathcal E'$ are isomorphic extensions, then $e(\mathcal E) = e(\mathcal E')$. Prove
that if $\mathcal E$ is a trivial extension, then $e(\mathcal E) = 0$. [6.22]

6.22. Exercise 6.21 teaches us that extensions determine elements of $\mathrm{Ext}^1_R$. We
get an even sharper statement by constructing an inverse to the map e: for every
element $\kappa \in \mathrm{Ext}^1_R(M, N)$, we will construct an extension $\mathcal E(\kappa)$ such that $e(\mathcal E(\kappa)) = \kappa$
and such that $\mathcal E(e(\mathcal E))$ is isomorphic to $\mathcal E$.

• Let F be any free module surjecting onto M, and let $\iota : K \hookrightarrow F$ be the kernel of
$\pi : F \to M$. Since F is free (hence projective), a piece of the long exact sequence
of Ext, i.e.,

\[ \cdots \to \mathrm{Hom}_R(K, N) \to \mathrm{Ext}^1_R(M, N) \to \mathrm{Ext}^1_R(F, N) = 0 \to \cdots \]

tells us that there exists a homomorphism $k : K \to N$ mapping to the element
$\kappa \in \mathrm{Ext}^1_R(M, N)$.

• We now have a monomorphism $(\iota, k) : K \to F \oplus N$. Let E be the cokernel
$(F \oplus N)/K$. Prove that the epimorphism $(\pi, 0) : F \oplus N \to M$ factors through
this cokernel, defining an epimorphism $E \to M$.

• Prove that the natural monomorphism $N = 0 \oplus N \hookrightarrow F \oplus N$ defines a monomorphism
$N \hookrightarrow E$, identifying N with the kernel of $E \to M$.

• We let $\mathcal E(\kappa)$ be the extension

\[ 0 \to N \to E \to M \to 0 \]

that we have obtained. Prove that different choices in the procedure lead to
isomorphic extensions.

• Prove that $\mathcal E(e(\mathcal E))$ is isomorphic to $\mathcal E$ and that $e(\mathcal E(\kappa)) = \kappa$.

The upshot is that there is a natural bijection between $\mathrm{Ext}^1_R(M, N)$ and the set
of isomorphism classes of extensions of M by N. Hence the name. ‘Higher’ $\mathrm{Ext}^i_R$
(i > 1) may also be treated similarly: for example, $\mathrm{Ext}^2_R(M, N)$ can be analyzed in
terms of two-step extensions consisting of exact sequences

\[ 0 \to N \to E_1 \to E_2 \to M \to 0, \]

where two such extensions are ‘isomorphic’ if there is a diagram

\[
\begin{array}{ccccccccccc}
0 & \to & N & \to & E_1 & \to & E_2 & \to & M & \to & 0 \\
 & & \| & & \downarrow & & \downarrow & & \| & & \\
0 & \to & N & \to & E_1' & \to & E_2' & \to & M & \to & 0
\end{array}
\]

This approach to Ext is attributed to Nobuo Yoneda.
Homological algebra


The limited exposure to the Tor and Ext functors given in §VIII.2 and §VIII.6
should assist the reader in occasional encounters with these functors. However, it is 
very worthwhile to go back, dotting all i’s and crossing all t’s as necessary to prove 
the wonderful facts responsible for the existence and the behavior of these functors. 

This is one motivation for the material covered in this chapter. Tor and Ext 
are examples of derived functors, a machinery that would give us access to several 
other important constructions, such as group or sheaf cohomology. Exploring any of 
these applications would take us too far, but understanding the basics of homological 
algebra, including a thorough look at derived functors, is in itself a very worthwhile 
goal. The material covered here should arm the reader with enough information to 
be able to absorb any application quickly when the time comes and the opportunity 
arises. All the wonderful mysteries about Tor and Ext stated in Chapter VIII will
be recovered as very particular cases of the results presented in this chapter. 

The natural context in which to play this game is that of abelian categories: 
these are categories carrying good notions of kernels and cokernels, with properties 
to which the reader is accustomed due to long exposure to the category R-Mod
of modules over a ring. In fact, elementary (and not so elementary) presentations
of homological algebra often take the standpoint that one may as well work with
R-Mod; allegedly, nothing of substance is lost by doing this rather than trying to
work in the more natural context of abelian categories. 

We are not completely convinced. This pedagogical device has strong theo- 
retical backing, provided by the Freyd-Mitchell embedding theorem: every small 
abelian category is equivalent to a full subcategory of the category of left-modules 
over a (not necessarily commutative) ring. However, we believe that some expo- 
sure to purely ‘arrow-theoretic’ arguments is intrinsically beneficial, and the reader 
should have the chance to acquire a taste for this type of reasoning. In any case, 
we dislike the idea of simply asking the reader to believe that ‘most of what follows 
works in a more general setting’, without providing some evidence for this fact. 
Therefore, we will take a little time to look at abelian categories before getting
into homological algebra proper; this will be done in the first two sections of this 
chapter, which an impatient reader could probably skip over without much harm. 
While we will not prove the Freyd-Mitchell embedding theorem, we will give a solid 
justification for the fact that we can indeed perform diagram chases by working with 
‘elements’, in the sense of (for example) checking whether a diagram is commutative 
or whether a given sequence is exact. Proving this will give us good practice in 
maneuvering arrow-theoretic arguments; once we are done with it, we can indeed 
develop homological algebra using conventional ‘element-theoretic’ arguments, with 
a relatively clean conscience. 

As for homological algebra proper, our general guiding principle will be to 
hunt for the ‘essence of (co)homology’: what information in a complex (see §III.7)
is really responsible for its homology. This will motivate most constructions in this 
chapter, including derived functors. A guiding beacon throughout the chapter will 
be the notion of derived category: we will not construct derived categories in full 
generality, but the reader should nevertheless emerge with a basic understanding 
of what they are. One reason for this compromise is that we felt that delving into 
triangulated categories would take us too far; they are only mentioned in passing
at the end of the chapter. Balancing Tor and Ext will motivate the introduction
of double complexes, and these in turn will motivate a quick treatment of spectral
sequences, in the last section.


1. (Un)necessary categorical preliminaries

1.1. Undesirable features of otherwise reasonable categories. The category
of modules over a ring R is the template example of an abelian category: a
category for which it makes sense to talk about ‘kernels’ and ‘cokernels’, ‘complexes’,
‘exactness’, etc., and in which convenient results such as the ‘snake lemma’
hold. We begin by trying to concretize these thoughts a little.

As motivation for the key definitions, we will first list a few annoying features
of some of the categories we have encountered.

• A category may not have initial or final objects; even if they are there, they 
may not be the same objects (as in Set). 

• Products, coproducts may not exist (if the product of $\mathbb F_2$ and $\mathbb F_3$ existed in Fld,
what would its characteristic be?).

• It may make no sense to talk about kernels and cokernels in a category; even
when it does make sense, they may not exist (kernels do not necessarily exist
in the category of finitely generated modules over a ring; cf. Example III.6.5).

• Even when there are kernels and cokernels, monomorphisms need not be kernels
(nonnormal subgroups are not kernels: remember §II.7.6), and epimorphisms
need not be cokernels (the injection $\mathbb Z \hookrightarrow \mathbb Q$ is an epimorphism in Ring;
see §III.2.3).

• A morphism may be a monomorphism and an epimorphism without being an 
isomorphism (Ring again, same example). 
Homological algebra will deal extensively with kernels, cokernels, exact sequences,
and the like: the material covered in §III.7 will be our starting point. That material
was developed for the category R-Mod of (left-)modules over a ring, for which
none of the annoying things listed above occurs: in R-Mod every epimorphism is
a cokernel, a map is an isomorphism if and only if it is a monomorphism and an
epimorphism, there is a zero-object (= both initial and final), and so on.

The notions of ‘additive’ and ‘abelian’ category extract the few key properties
that guarantee that no pathologies such as those listed above may occur. Working
in such categories ‘feels’ quite a bit like working in R-Mod, in the sense that all
expected properties of kernels and cokernels hold. The Freyd-Mitchell embedding 
theorem (see §2.4) will provide a mathematical reason supporting this psychological 
fact. 


1.2. Additive categories. 

Definition 1.1. A category $\mathsf A$ is additive if it has a zero-object, $\mathsf A$ has both finite
products and finite coproducts, and each set of morphisms $\mathrm{Hom}_{\mathsf A}(A, B)$ is endowed
with an abelian group structure, in such a way that the composition maps are
bilinear. A functor between two additive categories is additive if it preserves the
abelian group structures on Hom-sets.


None of the categories mentioned in the list in §1.1 is additive. Even Grp (which
has kernels and cokernels, products, etc.) is not an additive category: the Hom-sets
in Grp do not have a natural abelian group structure. Of course Ab is additive, and
so are all categories R-Mod.

If a category $\mathsf A$ is additive, it makes sense to talk about ‘zero-morphisms’ (which
we will denote by 0) and to add or subtract morphisms; two morphisms $\alpha$, $\beta$ are
equal if and only if $\alpha - \beta = 0$. One can also talk about kernels and cokernels in
$\mathsf A$, by adopting the categorical definitions as suitable limits and colimits
(Example VIII.1.11). Here are those definitions again, expanded for intelligibility:


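Definition 1.2. Let $\varphi : A \to B$ be a morphism in an additive category $\mathsf A$. A
morphism $\iota : K \to A$ is a kernel of $\varphi$ if $\varphi \circ \iota = 0$ and for all morphisms $\zeta : Z \to A$
such that $\varphi \circ \zeta = 0$ there exists a unique $\tilde\zeta : Z \to K$ making the diagram

\[
\begin{array}{ccccc}
Z & & & & \\
{\scriptstyle \exists!\,\tilde\zeta}\downarrow & \searrow{\scriptstyle \zeta} & & & \\
K & \xrightarrow{\;\iota\;} & A & \xrightarrow{\;\varphi\;} & B
\end{array}
\qquad (\varphi \circ \zeta = 0)
\]

commute. A morphism $\pi : B \to C$ is a cokernel of $\varphi$ if $\pi \circ \varphi = 0$ and for all
morphisms $\beta : B \to Z$ such that $\beta \circ \varphi = 0$ there exists a unique $\tilde\beta : C \to Z$ making
the diagram

\[
\begin{array}{ccccc}
A & \xrightarrow{\;\varphi\;} & B & \xrightarrow{\;\pi\;} & C \\
 & & & {\scriptstyle \beta}\searrow & \downarrow{\scriptstyle \exists!\,\tilde\beta} \\
 & & & & Z
\end{array}
\qquad (\beta \circ \varphi = 0)
\]

commute.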
Here is the same in sound-bite format:

If $\varphi \circ \zeta = 0$, then $\zeta$ factors uniquely through $\ker \varphi$.

If $\beta \circ \varphi = 0$, then $\beta$ factors uniquely through $\operatorname{coker} \varphi$.

We have to get used to the fact that morphisms may have ‘many’ kernels (or
cokernels), uniquely identified with each other by virtue of being answers to a
universal question (Proposition I.5.4); it is common to harmlessly abuse language
and talk about the kernel and the cokernel of a morphism. Further, we have to get
used to the fact that the kernel $\iota : K \to A$ of a morphism $A \to B$ is a morphism. In
a category such as Ab we are used to thinking of the kernel as a subobject of A, but
this is really nothing but the datum of an inclusion map: the kernel is really that
map. In an arbitrary category one cannot talk about ‘subobjects’ or ‘inclusions’;
the closest one can get to these notions is monomorphism. Similarly, ‘surjective’ is
not really an option; epimorphism is the appropriate replacement. Recall (§I.2.6)
that $\varphi : A \to B$ is a monomorphism if for all parallel morphisms $\zeta_1, \zeta_2 : Z \to A$,

\[ Z \underset{\zeta_2}{\overset{\zeta_1}{\rightrightarrows}} A \xrightarrow{\;\varphi\;} B, \]

$\varphi \circ \zeta_1 = \varphi \circ \zeta_2 \implies \zeta_1 = \zeta_2$. It is an epimorphism if for all parallel
$\beta_1, \beta_2 : B \to Z$,

\[ A \xrightarrow{\;\varphi\;} B \underset{\beta_2}{\overset{\beta_1}{\rightrightarrows}} Z, \]

$\beta_1 \circ \varphi = \beta_2 \circ \varphi \implies \beta_1 = \beta_2$. One benefit of working in an additive category is
that these definitions simplify a little:

Lemma 1.3. A morphism $\varphi : A \to B$ in an additive category is a monomorphism
if and only if for all $\zeta : Z \to A$,

\[ \varphi \circ \zeta = 0 \implies \zeta = 0. \]

It is an epimorphism if and only if for all $\beta : B \to Z$,

\[ \beta \circ \varphi = 0 \implies \beta = 0. \]


Proof. This is simply because two morphisms with the same source and target 
are equal if and only if their difference in the corresponding Hom-set (which is an 
abelian group by hypothesis) is 0. □ 
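
In symbols, for the monomorphism half (a sketch; the only ingredient is the bilinearity of composition required in the definition of additive category):

```latex
\[
  \varphi \circ \zeta_1 = \varphi \circ \zeta_2
  \iff \varphi \circ \zeta_1 - \varphi \circ \zeta_2 = 0
  \iff \varphi \circ (\zeta_1 - \zeta_2) = 0 ,
\]
% so if \varphi \circ \zeta = 0 forces \zeta = 0, then \zeta_1 - \zeta_2 = 0, i.e., \zeta_1 = \zeta_2;
% conversely, taking \zeta_2 = 0 in the definition of monomorphism recovers the stated condition.
```

The epimorphism half is obtained by composing on the other side.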

Are kernels necessarily monomorphisms? Yes: 

Lemma 1.4. In any additive category, kernels are monomorphisms and cokernels
are epimorphisms.

We will run into several such ‘dual’ statements, and we will give the proof for 
one half, leaving the other half to the reader; as a rule, the two proofs mirror each 
other closely. There is a good reason for this: the opposite (cf. Exercise I.3.1) of
an additive category is additive, and kernels, monomorphisms, etc., in one corre- 
spond to cokernels, epimorphisms, etc., in the other. Thus, proving one of these 
statements for all additive categories establishes at the same time the truth of its 
dual statement. 
However, going through the motions necessary to produce stand-alone proofs
of the dual statements makes for good practice, and we invite the reader to work 
out the corresponding exercises at the end of this section. It is invariably the case 
that these arguments are relatively easy to understand and picture in one’s mind 
but rather awkward to write down precisely. This seems to be a feature of many 
arrow-theoretic arguments and probably reflects inveterate biases acquired by early 
exposure to Set. 


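Proof. Let $\varphi : A \to B$ be a morphism in an additive category $\mathsf A$, and let
$\operatorname{coker} \varphi : B \to C$ be its cokernel. Let $\gamma : C \to Z$ be a morphism such that
$\gamma \circ \operatorname{coker} \varphi = 0$. The composition $(\gamma \circ \operatorname{coker} \varphi) \circ \varphi$ is 0; by definition of cokernel,
$\gamma \circ \operatorname{coker} \varphi$ factors uniquely through C:

\[
\begin{array}{ccccc}
A & \xrightarrow{\;\varphi\;} & B & \xrightarrow{\;\operatorname{coker} \varphi\;} & C \\
 & & & {\scriptstyle \gamma \,\circ\, \operatorname{coker} \varphi}\searrow & \downarrow{\scriptstyle \gamma} \\
 & & & & Z
\end{array}
\]

Since $\gamma \circ \operatorname{coker} \varphi = 0 = 0 \circ \operatorname{coker} \varphi$, the uniqueness forces $\gamma = 0$. This proves that
$\operatorname{coker} \varphi$ is an epimorphism, by Lemma 1.3.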

The proof that kernels are monomorphisms is analogous and is left to the reader 
(Exercise 1.9). □ 


Certain limits are guaranteed to exist in an additive category: finite products 
and coproducts do. On the other hand, kernels and cokernels (which are also limits) 
do not necessarily exist in an additive category. For example, the category of finitely 
generated modules over a ring is additive, but it does not have kernels in general 
(essentially because a submodule of a finitely generated module is not necessarily 
finitely generated). But as soon as a morphism $\varphi$ does have kernels or cokernels
in an additive category, the basic qualities of that morphism can be detected ‘as
usual’ by looking at $\ker \varphi$ and $\operatorname{coker} \varphi$. For example,

Lemma 1.5. Let $\varphi : A \to B$ be a morphism in an additive category. If $\varphi$ has a
kernel, then $\varphi$ is a monomorphism if and only if $0 \to A$ is its kernel. If $\varphi$ has a
cokernel, then $\varphi$ is an epimorphism if and only if $B \to 0$ is its cokernel.

Compare this statement with Proposition III.6.2!

Proof. Let’s do kernels this time. 

First assume $\varphi : A \to B$ is a monomorphism with a kernel $\iota : K \to A$. In
particular, $\varphi \circ \iota : K \to A \to B$ is the zero-morphism; therefore $\iota = 0$ by Lemma 1.3.
What about K? If $\zeta : Z \to A$ is any morphism such that the composition
$Z \to A \to B$ is 0, then $\zeta$ factors through $\iota = 0$ since $\iota$ is the kernel of $\varphi$. It follows that
$\zeta = 0$ and in particular that it factors (uniquely) through $0 \to A$. This proves that
$0 \to A$ is a kernel of $\varphi$, as stated.

Conversely, assume that $0 \to A$ is a kernel for $\varphi : A \to B$, and let $\zeta : Z \to A$
be a morphism such that $\varphi \circ \zeta = 0$. It follows that $\zeta$ factors through $0 \to A$, since
the latter is a kernel for $\varphi$:

\[
\begin{array}{ccccc}
Z & \xrightarrow{\;\zeta\;} & A & \xrightarrow{\;\varphi\;} & B \\
\downarrow & \nearrow & & & \\
0 & & & &
\end{array}
\]

This implies $\zeta = 0$, proving that $\varphi$ is a monomorphism.

The statement about epimorphisms and cokernels is left to the reader (Exer- 
cise 1.9). □ 

In view of Lemma 1.5, we should be able to use diagrams

\[ 0 \to A \to B, \qquad A \to B \to 0 \]

to signal that $A \to B$ is a monomorphism, resp., an epimorphism: think ‘exact’.
This is where we are heading, but the fact that kernels and cokernels do not nec- 
essarily exist prevents us from effectively doing this in a category that is ‘only’ 
additive. 

Incidentally, it is common to denote monomorphisms and epimorphisms by 
suitably decorated arrows; popular choices are $\rightarrowtail$ and $\twoheadrightarrow$, respectively.

1.3. Abelian categories. The moral at this point is that if a morphism in an 
additive category has kernels and cokernels, then these will behave as expected. But 
kernels and cokernels do not necessarily exist, and this prevents us from going much 
further. Also, while (as we have seen) kernels are monomorphisms and cokernels are 
epimorphisms in an additive category, there is no guarantee that monomorphisms 
should necessarily be kernels and epimorphisms should be cokernels. In the end,
we simply demand these additional features explicitly. 

Definition 1.6. An additive category $\mathsf{A}$ is abelian if kernels and cokernels exist in
$\mathsf{A}$; every monomorphism is the kernel of some morphism; and every epimorphism
is the cokernel of some morphism. ⌟

As mentioned already, $R$-Mod is an abelian category, for every ring $R$. The
prototype of an abelian category is Ab: this¹ is why these categories are called
abelian. 

Since kernels are necessarily monomorphisms (by Lemma 1.4), we see that in 
an abelian category we can adopt a mantra entirely analogous to the useful ‘kernel 
$\iff$ submodule’ of §III.5.3 vintage: in abelian categories, the slogan would be
‘kernel $\iff$ monomorphism’ (and similarly for cokernels vs. epimorphisms).

Remark 1.7. Just as it is convenient to think of monomorphisms $A \rightarrowtail B$ as defining
$A$ as a ‘subobject’ of $B$, it is occasionally convenient to think of epimorphisms
as ‘quotients’: if $\varphi : A \rightarrowtail B$ is a monomorphism, we can use $B/A$ to denote (the
target of) $\operatorname{coker}\varphi$. We will have no real use for this notation in this section, but it
will come in handy later on. ⌟


1 There is nothing particularly ‘commutative’ about an abelian category.

The very existence of kernels and cokernels links these two notions tightly in
an abelian category: 

Lemma 1.8. In an abelian category $\mathsf{A}$, every kernel is the kernel of its cokernel;
every cokernel is the cokernel of its kernel.


Proof. We will prove the second half and leave the first half to the reader (Exer- 
cise 1.9). 

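Let $\varphi : A \to B$ be the cokernel of some morphism $Z \to A$; since $\mathsf{A}$ is abelian,
$\varphi$ has a kernel $\iota : K \to A$. The composition $Z \to A \to B$ is $0$, so $Z \to A$ factors
through $\iota$ by definition of kernel:

\[
\begin{tikzcd}
Z \arrow[rr] \arrow[dr, dashed, "\exists!"'] & & A \arrow[r, "\varphi"] & B \\
& K \arrow[ur, "\iota"'] & &
\end{tikzcd}
\]

Now let $A \to C$ be a morphism such that the composition $K \to A \to C$ is the
zero-morphism; then so is the composition $Z \to A \to C$. Therefore $A \to C$ factors
through a unique morphism $B \to C$,

\[
\begin{tikzcd}
& & & C \\
Z \arrow[rr] \arrow[dr] & & A \arrow[r, "\varphi"'] \arrow[ur] & B \arrow[u, dashed, "\exists!"'] \\
& K \arrow[ur, "\iota"'] & &
\end{tikzcd}
\]

since $\varphi$ is the cokernel of $Z \to A$. But this shows that $\varphi : A \to B$ satisfies the
property defining the cokernel of its kernel $K \to A$, as stated. □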


Putting together Lemma 1.8 and Lemma 1.5, we can rephrase Definition 1.6 
by listing the following requirements on a category A: 

• A is additive; 

• kernels and cokernels exist in A; 

• if $\varphi : A \to B$ is a morphism whose kernel is $0$, then $\varphi$ is the kernel of its
cokernel;

• if $\psi : B \to C$ is a morphism whose cokernel is $0$, then $\psi$ is the cokernel of its
kernel.

This is a popular equivalent reformulation of the definition of abelian category. The 
last two requirements should call to mind the exact sequence 

\[ 0 \longrightarrow A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C \longrightarrow 0 \]

familiar from the $R$-Mod context. The reader can already entertain the sense in
which such a sequence can be ‘exact’ in an abelian category: the third requirement
identifies $A$ (or, rather, $A \to B$) with the kernel of $\psi$, and the fourth one identifies $C$
with the cokernel of $\varphi$.
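
As a concrete instance of these two identifications, here is a minimal worked check in Ab. The particular sequence (multiplication by 2 on $\mathbb{Z}$) is chosen here only for illustration and does not come from the text.

```latex
% Illustrative example (our choice): multiplication by 2 on the integers.
\[
0 \longrightarrow \mathbb{Z} \xrightarrow{\ \varphi = 2\cdot\ } \mathbb{Z}
  \xrightarrow{\ \psi\ } \mathbb{Z}/2\mathbb{Z} \longrightarrow 0,
\qquad \psi(n) = n \bmod 2 .
\]
% Third requirement: \varphi has kernel 0 and is the kernel of its cokernel \psi.
\[
\ker\varphi = 0, \qquad \operatorname{coker}\varphi = \psi, \qquad
\ker\psi = 2\mathbb{Z} = \varphi(\mathbb{Z}).
\]
% Fourth requirement: \psi has cokernel 0 and is the cokernel of its kernel.
\[
\operatorname{coker}\psi = 0, \qquad
\mathbb{Z}/\ker\psi = \mathbb{Z}/2\mathbb{Z}.
\]
```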
Now suppose that $A \to B$ is sandwiched between zeros:

\[ 0 \longrightarrow A \longrightarrow B \longrightarrow 0 \]

in an exact sequence. One of our many Pavlovian reactions would make us want
to deduce that $A \to B$ is an isomorphism, because this is the case in $R$-Mod, and
this indeed works in any abelian category:

Lemma 1.9. Let $\varphi : A \to B$ be a morphism in an abelian category $\mathsf{A}$, and assume
that $\varphi$ is both a monomorphism and an epimorphism. Then $\varphi$ is an isomorphism.

Remark 1.10. This fact does not hold in general additive categories. In fact, 
the reader will encounter in Exercise 3.4 an example of an additive category with 
kernels and cokernels (but nevertheless not abelian) and with morphisms that are 
both monomorphisms and epimorphisms, without being isomorphisms. ⌟

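Proof. The kernel of $\varphi$ exists (because $\mathsf{A}$ is abelian), and hence it is $0 \to A$ (by
Lemma 1.5, since $\varphi$ is a monomorphism). Similarly, $B \to 0$ is a cokernel of $\varphi$.
Further, $\varphi$ is the cokernel of $0 \to A$ and the kernel of $B \to 0$, by Lemma 1.8.

Now consider the identity $B \to B$:

\[
\begin{tikzcd}
& & B \arrow[d, "\mathrm{id}_B"] & \\
0 \arrow[r] & A \arrow[r, "\varphi"'] & B \arrow[r] & 0
\end{tikzcd}
\]

Since $B \to B \to 0$ is (trivially) the zero morphism and $\varphi$ is the kernel of $B \to 0$,
we obtain a unique morphism $\psi : B \to A$ making the diagram commute:

\[
\begin{tikzcd}
& & B \arrow[d, "\mathrm{id}_B"] \arrow[dl, dashed, "\psi"'] & \\
0 \arrow[r] & A \arrow[r, "\varphi"'] & B \arrow[r] & 0
\end{tikzcd}
\]

As $\varphi\psi = \mathrm{id}_B$, this shows that $\varphi$ has a right-inverse. Similarly, consider the identity
$A \to A$, as follows:

\[
\begin{tikzcd}
& A & & \\
0 \arrow[r] & A \arrow[r, "\varphi"'] \arrow[u, "\mathrm{id}_A"] & B \arrow[r] & 0
\end{tikzcd}
\]

The composition $0 \to A \to A$ is the zero morphism, and $\varphi$ is the cokernel of $0 \to A$,
so we have a unique morphism $\eta : B \to A$ making this diagram commute:

\[
\begin{tikzcd}
& A & & \\
0 \arrow[r] & A \arrow[r, "\varphi"'] \arrow[u, "\mathrm{id}_A"] & B \arrow[r] \arrow[ul, dashed, "\eta"'] & 0
\end{tikzcd}
\]

This says $\eta\varphi = \mathrm{id}_A$, so $\varphi$ has a left-inverse as well.

Thus, $\varphi$ has both a left-inverse $\eta$ and a right-inverse $\psi$. It follows that $\eta = \psi$
is a two-sided inverse of $\varphi$ and that $\varphi$ is an isomorphism as promised (cf. Proposition I.4.2). □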
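
The last step of this proof, that a left-inverse and a right-inverse must agree, is the one-line computation below. It is a sketch of the standard argument and uses only associativity of composition and the two identities established in the proof.

```latex
% \eta is a left-inverse of \varphi, \psi is a right-inverse of \varphi.
\begin{align*}
\eta &= \eta\circ\mathrm{id}_B = \eta\circ(\varphi\circ\psi) \\
     &= (\eta\circ\varphi)\circ\psi = \mathrm{id}_A\circ\psi = \psi .
\end{align*}
```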
The reader should note the arrow-theoretic nature of this argument. In the
category $R$-Mod we could have given the following (possibly) simpler argument:
by Proposition III.6.2, monomorphisms are injective and epimorphisms are surjec- 
tive; therefore <p is bijective, and bijective homomorphisms are isomorphisms by 
Exercise III.5.12. Such set-theoretic arguments are not an option in an arbitrary 
abelian category (at least until we develop the material of §2), since the objects of 
an abelian category are not given as ‘sets’. Judicious use of appropriate universal 
properties accomplishes the same goal and may be argued to convey a ‘deeper’ sense 
of why the proven statement is true. 


1.4. Products, coproducts, and direct sums. We will denote² the product
of two objects $A$, $B$ of an abelian category $\mathsf{A}$ by $A \times B$, and we will denote their
coproduct by $A \amalg B$. Both exist, since $\mathsf{A}$ is additive.

The presence of these objects, and of kernels and cokernels, gives us access to 
other interesting constructions. 


Example 1.11. For instance, fibered products (or ‘pull-backs’) exist in any abelian
category, just as in $R$-Mod (cf. Exercise III.6.10). Consider a diagram

\[
\begin{tikzcd}
& B \arrow[d, "\psi"] \\
A \arrow[r, "\varphi"'] & C
\end{tikzcd}
\]

in an abelian category. The fibered product of $A$ and $B$ over $C$ is an object $A \times_C B$
with morphisms to $A$ and $B$, completing the commutative diagram

\[
\begin{tikzcd}
A \times_C B \arrow[r, "\varphi'"] \arrow[d, "\psi'"'] \arrow[dr, phantom, "\square"] & B \arrow[d, "\psi"] \\
A \arrow[r, "\varphi"'] & C
\end{tikzcd}
\]

and final with this property (that is what the small square in the middle of the
diagram is supposed to indicate). The fibered product may be constructed in this
context as the kernel of the difference of the two morphisms

\[
\begin{tikzcd}
A \times B \arrow[r, shift left, "\varphi\circ\pi_A"] \arrow[r, shift right, "\psi\circ\pi_B"'] & C
\end{tikzcd}
\]

(just as in the particular case of $R$-Mod). The reader will prove that these ‘fiber
squares’ preserve kernels, in the sense that $\ker\varphi = \psi' \circ \ker\varphi'$ (Exercise 1.16).

Similarly, fibered coproducts (a.k.a. push-outs) may be constructed as cokernels
of coproducts and preserve cokernels. ⌟
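
In $R$-Mod the kernel construction above unwinds to the familiar set of compatible pairs. The sketch below records this and works one small case over $\mathbb{Z}$; the specific maps (multiplication by 2 and by 3) are our choice for illustration, and the names $\varphi$, $\psi$, $\varphi'$, $\psi'$, $\pi_A$, $\pi_B$ are used as in Example 1.11.

```latex
% Explicit description in R-Mod (standard).
\[
A \times_C B \;=\; \ker\bigl(\varphi\circ\pi_A - \psi\circ\pi_B\bigr)
 \;=\; \{(a,b)\in A\times B \;:\; \varphi(a)=\psi(b)\}.
\]
% Illustration (our choice): A = B = C = \mathbb{Z}, \varphi = 2\cdot, \psi = 3\cdot.
\[
\mathbb{Z}\times_{\mathbb{Z}}\mathbb{Z}
 = \{(a,b) : 2a = 3b\} = \{(3k,2k) : k\in\mathbb{Z}\}\cong\mathbb{Z},
\qquad \psi'(3k,2k)=3k,\quad \varphi'(3k,2k)=2k .
\]
% Kernels match, as the fiber square predicts:
\[
\ker\varphi = 0, \qquad \ker\varphi' = \{(3k,2k) : 2k = 0\} = 0 .
\]
```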


In fact, in $R$-Mod such constructions are simplified by the fact that finite products
and coproducts coincide, in the sense that the direct sum of two modules satisfies
both universal requirements for products and coproducts: Proposition III.6.1.


2 As we will see, the single notation $A \oplus B$ will be an appropriate substitute for both notations.

But the direct sum of two modules $M_1$, $M_2$ is defined by giving a module structure
to the Cartesian product $M_1 \times M_2$. Again, this strategy cannot be applied
in a general abelian category, so we have to come up with an alternative. This
gives us another example of the contrast between set-theoretic and arrow-theoretic
arguments.

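The coproduct $A \amalg B$ is endowed with morphisms

\[ \iota_A : A \to A \amalg B, \qquad \iota_B : B \to A \amalg B, \]

which are easily checked to be monomorphisms (do this!). In an arbitrary category,
natural morphisms $A \amalg B \to A$ and $A \amalg B \to B$ are not available (example: Set).
In an abelian category (and in fact already in an additive category) we do have
such morphisms: map $A \to A$ by the identity and $B \to A$ by the zero-morphism
to obtain a unique morphism $\pi_A : A \amalg B \to A$, making the following diagram
commute:

\[
\begin{tikzcd}
A \arrow[r, "\iota_A"] \arrow[dr, "\mathrm{id}_A"'] & A \amalg B \arrow[d, dashed, "\pi_A"] & B \arrow[l, "\iota_B"'] \arrow[dl, "0"] \\
& A &
\end{tikzcd}
\]

You can similarly define a morphism $\pi_B : A \amalg B \to B$. It follows (by the universal
property of products) that there is a unique morphism from $A \amalg B$ to $A \times B$ making
the following diagram commute:

\[
\begin{tikzcd}
& A \amalg B \arrow[dl, "\pi_A"'] \arrow[d, dashed, "\exists!"] \arrow[dr, "\pi_B"] & \\
A & A \times B \arrow[l] \arrow[r] & B
\end{tikzcd}
\]

Now we claim that this morphism $(\pi_A, \pi_B) : A \amalg B \to A \times B$ is both a monomorphism
and an epimorphism. By Lemma 1.9, this (and a simple induction) will
prove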

Proposition 1.12. In an abelian category, finite products and coproducts coincide. 

Proving Proposition 1.12 is a good exercise in manipulating arrows and univer- 
sal properties. We will quickly review the ingredients going into the proof that the 
morphism $(\pi_A, \pi_B)$ is a monomorphism. Similar arguments, proving that the same
morphism is an epimorphism, will be left to the reader.

In the process, we recover for these objects all the properties we expect on the 
basis of our experience with $R$-Mod. Doing it ‘with arrows’ requires a bit of practice
but is not particularly difficult. 

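• $\pi_B : A \amalg B \to B$ is the cokernel of $\iota_A : A \to A \amalg B$:

Indeed, suppose $A \to A \amalg B \to C$ is the zero-morphism; stare at

\[
\begin{tikzcd}
A \arrow[r, "\iota_A"] & A \amalg B \arrow[r, "\gamma"] \arrow[d, shift right, "\pi_B"'] & C \\
& B \arrow[u, shift right, "\iota_B"'] &
\end{tikzcd}
\]

and keep in mind that $\pi_B$ is determined by the requirements that $\pi_B \circ \iota_A = 0$ and
$\pi_B \circ \iota_B = \mathrm{id}_B$. We have to show that $\gamma$ factors uniquely through $\pi_B$. Note that if
it factors at all, then the factorization is unique: indeed, if $\gamma = \delta \circ \pi_B$, then

\[ \delta = \delta \circ (\pi_B \circ \iota_B) = (\delta \circ \pi_B) \circ \iota_B = \gamma \circ \iota_B . \]

So the only morphism that can work is $\gamma \circ \iota_B$. Therefore, consider $(\gamma \circ \iota_B) \circ \pi_B$.

- $(\gamma \circ \iota_B) \circ \pi_B \circ \iota_A = 0 = \gamma \circ \iota_A$ (since $\pi_B \circ \iota_A = 0$); and

- $(\gamma \circ \iota_B) \circ \pi_B \circ \iota_B = \gamma \circ \iota_B$ (since $\pi_B \circ \iota_B = \mathrm{id}_B$).

By the universal property of coproducts, it follows that $(\gamma \circ \iota_B) \circ \pi_B = \gamma$, proving
that $\gamma$ indeed factors uniquely through $\pi_B$. This confirms that $\pi_B : A \amalg B \to B$ is
the cokernel of $\iota_A$.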

• $\iota_A : A \to A \amalg B$ is the kernel of $\pi_B : A \amalg B \to B$:

Indeed, $\iota_A$ is a monomorphism since $A \xrightarrow{\iota_A} A \amalg B \xrightarrow{\pi_A} A$ is the identity;
hence it is a kernel (definition of abelian category!); hence it is the kernel of its
cokernel (Lemma 1.8).

By the same token, $\pi_A : A \amalg B \to A$ is the cokernel of $\iota_B : B \to A \amalg B$ and
$\iota_B : B \to A \amalg B$ is the kernel of $\pi_A : A \amalg B \to A$.

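• The morphism $(\pi_A, \pi_B) : A \amalg B \to A \times B$ constructed above is a monomorphism:

Consider the kernel $\iota$ of $(\pi_A, \pi_B)$. Composing with the projection to $B$ gives
the zero-morphism:

\[
\begin{tikzcd}
K \arrow[r, "\iota"] \arrow[rr, bend right, "0"'] & A \amalg B \arrow[r, "\pi_B"] & B
\end{tikzcd}
\]

It follows that $\iota$ factors through the kernel of $\pi_B$, that is, through $\iota_A$, and through $\iota_B$,
by the same token. Therefore we have the following commutative diagram:

\[
\begin{tikzcd}
& A \arrow[dr, "\iota_A"] & \\
K \arrow[ur] \arrow[dr] \arrow[rr, "\iota"] & & A \amalg B \\
& B \arrow[ur, "\iota_B"'] &
\end{tikzcd}
\]

The composition $K \to A \to A \amalg B \to B$ is zero (because $\pi_B \circ \iota_A = 0$); therefore so
is the composition $K \to B \to B$. That is, $K \to B$ is actually the zero-morphism;
a fortiori, $\iota = 0$: the kernel of $A \amalg B \to A \times B$ is $0$, so this morphism is indeed a
monomorphism (by Lemma 1.5).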
The same technique (or an appeal to the opposite category) proves that

• the morphism $(\pi_A, \pi_B) : A \amalg B \to A \times B$ constructed above is an epimorphism.

We will happily leave this to the reader (Exercise 1.17).

This will conclude the proof of Proposition 1.12, by Lemma 1.9. □ 

In view of Proposition 1.12, we can adopt for any abelian category $\mathsf{A}$ the convention
of identifying $A \amalg B$ and $A \times B$, calling this object (defined up to isomorphism)
the direct sum of $A$ and $B$, denoted $A \oplus B$. This object comes endowed with natural
morphisms

\[ A \rightarrowtail A \oplus B, \qquad B \rightarrowtail A \oplus B, \qquad A \oplus B \twoheadrightarrow A, \qquad A \oplus B \twoheadrightarrow B, \]

satisfying all the properties to which we have grown accustomed.
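
Writing $\iota_A$, $\iota_B$, $\pi_A$, $\pi_B$ for these four morphisms (the names used in the proof of Proposition 1.12), the accustomed properties can be summarized as follows. This is a sketch: the first four identities are the defining relations of $\pi_A$ and $\pi_B$, while the last is the standard identity holding in any additive category, recorded here without proof.

```latex
\begin{align*}
\pi_A\circ\iota_A &= \mathrm{id}_A, & \pi_B\circ\iota_B &= \mathrm{id}_B, \\
\pi_A\circ\iota_B &= 0,             & \pi_B\circ\iota_A &= 0, \\
\iota_A\circ\pi_A + \iota_B\circ\pi_B &= \mathrm{id}_{A\oplus B}. &&
\end{align*}
```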

1.5. Images; canonical decomposition of morphisms. Yet another example 
of the application of an arrow-theoretic perspective is the existence of ‘canonical 
decompositions’ of morphisms in any abelian category, in terms completely analogous
to the decompositions studied in Set (§I.2.8), Grp (§II.8.1), Ring (§III.3.3),
$R$-Mod (§III.5.4). In each of these previous examples we could simply argue that
the decomposition was the original decomposition obtained in Set, enriched as the 
case may be with additional structures adapted to the category under examination. 
This is not an option in the context of an abelian category, and yet the decomposi- 
tion holds just as well; every morphism may be written as an epimorphism, followed 
by an isomorphism, followed by a monomorphism. Here is the precise statement: 

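Theorem 1.13. Every morphism $\varphi : A \to B$ in an abelian category $\mathsf{A}$ may be
decomposed as

\[ \varphi : A \twoheadrightarrow C \xrightarrow[\ \sim\ ]{\ \tilde\varphi\ } K \rightarrowtail B \]

where $A \to C$ is the cokernel of the kernel of $\varphi$ and $K \to B$ is the kernel of the
cokernel of $\varphi$. The induced morphism $\tilde\varphi : C \to K$ is uniquely determined and is an
isomorphism.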

This theorem will follow from useful related considerations. Comparing the
statement with its set-theoretic counterpart suggests that

\[ \ker(\operatorname{coker}\varphi) : K \rightarrowtail B \]

should play the role of the ‘image of $\varphi$’. Again note that the set-theoretic definition
of ‘image’ is not an option in the general context we are exploring: unlike groups,
rings, etc., the objects of a general abelian category are not ‘just’ sets endowed with
extra structure³. But look at your mental image of what the image of a function $\varphi$
is; ours looks like this:


3 However, we will soon recover a sense in which images in abelian categories can be ‘under- 
stood’ in set-theoretic terms: see Lemma 2.7. 
[Figure: $A$ mapping to $B$ via $\varphi$.]

We can fit many ‘subobjects’ in $B$ to which $A$ maps via $\varphi$:

[Figure: $A$ mapping via $\varphi$ into several subobjects of $B$.]

and $\operatorname{im}\varphi$ is the ‘smallest’ such subobject; that is, it is initial with the property of
being a subobject of $B$ through which $\varphi$ factors. This intuition is precisely captured
by $\ker(\operatorname{coker}\varphi)$, once we adapt to thinking of ‘subobjects’ as ‘monomorphisms’:

Lemma 1.14. Let $\varphi : A \to B$ be a morphism in an abelian category, and let
$\iota : K \to B$ be the kernel of the cokernel of $\varphi$. Then

• $\iota$ is a monomorphism;

• $\varphi$ factors through $\iota$; and

• $\iota$ is initial with these properties.


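By Lemma 1.4, $\iota$ is a monomorphism. It is clear that $\varphi$ factors through $\iota$:
the composition $A \to B \to \operatorname{coker}\varphi$ is the zero-morphism, so there is a naturally
induced $A \to K$ by the universal property of kernels. The more interesting part
of the statement of Lemma 1.14 asserts that if $\lambda : L \rightarrowtail B$ is any monomorphism
through which $\varphi$ also factors, then $\iota$ must factor uniquely through $\lambda$:

\[
\begin{tikzcd}
& K \arrow[dr, tail, "\iota"] \arrow[dd, dashed, "\exists!"] & \\
A \arrow[ur] \arrow[dr] \arrow[rr, "\varphi" near end] & & B \\
& L \arrow[ur, tail, "\lambda"'] &
\end{tikzcd}
\]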


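Proof. Consider $\operatorname{coker}\lambda$:

\[
\begin{tikzcd}
A \arrow[r, "\varphi"] \arrow[dr] & B \arrow[r, "\operatorname{coker}\lambda"] & \mathrm{Cok}_\lambda \\
& L \arrow[u, tail, "\lambda"'] &
\end{tikzcd}
\]

Since $\varphi$ factors through $\lambda$, the composition $A \to B \to \mathrm{Cok}_\lambda$ is $0$; by the universal
property of $\operatorname{coker}\varphi$, we have an induced map:

\[
\begin{tikzcd}
A \arrow[r, "\varphi"] & B \arrow[r, "\operatorname{coker}\varphi"] \arrow[dr, "\operatorname{coker}\lambda"'] & \mathrm{Cok}_\varphi \arrow[d, dashed, "\exists!"] \\
& & \mathrm{Cok}_\lambda
\end{tikzcd}
\]

Since $K \to \mathrm{Cok}_\varphi$ is the zero-morphism, this implies that

\[ K \overset{\iota}{\rightarrowtail} B \xrightarrow{\ \operatorname{coker}\lambda\ } \mathrm{Cok}_\lambda \]

is the zero-morphism. Since $\lambda : L \to B$ is the kernel of its cokernel (Lemma 1.8), it
follows that there is a unique morphism $K \to L$ making the diagram commute, as
stated. □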


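Definition 1.15. Let $\varphi : A \to B$ be a morphism in an abelian category. The
image of $\varphi$, denoted $\operatorname{im}\varphi$, is $\ker(\operatorname{coker}\varphi)$. The coimage of $\varphi$, denoted $\operatorname{coim}\varphi$, is
$\operatorname{coker}(\ker\varphi)$. ⌟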


Lemma 1.14 motivates the first definition: $\operatorname{im}\varphi$ factors uniquely through every
monomorphism through which $\varphi$ factors. (Of course in this context the source of
$\operatorname{im}\varphi$ is only defined up to (unique) isomorphism, as is every solution to a universal
problem.) As for the second definition, the reader will formulate the universal
property satisfied by $\operatorname{coim}\varphi$ (Exercise 1.19). With the convention mentioned in
Remark 1.7 and allowing for some abuse of language, the (target of the) coimage
of $\varphi : A \to B$ is the ‘quotient’ $A/\ker\varphi$.

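Our mental image, and surely yours, suggests that if $K \to B$ is the image of $\varphi$,
then the induced morphism $\bar\varphi : A \to K$,

\[
\begin{tikzcd}
A \arrow[rr, "\varphi"] \arrow[dr, "\bar\varphi"'] & & B \\
& K \arrow[ur, tail, "\operatorname{im}\varphi"'] &
\end{tikzcd}
\]

should be an epimorphism. This is indeed the case:

Lemma 1.16. Let $\varphi : A \to B$ be a morphism in an abelian category, and let
$\operatorname{im}\varphi : K \to B$, $\operatorname{coim}\varphi : A \to C$ be its image and coimage, respectively. Then the
induced morphisms $A \to K$ and $C \to B$ are, respectively, an epimorphism and a
monomorphism.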


Proof. As usual, we will prove half of the statement and leave the other half to 
the reader (Exercise 1.20).
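To verify that $\bar\varphi : A \to K$ is an epimorphism, consider its image $K' \to K$:

\[ K' \overset{\operatorname{im}\bar\varphi \,=\, \ker\operatorname{coker}\bar\varphi}{\rightarrowtail} K \overset{\operatorname{coker}\bar\varphi}{\twoheadrightarrow} \mathrm{Cok}_{\bar\varphi} . \]

Recall that every epimorphism is the cokernel of its kernel (Lemma 1.8); in particular,
$\operatorname{coker}\bar\varphi = \operatorname{coker}(\operatorname{im}\bar\varphi)$. Now, $K' \to K$ is a monomorphism through which $\bar\varphi$
factors:

\[
\begin{tikzcd}
A \arrow[rr, "\varphi"] \arrow[dr, "\bar\varphi"] \arrow[d] & & B \\
K' \arrow[r, tail, "\operatorname{im}\bar\varphi"'] & K \arrow[ur, tail, "\operatorname{im}\varphi"'] &
\end{tikzcd}
\]

Therefore, $K' \to B$ is a monomorphism through which $\varphi$ factors and ‘preceding’
$\operatorname{im}\varphi : K \to B$. Since the image of $\varphi$ is initial among such morphisms (as proven in
Lemma 1.14), necessarily $\operatorname{im}\bar\varphi$ is an isomorphism. It follows that $0 = \operatorname{coker}\operatorname{im}\bar\varphi =
\operatorname{coker}\bar\varphi$, proving that $\bar\varphi$ is an epimorphism by Lemma 1.5. □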

To summarize the situation, we have the following commutative diagram of 
epimorphisms and monomorphisms factoring a given morphism <p in an abelian 
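category:

\[
\begin{tikzcd}
& K \arrow[dr, tail, "\operatorname{im}\varphi"] & \\
A \arrow[ur, two heads] \arrow[dr, two heads, "\operatorname{coim}\varphi"'] \arrow[rr, "\varphi" near end] & & B \\
& C \arrow[ur, tail] &
\end{tikzcd}
\]

Hopefully we have not lost track of our sought-for ‘canonical decomposition’:

\[ \varphi : A \twoheadrightarrow C \xrightarrow{\ \sim\ } K \rightarrowtail B . \]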

We now see that this amounts to the statement that in an abelian category, there 
is an induced isomorphism linking the coimage and the image of every morphism. 

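Proof of Theorem 1.13. A unique morphism $\psi : K \to C$ making the above
diagram commute exists, by the universal property of $\operatorname{im}\varphi$ (Lemma 1.14), as well
as by the universal property of $\operatorname{coim}\varphi$. Since the composition of $\psi$ with the induced
morphism $C \to B$ equals $\operatorname{im}\varphi$, which is a monomorphism, so is $\psi$. Since the composition
of the induced morphism $A \to K$ with $\psi$ equals $\operatorname{coim}\varphi$, which is an epimorphism, so is $\psi$.
It follows that $\psi$ is an isomorphism (Lemma 1.9), and letting $\tilde\varphi : C \to K$ be the
inverse of $\psi$ concludes the proof. □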

Loosely speaking, we can say that, in an abelian category, ‘image and coimage 
are naturally isomorphic’. This is imprecise, since image and coimage are them- 
selves morphisms; the isomorphism holds between the target of coimage and the 
source of image. Of course, for friendly categories such as $R$-Mod this distinction
is immaterial. 
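
To see the canonical decomposition in a small case, here is a worked example in Ab. The morphism $\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}$ below is our choice, made only for illustration; $\tilde\varphi$ denotes the induced isomorphism of Theorem 1.13.

```latex
% Illustrative example in Ab: \varphi(n) = 2n \bmod 4.
\[
\varphi : \mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}, \qquad
\ker\varphi = 2\mathbb{Z}, \qquad
\operatorname{coim}\varphi : \mathbb{Z} \twoheadrightarrow \mathbb{Z}/2\mathbb{Z}, \qquad
\operatorname{im}\varphi : \{0,2\} \rightarrowtail \mathbb{Z}/4\mathbb{Z}.
\]
% The induced isomorphism links the target of coim with the source of im.
\[
\varphi : \mathbb{Z} \twoheadrightarrow \mathbb{Z}/2\mathbb{Z}
 \xrightarrow[\ \sim\ ]{\ \tilde\varphi\ } \{0,2\} \rightarrowtail \mathbb{Z}/4\mathbb{Z},
\qquad \tilde\varphi(n \bmod 2) = 2n \bmod 4 .
\]
```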
Exercises


1.1. Prove that if $\psi \circ \varphi$ is an epimorphism, then $\psi$ is an epimorphism. Prove that
if $\psi \circ \varphi$ is a monomorphism, then $\varphi$ is a monomorphism.

1.2. Let $\varphi : A \to B$ be a morphism in an additive category. Prove that $-\varphi$ is a
monomorphism, resp., epimorphism, if and only if $\varphi$ is.

1.3. ¬ A preadditive category is a category in which each Hom-set is endowed with
an abelian group structure in such a way that composition maps are bilinear. Prove
that a ring is ‘the same as’ a preadditive category with a single object.

Additive categories are preadditive categories with zero-objects and finite products
and coproducts. Note that the notion of an ‘additive functor’ between preadditive
categories makes sense. [1.12]

1.4. Let $\mathsf{A}$ be an additive category (preadditive would suffice), and let $A$ be an
object of $\mathsf{A}$. Show that $\operatorname{End}_{\mathsf{A}}(A)$ has a natural ring structure.

1.5. Let $A$, $B$ be objects of an additive category $\mathsf{A}$, with zero-object $0$. Since $0$
is both final and initial, $\operatorname{Hom}_{\mathsf{A}}(A, 0)$ and $\operatorname{Hom}_{\mathsf{A}}(0, B)$ are both singletons, so the
image of the composition

\[ A \to 0 \to B \]

is a single element $e$ of $\operatorname{Hom}_{\mathsf{A}}(A, B)$. Prove that this is the identity element of the
abelian group $\operatorname{Hom}_{\mathsf{A}}(A, B)$. (Hint: Prove $e + e = e$.) In the text, this element is
denoted $0$, the ‘zero-morphism’.

Prove that for every morphism $\varphi$ in $\mathsf{A}$, $\varphi \circ 0 = 0 \circ \varphi = 0$.

1.6. ▷ Prove that an object $A$ of an additive category $\mathsf{A}$ is a zero-object if and only
if $\mathrm{id}_A$ equals the zero-morphism $A \to A$. [§4.2]

1.7. Give a categorical definition of cokernel in the spirit of the definition for
kernel given in Example VIII.1.11, and verify that the ordinary notion of cokernel
in $R$-Mod satisfies this categorical requirement. Verify that your definition agrees
with the one given in Definition 1.2.

1.8. Let $\varphi : A \to B$ be a morphism in an additive category $\mathsf{A}$. Prove that $\iota : K \to A$
is a kernel for $\varphi$ if and only if for all objects $Z$ the induced sequence

\[ 0 \longrightarrow \operatorname{Hom}_{\mathsf{A}}(Z, K) \longrightarrow \operatorname{Hom}_{\mathsf{A}}(Z, A) \longrightarrow \operatorname{Hom}_{\mathsf{A}}(Z, B) \]

is exact. Formulate an analogous result for cokernels.

1.9. ▷ This exercise completes the proofs of results given in the text. The arguments
should be constructed so as to mirror the proofs of the companion statements given
in the quoted lemmas.

Let $\mathsf{A}$ be an additive category.

• Let $\iota : K \to A$ be a kernel in $\mathsf{A}$; prove that $\iota$ is a monomorphism. (Cf. Lemma 1.4.)

• Let $\varphi : A \to B$ be a morphism in $\mathsf{A}$. If $\varphi$ has a cokernel, prove that $\varphi$ is an
epimorphism if and only if $B \to 0$ is its cokernel. (Cf. Lemma 1.5.)

• If $\mathsf{A}$ is abelian, prove that every kernel in $\mathsf{A}$ is the kernel of its cokernel.
(Cf. Lemma 1.8.)

[§1.3]

1.10. ▷ Prove that the opposite $\mathsf{A}^{op}$ (cf. Exercise I.3.1) of an abelian category is
an abelian category. [§5.3]

1.11. ¬ Let $\mathsf{A}$ be an abelian category, and let $\mathsf{C}$ be any small category. Prove that
the functor category $\mathsf{A}^{\mathsf{C}}$ (cf. Exercise VIII.1.9) is an abelian category.

For every object $X$ of $\mathsf{C}$, prove that the assignment $\mathscr{F} \mapsto \mathscr{F}(X)$ (with evident
action on morphisms of $\mathsf{A}^{\mathsf{C}}$, i.e., natural transformations) determines an exact
functor $\mathsf{A}^{\mathsf{C}} \to \mathsf{A}$. [1.12, 1.14, 1.15, 2.15, 5.7]

1.12. For a ring $R$, let $\mathsf{R}$ denote the preadditive category with a single object
determined by $R$ (Exercise 1.3). Prove that the category $R$-Mod of left-$R$-modules
is equivalent to the full subcategory consisting of additive functors in the functor
category $\mathsf{Ab}^{\mathsf{R}}$ (cf. Exercise 1.11). What about the category of right-$R$-modules?

1.13. Let $R$ be a commutative ring, and let $R$-Mod$^f$ denote the category of finitely
generated $R$-modules. Show that $R$-Mod$^f$ need not be an abelian category; prove
that it is, provided $R$ is Noetherian.

1.14. Let T be a topological space. Recall that a presheaf on T with values in a 
category A is a contravariant functor from a certain category associated with T to 
A (Example VIII.1.5). Define the category of A-valued sheaves on T. Prove that 
presheaves on T with values in an abelian category form an abelian category. (Hint: 
Exercise 1.11.) 

1.15. Consider the presheaf $\mathscr{C}$ of continuous complex-valued functions on a circle $S^1$
(cf. Exercise VIII.1.6) and the presheaf $\mathscr{C}^*$ of continuous complex-valued functions
that do not vanish anywhere along the circle; the first is a sheaf of abelian groups
(under $+$), and so is the second (under $\cdot$). Use the exponential map to define a
morphism $\exp : \mathscr{C} \to \mathscr{C}^*$.

Prove that $\exp$ is not an epimorphism of presheaves⁴: show that its cokernel
has value $0$ over every proper open set of $S^1$ but is nonzero over $S^1$.


4 Watch out! Both presheaves satisfy the sheaf condition; cf. Exercise VIII. 1.6. Sheaves 
of abelian groups form a full subcategory of the category of presheaves, and this category is also 
abelian, but the inclusion functor is in general not exact. For example, the exponential map 
$\mathscr{C} \to \mathscr{C}^*$ considered above is an epimorphism of sheaves.

Also, for every open set $U$ of $T$, the assignment $\mathscr{F} \mapsto \mathscr{F}(U)$ defines a functor both on the
category of presheaves of abelian groups and on the category of sheaves of abelian groups. This 
functor is exact on presheaves (this is a particular case of Exercise 1.11), but the example given 
above shows that it is not exact in general on sheaves. This lack of exactness is responsible for 
the existence of sheaf cohomology. 
1.16. ▷ Consider the pull-back diagram

\[
\begin{tikzcd}
A \times_C B \arrow[r, "\varphi'"] \arrow[d, "\psi'"'] \arrow[dr, phantom, "\square"] & B \arrow[d, "\psi"] \\
A \arrow[r, "\varphi"'] & C
\end{tikzcd}
\]

in an abelian category; prove that the induced morphism from the source of $\ker\varphi'$
to the source of $\ker\varphi$ is an isomorphism.

State and prove an analogous result for push-outs. [§1.4, §2.2, §6.2]


1.17. ▷ Prove that the morphism $A \amalg B \to A \times B$ defined for objects $A$, $B$ of an
abelian category is an epimorphism. [§1.4]


1.18. Formulate a notion of ‘intersection’ of two monomorphisms with a common
target $A \rightarrowtail Z$, $B \rightarrowtail Z$ in an abelian category. Prove that the intersection of
the natural monomorphisms $A \rightarrowtail A \oplus B$, $B \rightarrowtail A \oplus B$ is $0$.


1.19. > Formulate the universal property satisfied by the coimage of a morphism 
in an abelian category. [§1.5] 

1.20. ▷ Let $A \to B$ be a morphism in an abelian category, with coimage $A \to C$,
and let $C \to B$ be the induced morphism. Prove that $C \to B$ is a monomorphism.
(Cf. Lemma 1.16; you should not use Theorem 1.13, since Lemma 1.16 was used in 
its proof.) [§1.5] 

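1.21. ¬ Let $\varphi : A \to B$ be a morphism in an abelian category, and assume $\varphi$
decomposes as an epimorphism $\pi$ followed by a monomorphism $\iota$:

\[ \varphi : A \overset{\pi}{\twoheadrightarrow} \bullet \overset{\iota}{\rightarrowtail} B . \]

Prove that necessarily $\pi = \operatorname{coim}\varphi$ and $\iota = \operatorname{im}\varphi$. [§7.4]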


2. Working in abelian categories 

2.1. Exactness in abelian categories. Now that we have a good notion of image
in any abelian category, we can formalize the meaning of ‘exact sequence’. Consider
a sequence of objects and morphisms in an abelian category:

\[ \cdots \longrightarrow A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C \longrightarrow \cdots . \]

The sequence is exact at $B$ if

(1) $\psi \circ \varphi = 0$ and

(2) $\operatorname{coker}\varphi \circ \ker\psi = 0$.

The first condition makes the sequence a ‘complex’; it tells us that $\varphi$ factors through
$\ker\psi$. As $\ker\psi$ is a monomorphism, the universal property of images (Lemma 1.14)
yields a unique factorization of $\operatorname{im}\varphi$ through $\ker\psi$. Similarly, the second condition
tells us that $\ker\psi$ factors through $\ker(\operatorname{coker}\varphi) = \operatorname{im}\varphi$. This implies that $\operatorname{im}\varphi$ and
$\ker\psi$ coincide. The conditions defining exactness can therefore be summarized as

• $\operatorname{im}\varphi = \ker\psi$.

Of course this recovers precisely the condition used to define the notion of exactness
of a sequence of $R$-modules, in §III.7.1.
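
The two conditions can be checked by hand in small cases. The following sketch gives one exact and one non-exact complex in Ab; the groups and maps are our choice for illustration only.

```latex
% Exact at the middle term: both maps are multiplication by 2 on \mathbb{Z}/4\mathbb{Z}.
\[
\mathbb{Z}/4\mathbb{Z} \xrightarrow{\ \varphi = 2\cdot\ } \mathbb{Z}/4\mathbb{Z}
 \xrightarrow{\ \psi = 2\cdot\ } \mathbb{Z}/4\mathbb{Z},
\qquad \psi\circ\varphi = 4\cdot = 0, \qquad
\operatorname{im}\varphi = \{0,2\} = \ker\psi .
\]
% A complex that is not exact: both maps are multiplication by 4 on \mathbb{Z}/8\mathbb{Z}.
% Condition (1) holds since 16 = 0 in \mathbb{Z}/8\mathbb{Z}, but condition (2) fails.
\[
\mathbb{Z}/8\mathbb{Z} \xrightarrow{\ 4\cdot\ } \mathbb{Z}/8\mathbb{Z}
 \xrightarrow{\ 4\cdot\ } \mathbb{Z}/8\mathbb{Z},
\qquad \operatorname{im} = \{0,4\} \subsetneq \{0,2,4,6\} = \ker .
\]
```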

All expected elementary connotations of exactness hold in the context of abelian
categories as in $R$-Mod, as envisioned already in §1.2-1.3. With notation as above,
if the sequence is exact at $B$, then

$\psi$ is a monomorphism if and only if $\varphi$ is the zero-morphism

and

$\varphi$ is an epimorphism if and only if $\psi$ is the zero-morphism;

the sequence

\[ 0 \longrightarrow A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C \longrightarrow 0 \]

is exact if and only if $\varphi$ is a kernel of $\psi$ and $\psi$ is a cokernel of $\varphi$, and

\[ 0 \longrightarrow A \xrightarrow{\ \varphi\ } B \longrightarrow 0 \]

is exact if and only if $\varphi$ is an isomorphism.


Example 2.1. There is a natural exact sequence

\[ 0 \longrightarrow A \longrightarrow A \oplus B \longrightarrow B \longrightarrow 0 \]

where $A \oplus B$ is the direct sum defined in §1.4.

Example 2.2. For a slightly more interesting example, consider a diagram

\[
\begin{tikzcd}
D \arrow[r, "\varphi'"] \arrow[d, "\psi'"'] & B \arrow[d, "\psi"] \\
A \arrow[r, "\varphi"'] & C
\end{tikzcd}
\]

and the associated sequence

\[ D \xrightarrow{\ (\psi',\,\varphi')\ } A \oplus B \xrightarrow{\ \varphi\circ\pi_A - \psi\circ\pi_B\ } C \]

obtained by letting $A \oplus B$ play both roles of product and coproduct. Then

• the diagram is commutative if and only if this sequence is a complex;

• the sequence obtained by adding a $0$ to the left,

\[ 0 \longrightarrow D \longrightarrow A \oplus B \longrightarrow C , \]

is exact if and only if $D$ may be identified with the fibered product $A \times_C B$
(cf. Example 1.11);

• likewise, the sequence

\[ D \longrightarrow A \oplus B \longrightarrow C \longrightarrow 0 \]

is exact if and only if $C$ may be identified with the fibered coproduct $A \amalg_D B$.

Indeed, the first assertion is trivial; the second and third amount to the explicit
construction of fibered products and coproducts mentioned in Example 1.11. ⌟


2.2. The snake lemma, again. The reader is now in the position of making
sense in any abelian category of all the statements given in §III.7 for sequences of
$R$-modules; this includes the definition of the homology of a complex as a measure
of the failure of exactness. These facts will be reviewed in §3. However, note that
while we can now state the snake lemma (Lemma III.7.8) in any abelian category,
the (sketch of the) proof given in §III.7 cannot be directly lifted from that context
and applied to this one, or so it would seem, since the definition of the ‘snaking’
homomorphism $\delta$ given in §III.7 makes use of ‘elements’. Barring more sophisticated
alternatives, we should provide an arrow-theoretic alternative for the definition of $\delta$.

This is perhaps not completely immediate. 

As practice, note the following: 


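Lemma 2.3. Let

\[
\begin{tikzcd}
A \times_C B \arrow[r, "\varphi'"] \arrow[d, "\psi'"'] \arrow[dr, phantom, "\square"] & B \arrow[d, "\psi"] \\
A \arrow[r, "\varphi"'] & C
\end{tikzcd}
\]

be a fibered diagram in an abelian category, and assume $\varphi$ is an epimorphism. Then
$\varphi'$ is also an epimorphism.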


It takes seconds to realize that this is true in $R$-Mod, by chasing elements. An
arrow-theoretic proof is considerably subtler. 
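
For comparison, the element chase in $R$-Mod alluded to above can be written out as follows. This is a sketch that assumes the explicit description of the fibered product as a submodule of $A \times B$, with $\varphi'$ the restriction of the second projection; the notation is that of Lemma 2.3.

```latex
% Element chase in R-Mod, using A \times_C B = \{(a,b) : \varphi(a) = \psi(b)\}
% and \varphi'(a,b) = b.
\begin{align*}
&\text{Let } b \in B. \text{ Since } \varphi \text{ is surjective, there is } a \in A
  \text{ with } \varphi(a) = \psi(b). \\
&\text{Then } (a,b) \in A \times_C B \text{ and } \varphi'(a,b) = b,
  \text{ so } \varphi' \text{ is surjective.}
\end{align*}
```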


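Proof. First, observe that if $\varphi : A \to C$ is an epimorphism, so is the map
$A \oplus B \to C$ considered in Example 2.2. Since epimorphisms are cokernels in an abelian
category and cokernels are cokernels of their kernels (Lemma 1.8), we see that
$A \oplus B \to C$ is the cokernel of the natural morphism

\[ A \times_C B \longrightarrow A \oplus B . \]

To prove that $\varphi'$ is an epimorphism, it suffices (Lemma 1.3) to show that if
$\zeta : B \to Z$ is a morphism for which $\zeta \circ \varphi' = 0$, then $\zeta = 0$.

For this, consider the morphism

\[
\begin{tikzcd}
A \arrow[r] \arrow[dr, "0"'] & A \oplus B \arrow[d, dashed, "{(0,\zeta)}"] & B \arrow[l] \arrow[dl, "\zeta"] \\
& Z &
\end{tikzcd}
\]

obtained by using the fact that $A \oplus B$ is a coproduct of $A$ and $B$. The composition

\[ A \times_C B \longrightarrow A \oplus B \longrightarrow Z \]

agrees with $\zeta \circ \varphi'$, so it is the zero-morphism. By the universal property of cokernels,
we have the factorization

\[
\begin{tikzcd}
A \times_C B \arrow[r] & A \oplus B \arrow[r] \arrow[dr, "{(0,\zeta)}"'] & C \arrow[d, dashed, "\bar\zeta"] \\
& & Z
\end{tikzcd}
\]

for a unique morphism $\bar\zeta$. By the commutativity of

\[
\begin{tikzcd}
A \arrow[r] \arrow[dr, "0"'] \arrow[rr, bend left, "\varphi"] & A \oplus B \arrow[r] \arrow[d, "{(0,\zeta)}"] & C \arrow[dl, "\bar\zeta"] \\
& Z &
\end{tikzcd}
\]

we see that the composition $\bar\zeta \circ \varphi : A \to C \to Z$ is the zero-morphism. Since $\varphi$ is
an epimorphism, it follows that $\bar\zeta = 0$. This implies that $(0, \zeta) : A \oplus B \to Z$ is the
zero-morphism, and we are done: $\zeta = 0$ as promised. □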


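Here is a quicker proof (for those who have done their homework): if $\varphi$ is an
epimorphism, then $A \oplus B \to C$ is an epimorphism, so the diagram is a push-out
as well as a pull-back (Example 2.2). By Exercise 1.16, $\operatorname{coker}\varphi' = \operatorname{coker}\varphi = 0$;
therefore $\varphi'$ is an epimorphism. The reader should have no difficulty now proving
the statement dual to Lemma 2.3, for fibered coproducts (Exercise 2.2).

Going back to the snake lemma, start from a commutative diagram linking two
exact sequences in an abelian category:

\[
\begin{tikzcd}
0 \arrow[r] & L_1 \arrow[r, "\alpha_1"] \arrow[d, "\lambda"] & M_1 \arrow[r, "\beta_1"] \arrow[d, "\mu"] & N_1 \arrow[r] \arrow[d, "\nu"] & 0 \\
0 \arrow[r] & L_0 \arrow[r, "\alpha_0"] & M_0 \arrow[r, "\beta_0"] & N_0 \arrow[r] & 0
\end{tikzcd}
\]

The main task is to construct a connecting morphism⁵ $\delta : \ker\nu \to \operatorname{coker}\lambda$; once this
is done, we should prove the exactness of the sequence displayed in Lemma III.7.8.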
We will leave this latter step to the more enterprising reader (who may want to 
wait until we go through §2.3). 

The enterprising reader should in fact try to construct 5, as an exercise in 
coming up with arrow-theoretic proofs replacing well-understood element-theoretic 
arguments. Feel free to stop reading here and resume when you have tried on your 
own. 


The ‘problem’ is that the universal properties of kernel and cokernel would
seem to guarantee the existence of morphisms to $\ker\nu$ and from $\operatorname{coker}\lambda$. There’s
the rub: how on earth are we going to get a morphism from $\ker\nu$ to $\operatorname{coker}\lambda$? (Last
chance to try on your own!)


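The answer is to view $\ker\nu$ as a cokernel and $\operatorname{coker}\lambda$ as a kernel. This can be
achieved by constructing suitable pull-back, resp., push-out, diagrams:

\[
\begin{tikzcd}
M_1 \times_{N_1} (\ker\nu) \arrow[r, "\beta_1'"] \arrow[d] & \ker\nu \arrow[d] \\
M_1 \arrow[r, "\beta_1"'] & N_1
\end{tikzcd}
\qquad
\begin{tikzcd}
L_0 \arrow[r, "\alpha_0"] \arrow[d] & M_0 \arrow[d] \\
\operatorname{coker}\lambda \arrow[r, "\alpha_0'"'] & (\operatorname{coker}\lambda) \amalg_{L_0} M_0
\end{tikzcd}
\]


5 To avoid unbearably heavy notation, we will write $\ker\mu$, etc., for the source of the morphism
$\ker\mu$, etc.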
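By Lemma 2.3, $\beta_1'$ is an epimorphism since $\beta_1$ is an epimorphism, and $\ker\beta_1$, $\ker\beta_1'$
have matching sources by Exercise 1.16. Analogous (dual) statements hold for the
second diagram (Exercises 2.2 and 1.16). Putting everything into one commutative
diagram, we obtain

\[
\begin{tikzcd}
0 \arrow[r] & L_1 \arrow[r, "\sigma"] \arrow[d, equal] & M_1 \times_{N_1} (\ker\nu) \arrow[r, "\beta_1'"] \arrow[d] & \ker\nu \arrow[r] \arrow[d] & 0 \\
0 \arrow[r] & L_1 \arrow[r, "\alpha_1"] \arrow[d, "\lambda"] & M_1 \arrow[r, "\beta_1"] \arrow[d, "\mu"] & N_1 \arrow[r] \arrow[d, "\nu"] & 0 \\
0 \arrow[r] & L_0 \arrow[r, "\alpha_0"] \arrow[d] & M_0 \arrow[r, "\beta_0"] \arrow[d] & N_0 \arrow[r] \arrow[d, equal] & 0 \\
0 \arrow[r] & \operatorname{coker}\lambda \arrow[r, "\alpha_0'"] & (\operatorname{coker}\lambda) \amalg_{L_0} M_0 \arrow[r, "\tau"] & N_0 \arrow[r] & 0
\end{tikzcd}
\]

which has exact rows (not columns!). We get a morphism

\[ \epsilon : M_1 \times_{N_1} (\ker\nu) \longrightarrow (\operatorname{coker}\lambda) \amalg_{L_0} M_0 . \]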


Note that 


e o <r = 0, T<>e = 0 

by the commutativity of the diagram: indeed, the compositions 
Li — > coker A, ker v — > No 

are both zero. Since keri/ plays the role of cokeruel in the top row, Co <r = 0 implies 
that e must factor through keri/, giving a morphism 

e' : ker v * (coker A) IIx, 0 Mo . 

Since coker A plays the role of kernel in the bottom row and r ° e' : ker v — » jV 0 is 
the zero-morphism, d must factor through coker A, finally yielding 

5 : keri/— ► coker A. 


Writing elements out, the reader can verify that this morphism $\delta$ agrees with the
connecting morphism $\delta$ defined in §III.7.3. Together with maps induced by simpler
considerations, we obtain the sequence of morphisms

$$0 \to \ker\lambda \to \ker\mu \to \ker\nu \xrightarrow{\ \delta\ } \operatorname{coker}\lambda \to \operatorname{coker}\mu \to \operatorname{coker}\nu \to 0$$

in our abelian category. We should now recommend that the reader prove the 
exactness of this sequence, by appropriate arrow-theoretic arguments. This would 
be excellent practice for many such facts that we will encounter when we finally do 
begin to develop homological algebra. 

However, assuming that we have already done this work in §III.7.3, by chasing
elements, is it really necessary to go back and redo it with arrows, in order to check 
the exactness of the sequence in an abelian category? 
2.3. Working with ‘elements’ in a small abelian category. Our personal
feeling is that arrow-theoretic arguments are invariably more insightful than their
element-theoretic counterparts, even if they are ‘harder’ as a rule. At the very
least, they are more elegant: for example, in the construction of the connecting
morphism $\delta$ just given above, there was no need to verify that the result was
‘well-defined’, while this requires an argument in the set-theoretic construction in
§III.7.3. Such issues are usually resolved once and for all at the start, when the key
universal properties are established, and it is not necessary to belabor them later.

Still, it is true that element-theoretic arguments are usually easier to concoct.
Lemma 2.3 appears to be a good example: in terms of elements the statement
of this lemma is completely obvious, less so from the diagrammatic point of view.
Exercise 2.12 gives another example: providing a purely arrow-theoretic proof of the
innocent result stated there is not so easy, while this result is essentially immediate
from a more mundane viewpoint. The reader could wonder whether it may not ‘be
enough’ to prove such facts (for example, check the commutativity of a diagram
or the exactness of a sequence) by simpler element-theoretic considerations.

This is indeed the case. We will present a construction which would suffice for, 
e.g., the purpose of completing the proof of the snake lemma by chasing elements 
rather than arrows. We find this construction instructive and pretty, so we will 
take the time to go through it in some detail. However, be warned that the Freyd-Mitchell
embedding theorem (discussed in §2.4) will obliterate the actual usefulness
of the construction, since it provides us with a much stronger result. Thus, we 
will ask the reader to wade through some material, with the understanding that 
the result that we will quote thereafter (Theorem 2.9) will make that material 
immediately obsolete. Our excuse is that while proving Theorem 2.9 is beyond 
the scope of this book, the construction which we are about to describe is entirely 
within our reach, and good propaganda for the philosophy underlying the functor 
of points; this is important in, e.g., algebraic geometry and takes some practice to 
get used to. In the end, the reader is likely to gain a better understanding of the 
situation by appreciating a weaker result that we can prove entirely than by leaning 
on a stronger result to be taken on faith. However, the hurried reader should feel 
free to skip this material and jump directly to §2.4. 

The idea is to define an appropriate notion of an ‘element’ of an object of an
abelian category⁶. The reader should look back at the brief discussion at the end
of §VIII.1.2: the ‘functor of points’ $h_A = \mathrm{Hom}_{\mathsf A}(\_, A)$ associates to every object
$Z$ the set of morphisms $Z \to A$; in several categories, applying this functor to a
final object in the category ‘recovers’ $A$ as a set. Working with final objects with this purpose
cannot work too well in categories with zero-objects (why?), but the idea underlying
the functor of points suggests that we look at morphisms

$$z : Z \to A$$

for a fixed object $A$ in an abelian category $\mathsf A$ as ($Z$-flavored?!) ‘elements’ $z$ of $A$.
If $\varphi : A \to B$ is a morphism in $\mathsf A$, it makes sense to talk about the ‘element’ $\varphi(z)$

6 We will systematically put quotes around this term, i.e., write ‘element’, to indicate that
we are referring to the relatively fancy notion we will introduce. The ‘elements’ will be ordinary
elements of an ordinary (pointed) set determined by the object of the category.
of $B$: simply compose morphisms, i.e.,

$$Z \xrightarrow{\ z\ } A \xrightarrow{\ \varphi\ } B, \qquad \varphi(z) := \varphi \circ z .$$



It would now be nice if one could simply check the commutativity of a diagram in $\mathsf A$
or the exactness of a sequence by using these newly defined ‘elements’ as if they
were elements of objects in $\mathsf A$. This does not quite work: for example, the simple-minded
statement of surjectivity at the level of these ‘elements’ is not equivalent
to the notion of epimorphism in $\mathsf A$. Indeed, if $\varphi : A \to B$ is an epimorphism, it is
not necessarily the case that ‘for all elements $z$ of $B$ there exists an element $z'$ of $A$
such that $z = \varphi(z')$’: this would amount to saying that every morphism $z : Z \to B$
can be lifted to a morphism $z' : Z \to A$,



and this is simply not true in general (for example, in Ab one cannot lift the identity
$\mathbb Z/2\mathbb Z \to \mathbb Z/2\mathbb Z$ along the projection $\mathbb Z \to \mathbb Z/2\mathbb Z$). The problem is really that Hom
is not right-exact, so it does not preserve epimorphisms.
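
To see concretely why no such lift exists, here is a short worked check. It is only a sketch: the names $\pi$ (for the projection $\mathbb Z \to \mathbb Z/2\mathbb Z$) and $z'$ (for a hypothetical lift) are introduced here for illustration.

```latex
% Hypothetical lift z' : Z/2Z -> Z of the identity along the projection \pi : Z -> Z/2Z.
\[
2\, z'(\bar 1) \;=\; z'(\bar 1 + \bar 1) \;=\; z'(\bar 0) \;=\; 0
\quad\Longrightarrow\quad z'(\bar 1) = 0 \ \text{ in } \mathbb{Z},
\]
% because no nonzero integer is killed by 2. Hence z' = 0, and therefore
\[
\pi \circ z' \;=\; 0 \;\neq\; \mathrm{id}_{\mathbb{Z}/2\mathbb{Z}},
\qquad\text{that is,}\qquad
\mathrm{Hom}_{\mathsf{Ab}}(\mathbb{Z}/2\mathbb{Z}, \mathbb{Z}) = 0 .
\]
```

In other words, applying $\mathrm{Hom}(\mathbb Z/2\mathbb Z, \_)$ to the epimorphism $\mathbb Z \to \mathbb Z/2\mathbb Z$ gives the map $0 \to \mathbb Z/2\mathbb Z$, which is not surjective.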

Maybe surprisingly, there is a way to fix this problem. 

As a preliminary note, recall that in the distant past (Example I.3.8) we considered
the category Set* of ‘pointed sets’, whose objects consist of sets endowed
with the choice of a distinguished point (here we will denote by 0 the distinguished
point). All the algebraic structures we have encountered are pointed sets, often
in more than one way: for example, stripping an abelian group of all its structure
except its supporting set and the choice of the identity element leaves us with a
pointed set (in other words, this defines an evident forgetful functor Ab $\to$ Set*).
The category Set* has a zero-object 0 (the singleton $\{0\}$); hence we can talk of
‘zero-morphisms’ in Set*. Further, the category Set* has enough structure to make
sense of the notion of an ‘exact sequence of pointed sets’: we can stipulate that

$$S \xrightarrow{\ f\ } T \xrightarrow{\ g\ } U$$

is ‘exact’ at $T$ if the (set-theoretic!) image of $f$ equals the ‘kernel’ of $g$ (that is, the
preimage $g^{-1}(0)$). This notion should be taken with a grain of salt: while it is the
case that $S \to T \to 0$ is exact at $T$ if and only if $S \to T$ is surjective, it is not true
that $0 \to T \to U$ is exact at $T$ in Set* if and only if $T \to U$ is injective (the only
requirement posed by the exactness of $0 \to T \to U$ is that the only element of $T$
mapping to the distinguished element in $U$ is the distinguished element of $T$; this
is weaker than injectivity). The category Set* is, after all, not additive.
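
A minimal example of this last phenomenon may help. The pointed sets $T$, $U$ and the function $g$ below are made up for illustration and do not come from the text.

```latex
% Assumed example: T = {0, a, b}, U = {0, u}, distinguished points 0.
\[
g : T \to U, \qquad g(0) = 0, \quad g(a) = g(b) = u .
\]
% The 'kernel' of g is trivial, so 0 -> T -> U is exact at T:
\[
g^{-1}(0) = \{0\} = \operatorname{image}\,(0 \to T),
\]
% yet g is not injective, since a \neq b while g(a) = g(b).
```

In an additive setting one would subtract: $g(a) = g(b)$ would force $g(a - b) = 0$ and hence $a = b$. No such subtraction is available in Set*.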

Now we are going to associate to every object $A$ of a (small) abelian category $\mathsf A$
a pointed set $\hat A$. This will in fact give a functor $\mathsf A \to$ Set* with amazing properties:
a morphism will be a monomorphism, resp., an epimorphism, in $\mathsf A$ if and only if
the corresponding function of pointed sets is injective, resp., surjective; the (arrow-theoretic)
notions of kernel and image in $\mathsf A$ will correspond precisely to the (set-theoretic)
kernel and image in Set*; a sequence of objects in $\mathsf A$ will be exact if and
only if the corresponding sequence of pointed sets is exact; and it will be possible
to verify whether a diagram commutes in $\mathsf A$ by working in Set*.

The upshot will be that in a sense we can chase a diagram in any small abelian 
category by ‘chasing elements’. For example, working with ‘elements’ suffices to 
complete the proof of the snake lemma, in the sense that the exactness (in A) of 
the sequence obtained with the aid of the connecting morphism defined in §2.2 may 
be verified by chasing ‘elements’, since this verifies exactness in Set*. A proof in
R-Mod would transfer word-for-word to give a proof in any small⁷ abelian category.
This hints that the arrow-theoretic viewpoint that we have tried to defend in the 
past several subsections, while elegant, is not really strictly necessary; we will come 
back to this point in §2.4. 

The construction can be given as an application of the abstract language developed
in §VIII.1; but we promise that the resulting notion will be easy to contemplate,
independently of this language. The idea is simply to take the functor
of points up to a relation identifying the ‘element’ $Z \to A$ with all ‘elements’
$W \twoheadrightarrow Z \to A$.

The following abstract nonsense will have a simple-minded explanation. We will
(often silently) assume that $\mathsf A$ is small, to steer clear of any possible set-theoretic
subtlety. It is convenient to consider a companion category $\mathsf A_{\twoheadrightarrow}$, with the same
objects as $\mathsf A$ but in which

$$\mathrm{Hom}_{\mathsf A_{\twoheadrightarrow}}(Z, W) = \{\text{epimorphisms } W \twoheadrightarrow Z \text{ in } \mathsf A\} .$$

This is legal, because the composition of two epimorphisms in $\mathsf A$ is an epimorphism;
but note that $\mathsf A_{\twoheadrightarrow}$ is no longer additive (the sum of two epimorphisms need not be an
epimorphism, so we lose the algebraic structure on Hom). Also note the reversing
of arrows; this is in order that the functor that we are about to define be covariant.

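We will use $\mathsf A_{\twoheadrightarrow}$ as ‘index’ category for a limit. For a fixed object $A$ of $\mathsf A$, we
define a covariant functor

$$\mathscr H_A : \mathsf A_{\twoheadrightarrow} \to \mathsf{Set}^*$$

by ‘restricting’ the functor of points $h_A$ to $\mathsf A_{\twoheadrightarrow}$ and preserving the information of
the zero-morphism. That is, we set

$$\mathscr H_A(Z) = \mathrm{Hom}_{\mathsf A}(Z, A)$$

for all objects $Z$ of $\mathsf A_{\twoheadrightarrow}$ (that is, of $\mathsf A$), viewing $\mathrm{Hom}_{\mathsf A}(Z, A)$ as a pointed set by
distinguishing the zero-morphism, and define $\mathscr H_A$ on morphisms by composition:
for a morphism $Z \to W$ in $\mathsf A_{\twoheadrightarrow}$ (that is, an epimorphism $\alpha : W \twoheadrightarrow Z$),

$$W \overset{\alpha}{\twoheadrightarrow} Z \xrightarrow{\ z\ } A$$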


7 If the relevant diagrams only involve finitely many objects, the smallness hypothesis is not 
restrictive. The interested reader should research this issue; we will shamelessly gloss over it. 
define

$$\mathscr H_A(Z) \to \mathscr H_A(W)$$

by sending $z$ to $w = z \circ \alpha$. Of course the zero-morphism is mapped to the zero-morphism,
so this is indeed a morphism in Set*.

Definition 2.4. Let $A$ be an object of a small abelian category $\mathsf A$. The pointed
set $\hat A$ of ‘elements of $A$’ is the colimit of $\mathscr H_A$:

$$\hat A := \varinjlim \mathscr H_A .$$


Here is the simple-minded explanation. In the context of pointed sets, colimits
may be constructed by joining pointed sets at the distinguished point and then
taking a suitable quotient. For the case at hand, this translates into the following:
an ‘element’ of $A$ consists of the choice of a morphism $z : Z \to A$ in $\mathsf A$, stipulating
that two morphisms $z_1 : Z_1 \to A$, $z_2 : Z_2 \to A$ determine the same ‘element’ if
there are epimorphisms $w_1 : W \twoheadrightarrow Z_1$ and $w_2 : W \twoheadrightarrow Z_2$ in $\mathsf A$ such that the diagram

$$
\begin{array}{ccc}
W & \overset{w_1}{\twoheadrightarrow} & Z_1 \\
{\scriptstyle w_2}\downarrow & & \downarrow{\scriptstyle z_1} \\
Z_2 & \xrightarrow{\ z_2\ } & A
\end{array}
$$

commutes (that is, $z_1 \circ w_1 = z_2 \circ w_2$). This is an equivalence relation, as the reader
should check carefully (Exercise 2.5).

Now we want to make the assignment $A \mapsto \hat A$ into a covariant functor $\mathsf A \to$ Set*,
and there is only one reasonable way to do so: for $\varphi : A \to B$ and $z : Z \to A$,
define $\hat\varphi(z)$ by composing morphisms, i.e.,

$$Z \xrightarrow{\ z\ } A \xrightarrow{\ \varphi\ } B, \qquad \hat\varphi(z) := \varphi \circ z .$$

This prescription is (manifestly) compatible with the equivalence relation, so it
defines a set-function $\hat\varphi : \hat A \to \hat B$. The image of the zero-morphism is zero, so $\hat\varphi$
is a morphism in Set*, as needed. The covariance property of this assignment is
evident. We have obtained the promised functor $\mathsf A \to$ Set*.

Now we will proceed to verify all the miracles we have advertised. In the
following, the symbol $\sim$ will stand for the equivalence relation defined above⁸;
thus, two morphisms $z_1 : Z_1 \to A$, $z_2 : Z_2 \to A$ determine the same ‘element’ of $A$
if and only if $z_1 \sim z_2$.

One useful preliminary observation is that the only morphism representing the 
distinguished ‘element’ is the zero-morphism and that we can detect whether a 
morphism is 0 by working in Set*: 


8 Warning: Later on in this chapter we will define a notion of homotopy, and we will recycle 
the symbol ~ for that notion. The context and the meaning of the symbol will be completely 
different. 
Lemma 2.5. $z \sim 0 \iff z = 0$. Further, a morphism $\varphi : A \to B$ in $\mathsf A$ is 0 if and
only if $\hat\varphi(z) = 0$ for all $z \in \hat A$.


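Proof. According to the definition given above, $z : Z \to A$ is equivalent to 0 if and
only if there is an epimorphism $W \twoheadrightarrow Z$ making the following diagram commute:

$$
\begin{array}{ccc}
W & \twoheadrightarrow & Z \\
 & {\scriptstyle 0}\searrow & \downarrow{\scriptstyle z} \\
 & & A
\end{array}
$$

Since $W \twoheadrightarrow Z$ is an epimorphism, this diagram commutes if and only if $z = 0$
(Lemma 1.3), as claimed.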

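For the second statement, consider the situation

$$Z \xrightarrow{\ z\ } A \xrightarrow{\ \varphi\ } B .$$

If $\varphi = 0$, then so is $\hat\varphi(z) = \varphi \circ z$. Conversely, if $\hat\varphi(z) = 0$ for all $z$, then taking
$z : Z = A \to A$ to be the identity, we get $\varphi \sim 0$, and hence $\varphi = 0$ by the first
part. □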


This tells us in particular that we can verify the commutativity of a diagram
in $\mathsf A$ by working in Set*: because we can check whether two morphisms $\varphi, \psi \in
\mathrm{Hom}_{\mathsf A}(A, B)$ are equal by verifying that their difference $\varphi - \psi$ equals 0, which we
can do⁹ at the level of pointed sets (hence, by chasing elements!) by Lemma 2.5.

Concerning monomorphisms and epimorphisms,

Lemma 2.6. Let $\varphi : A \to B$ be a morphism in $\mathsf A$. Then

• $\varphi$ is a monomorphism if and only if $\hat\varphi$ is injective;

• $\varphi$ is an epimorphism if and only if $\hat\varphi$ is surjective.

Proof. As usual, we will propose a division of labor: the reader will prove the first 
statement (Exercise 2.6), and we will prove the second. 

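Assume $\varphi$ is an epimorphism, and let $z : Z \to B$ represent an arbitrary ‘element’
of $B$. Consider the fiber product:

$$
\begin{array}{ccc}
A & \overset{\varphi}{\twoheadrightarrow} & B \\
{\scriptstyle z'}\uparrow & & \uparrow{\scriptstyle z} \\
A \times_B Z & \xrightarrow{\ \varphi'\ } & Z
\end{array}
$$

By Lemma 2.3, $\varphi'$ is an epimorphism. It follows that $\varphi \circ z' \sim z$, that is, $\hat\varphi(z') = z$.
This shows that $\hat\varphi$ is surjective.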


9 Note that we are not saying that $\varphi = \psi$ if and only if $\hat\varphi(z) = \hat\psi(z)$ for all ‘elements’ $z$; this
is unfortunately not the case (cf. Exercise 2.10).
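Conversely, assume $\hat\varphi : \hat A \to \hat B$ is surjective. In particular, there is an ‘element’
of $A$ mapping to $\mathrm{id}_B$. This ‘element’ is represented by a morphism $z : Z \to A$,
and the condition $\hat\varphi(z) = \mathrm{id}_B$ means that there are epimorphisms $w_1$, $w_2$, as in the
commutative diagram:

$$
\begin{array}{ccc}
W & \overset{w_1}{\twoheadrightarrow} & Z \\
{\scriptstyle w_2}\downarrow & & \downarrow{\scriptstyle \varphi \circ z} \\
B & \xrightarrow{\ \mathrm{id}_B\ } & B
\end{array}
$$

Since $\varphi \circ z \circ w_1 = w_2$ is an epimorphism, it follows that $\varphi$ is an epimorphism, as
needed. □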
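
The step ‘$\varphi \circ z' \sim z$’ in the first half of this proof can be unwound directly from the definition of $\sim$. The following is a sketch, with $W$, $w_1$, $w_2$ chosen as in the simple-minded description of Definition 2.4; the particular choices are ours.

```latex
% Fiber square: \varphi \circ z' = z \circ \varphi', with \varphi' an epimorphism by Lemma 2.3.
\[
W := A \times_B Z, \qquad w_1 := \mathrm{id}_W, \qquad w_2 := \varphi' : W \twoheadrightarrow Z .
\]
% Both w_1 and w_2 are epimorphisms, and the defining square for \sim commutes:
\[
(\varphi \circ z') \circ w_1 \;=\; \varphi \circ z' \;=\; z \circ \varphi' \;=\; z \circ w_2
\quad\Longrightarrow\quad \varphi \circ z' \sim z .
\]
```

Note that the lift $z'$ is defined on $A \times_B Z$ rather than on $Z$ itself: passing to the quotient by $\sim$ is exactly what allows the source to be replaced by something mapping epimorphically onto it.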


Next, let’s verify that the notions of ‘kernel’ and ‘image’ in an abelian category 
A match precisely their simple-minded counterparts in Set*. 

Lemma 2.7. With notation as above, let $\varphi : A \to B$ be a morphism in a small
abelian category $\mathsf A$, and let $\hat\varphi : \hat A \to \hat B$ be the corresponding function of pointed
sets. Let $\ker\varphi : K \to A$, resp., $\operatorname{im}\varphi : I \to B$, be the kernel and the image of $\varphi$,
respectively. Then

• $\ker\varphi$ identifies $\hat K$ with the subset $\hat\varphi^{-1}(0)$ of $\hat A$;

• $\operatorname{im}\varphi$ identifies $\hat I$ with the image of $\hat\varphi$ in $\hat B$.

Proof. These statements are very close to the universal properties satisfied by
kernel and image.

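The reader will verify the statement about the kernel (Exercise 2.7). For the
image, recall that we have a decomposition of $\varphi$,

$$\varphi : A \twoheadrightarrow I \hookrightarrow B ,$$

and that $\operatorname{im}\varphi : I \to B$ is a kernel of $\operatorname{coker}\varphi$ (this is the definition of ‘image’
(cf. §1.5); the fact that $A \to I$ is an epimorphism is part of the content of Theorem 1.13).

To simplify the notation, let $j = \operatorname{im}\varphi$. Then (by Lemma 2.6) $\hat\jmath$ is an injective
function $\hat I \to \hat B$, mapping a representative $z : Z \to I$ to $j \circ z : Z \to B$; we have to
verify that $\hat\jmath(\hat I) = \operatorname{im}\hat\varphi$. To verify that $\hat\jmath(z)$ is in the image of $\hat\varphi$, consider the fiber
product

$$
\begin{array}{ccc}
A & \twoheadrightarrow & I \\
{\scriptstyle z'}\uparrow & & \uparrow{\scriptstyle z} \\
A \times_I Z & \twoheadrightarrow & Z
\end{array}
$$

The bottom map is an epimorphism by virtue of Lemma 2.3, since the top map
in the fiber square is an epimorphism. This shows that $\varphi \circ z' \sim j \circ z$, that is,
$\hat\jmath(z) = \hat\varphi(z')$ is in the image of $\hat\varphi$, as claimed. This verifies that the image of $\hat\jmath$ is
contained in the image of $\hat\varphi$:

$$\hat I \hookrightarrow \text{image of } \hat\varphi \subseteq \hat B .$$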
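To show that every ‘element’ in the image of $\hat\varphi$ is obtained in this fashion, let
$z : Z \to B$ be in the image of $\hat\varphi$. That is, there is a $z' : Z' \to A$ and epimorphisms
$w_1$, $w_2$ making the following diagram commute:

$$
\begin{array}{ccc}
W & \overset{w_1}{\twoheadrightarrow} & Z' \\
{\scriptstyle w_2}\downarrow & & \downarrow{\scriptstyle \varphi \circ z'} \\
Z & \xrightarrow{\ z\ } & B
\end{array}
$$

Note that $\operatorname{coker}\varphi \circ z \circ w_2 = (\operatorname{coker}\varphi \circ \varphi) \circ z' \circ w_1 = 0$; since $w_2$ is an epimorphism,
it follows that $\operatorname{coker}\varphi \circ z = 0$. By the universal property of kernels, this says that
$z$ factors uniquely through $\ker\operatorname{coker}\varphi = \operatorname{im}\varphi = j$:

$$Z \xrightarrow{\ \exists!\ } I \overset{j}{\hookrightarrow} B, \qquad \text{with composite } z .$$

This gives an ‘element’ of $I$ mapping to $z$, as needed. □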


Finally, let’s check that the functor is exact: 

Proposition 2.8. Let $\mathsf A$ be a small abelian category. Then a sequence

$$A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C$$

in $\mathsf A$ is exact if and only if the corresponding sequence

$$\hat A \xrightarrow{\ \hat\varphi\ } \hat B \xrightarrow{\ \hat\psi\ } \hat C$$

is exact in Set*.

Proof. This now follows immediately from Lemma 2.7: exactness in $\mathsf A$ means that
$\operatorname{im}\varphi = \ker\psi$, and exactness in Set* means that the image of $\hat\varphi$ equals $\hat\psi^{-1}(0)$. By
Lemma 2.7, these conditions are equivalent. □

2.4. What is missing? Pretty as it is, the construction presented in the previous 
section has several shortcomings: 

• The structure of ‘pointed set’ is less rigid than, say, that of an abelian group;
since Ab is itself an abelian category, it would be more natural to land in Ab
than Set*, if possible¹⁰.

• While we can check that a diagram commutes in $\mathsf A$ by performing some computation
in Set* (cf. the comments following Lemma 2.5), we have stopped
short of claiming that the functor $A \mapsto \hat A$ is faithful, that is, that it induces
injective functions $\mathrm{Hom}_{\mathsf A}(A, B) \to \mathrm{Hom}_{\mathsf{Set}^*}(\hat A, \hat B)$ for all $A$, $B$. In fact, the
functor is not faithful (Exercise 2.10).

10 One way to do this would be to take the colimit in Ab rather than Set*, but we will not 
pursue this possibility here. 
• The functor $\mathsf A \to$ Set* is not full; that is, it is not surjective on Hom-sets. In
other words, there will be many functions of pointed sets that are not induced 
by morphisms in A. Thus, we cannot use ‘elements’ to construct morphisms in 
A: the arrow-theoretic work performed in §2.2 in order to construct the con- 
necting morphism remains necessary (even if the properties of this morphism 
can then be verified by using ‘elements’). 

Here another miracle occurs: it is in fact possible to construct fully faithful, 
exact functors from any given (small) abelian category to a category of modules. 
Here is the precise statement, which will be left without proof in this book: 


Theorem 2.9 (Freyd-Mitchell theorem). Let $\mathsf A$ be a small abelian category. Then
there is a fully faithful, exact functor $\mathsf A \to$ R-Mod for a suitable ring $R$.


Caveat: In this statement $R$ is not necessarily commutative; R-Mod denotes
the category of left-$R$-modules.

For a reminder on the full & faithful terminology, see Definition VIII.1.6.
Briefly, a functor $\mathscr F : \mathsf A \to \mathsf C$ is fully faithful if it induces a bijection $\mathrm{Hom}_{\mathsf A}(A, B) \to
\mathrm{Hom}_{\mathsf C}(\mathscr F(A), \mathscr F(B))$ for all objects $A$, $B$ in $\mathsf A$. In practice, such a functor identifies
$\mathsf A$ with a subcategory of $\mathsf C$. The exactness (cf. Example VIII.1.18) of the functor
$\mathsf A \to$ R-Mod ensures that kernels and cokernels match in the two environments, and
so on (note that the functor is automatically faithfully exact; see Exercise 2.16):
everything we laboriously checked in §2.3 for our poor man’s functor $\mathsf A \to$ Set*
holds for the much fancier functor provided by the Freyd-Mitchell theorem.

This functor is fully faithful: this means that one can in fact construct morphisms
in an arbitrary (small) abelian category by working with elements. Indeed,
this amounts to constructing the appropriate morphisms in the ambient category
R-Mod, and fullness guarantees that these morphisms ‘already’ exist in $\mathsf A$.

So, at the end of the day we are really allowed to forget all our arrow-theoretic 
work. Element-theoretic considerations suffice, even for the purpose of constructing 
morphisms such as the ‘connecting morphism’ discussed in §2.2. Monomorphisms 
really can be assimilated with ‘subobjects’, and ‘quotients’ in an abelian category 
really are quotients in the ordinary sense. 

This is impressive: it says that doing homological algebra in the category of 
left-modules over a ring is essentially as general as doing it in an arbitrary (small) 
abelian category, at least for what concerns constructions that do not require the 
category to have special objects (such considerations will play a role in §5 and 
following). In the rest of the chapter we will adopt the common compromise of dis- 
cussing the basic notions of homological algebra in the context of abelian categories 
while reverting back to element-theoretic considerations whenever this makes life 
substantially easier. The Freyd-Mitchell theorem allows us to do this unapologeti- 
cally, and often so would the simpler considerations of §2.3. 
Exercises


2.1. In an abelian category, prove that $A \xrightarrow{\ \varphi\ } B \xrightarrow{\ \psi\ } C$ is exact at $B$ if and only
if $\psi$ and $\operatorname{coker}\varphi$ (both of which are morphisms with $B$ as source) have the same
kernel, if and only if $\varphi$ and $\ker\psi$ (both of which are morphisms with $B$ as target)
have the same cokernel.


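2.2. ▷ Consider a push-out diagram

$$
\begin{array}{ccc}
D & \longrightarrow & B \\
{\scriptstyle \alpha}\downarrow & & \downarrow{\scriptstyle \alpha'} \\
A & \longrightarrow & A \amalg_D B
\end{array}
$$

in an abelian category. Prove that if $\alpha$ is a monomorphism, then $\alpha'$ is a monomorphism.
(Cf. Lemma 2.3.) [§2.2]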


2.3. ¬ Prove the ‘four-lemma’ (cf. Exercises III.7.12 and III.7.13) in every abelian
category. In fact, show that it is only necessary to prove one of the two forms of
the lemma, for the other then follows automatically. [III.7.13]

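2.4. ¬ Prove the ‘short five-lemma’ (cf. Exercise III.7.10) in any abelian category:
consider a commutative diagram

$$
\begin{array}{ccccccccc}
0 & \to & L_1 & \to & M_1 & \to & N_1 & \to & 0 \\
 & & \downarrow{\scriptstyle\lambda} & & \downarrow{\scriptstyle\mu} & & \downarrow{\scriptstyle\nu} & & \\
0 & \to & L_0 & \to & M_0 & \to & N_0 & \to & 0
\end{array}
$$

with exact rows in any abelian category, and assume that $\lambda$ and $\nu$ are isomorphisms;
prove that $\mu$ is an isomorphism, by an explicit arrow-theoretic chase of the diagram.
[2.11]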


2.5. ▷ Prove that the relation used in §2.3 to define the notion of ‘element’ of an
object of an abelian category is indeed an equivalence relation. [§2.3]


2.6. ▷ Prove that a morphism $\varphi : A \to B$ in a small abelian category is a monomorphism
if and only if it is injective on ‘elements’. (Cf. Lemma 2.6.) [§2.3]


2.7. ▷ Let $\varphi : A \to B$ be a morphism in a small abelian category, and let $\ker\varphi :
K \to A$ be a kernel of $\varphi$. Prove that the set of ‘elements’ of $K$ may be identified
with $\hat\varphi^{-1}(0)$, where $\hat\varphi : \hat A \to \hat B$ is the induced function on the corresponding sets
of elements. (Cf. Lemma 2.7.) [§2.3]

2.8. ¬ Let $\mathsf A$ be an abelian category, and let $A$ be an object of $\mathsf A$. Prove that
$\mathrm{id}_A : A \to A$ and $-\mathrm{id}_A : A \to A$ determine the same ‘element’ of $A$ in the sense
of §2.3. [2.10]

2.9. Prove that the equivalence relation $\sim$ introduced in §2.3 does not preserve the
addition in Hom-sets, in the sense that $\varphi \sim \psi \not\Rightarrow (\varphi + \eta) \sim (\psi + \eta)$.
2.10. ▷ Let $\mathsf A$ denote an abelian category. Prove that the functor $\mathsf A \to$ Set*, $A \mapsto \hat A$
defined in §2.3 is not faithful in general. (Use Exercise 2.8.) [§2.3, §2.4]

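2.11. Upgrade the result of Exercise 2.4, by proving the five-lemma: if

$$
\begin{array}{ccccccccc}
K_1 & \to & L_1 & \to & M_1 & \to & N_1 & \to & O_1 \\
\downarrow{\scriptstyle\text{epi}} & & \downarrow{\scriptstyle\wr} & & \downarrow{\scriptstyle\mu} & & \downarrow{\scriptstyle\wr} & & \downarrow{\scriptstyle\text{mono}} \\
K_0 & \to & L_0 & \to & M_0 & \to & N_0 & \to & O_0
\end{array}
$$

is a commutative diagram with exact rows in a (small) abelian category, with
monomorphisms, epimorphisms, isomorphisms as indicated, then $\mu$ is an isomorphism.
Use ‘elements’ and the material developed in §2.3.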

2.12. ▷ Let $\lambda : M \to L$, $\nu : M \to N$ be morphisms in an abelian category:

$$L \xleftarrow{\ \lambda\ } M \xrightarrow{\ \nu\ } N .$$

Prove that¹¹

$$\frac{\operatorname{im}\lambda}{\lambda(\ker\nu)} \cong \frac{\operatorname{im}\nu}{\nu(\ker\lambda)} .$$

(Hint: By the considerations in this section, this can be proved by using elements;
in fact, by the Freyd-Mitchell theorem it suffices to prove it in R-Mod.) [§2.3, §9.3]

2.13. By refining the construction in §2.3, one can show that every small abelian 
category can be embedded in Ab. However, it is not always possible to do it fully. 

Indeed, prove that the abelian category of finite-dimensional $\mathbb R$-vector spaces is
not equivalent to any full subcategory of Ab. (Hint: If it were, the one-dimensional
vector space $\mathbb R$ would correspond to an abelian group $A$ such that $\mathrm{End}_{\mathsf{Ab}}(A) \cong
\mathrm{End}_{\mathbb R\text{-}\mathsf{Vect}}(\mathbb R)$. But look at Exercise VI.1.13.)

2.14. ¬ Let $\mathsf A$ be an abelian category, and let $\mathsf A'$ be a full subcategory of $\mathsf A$. Prove
that $\mathsf A'$ is abelian and the inclusion functor $\mathsf A' \subseteq \mathsf A$ is exact (and in particular it
preserves kernels and cokernels) if and only if

• $\mathsf A'$ contains the zero-object of $\mathsf A$,

• $\mathsf A'$ is closed under $\oplus$, and

• $\mathsf A'$ is closed under kernels and cokernels.

[2.15, 2.17, 3.3] 

2.15. ¬ Let $\mathsf A$ be a small abelian category. Construct a category $\mathsf F$ of additive
contravariant functors $\mathsf A \to$ Ab, with natural transformations as morphisms, and
show that it is abelian. (Use Exercises 1.11 and 2.14.) [2.17]

2.16. ▷ Let $\mathsf A$, $\mathsf B$ be abelian categories, and let $\mathscr F : \mathsf A \to \mathsf B$ be a faithful, exact
functor. Prove that it is faithfully exact: a sequence $X \to Y \to Z$ is exact in $\mathsf A$
if and only if the corresponding sequence $\mathscr F(X) \to \mathscr F(Y) \to \mathscr F(Z)$ is exact in $\mathsf B$.
[§2.4]

11 Here $\lambda(\ker\nu)$ denotes the image of the restriction of $\lambda$ to the source of $\ker\nu$, etc.; see
Remark 1.7 for ‘quotients’.
2.17. Upgrade the Yoneda lemma (Exercise VIII.1.10) to prove that every small
abelian category $\mathsf A$ is equivalent to a full subcategory of the category $\mathsf F$ of Exercise 2.15,
by means of the functor assigning to each object $X$ in $\mathsf A$ the functor
$h_X = \mathrm{Hom}_{\mathsf A}(\_, X)$.

Prove that this Yoneda embedding is left-exact and ‘reflects exactness’ in the
sense that $X \to Y \to Z$ is exact in $\mathsf A$ if the corresponding sequence $h_X \to h_Y \to h_Z$
is exact at $h_Y$ (that is, if $h_X(A) \to h_Y(A) \to h_Z(A)$ is exact at $h_Y(A)$ for all $A$
in $\mathsf A$).

This is the beginning of a proof of the Freyd-Mitchell theorem. Since each
$h_X$ is left-exact, the Yoneda embedding lands in the subcategory $\mathsf L$ of $\mathsf F$ consisting
of left-exact additive contravariant functors. It turns out that L is abelian in its 
own right (although its embedding in F is not exact; thus, Exercise 2.14 doesn’t 
help), and the Yoneda embedding of A in L is exact. Finally, one proves that L 
is equivalent to the category of modules over a ring (the endomorphism ring of a 
‘faithfully projective’ object), and the Freyd-Mitchell theorem follows. 


3. Complexes and homology, again 


After all these preliminary considerations, it is time to begin thinking in earnest 
about homological algebra. We will be working in a fixed abelian category A; by 
what we have seen in §2, we are allowed to pretend that the objects of A are 
ordinary modules over a ring, at least if A is small. Thus we will deal with objects 
and morphisms with the aid of usual set-theoretic considerations¹². Until we get
to more sophisticated material, the reader will lose little or nothing by taking $\mathsf A$ to
be R-Mod for some ring¹³ $R$.

We will start off with a reminder of what we did in §III.7.1. Once more, as we
now know, everything can in fact be done in the more general setting of an abelian 
category. 


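3.1. Reminder of basic definitions; general strategy. A chain complex $(M_\bullet, d_\bullet)$
in $\mathsf A$ is a sequence of objects and morphisms,

$$\cdots \longrightarrow M_{i+1} \xrightarrow{\ d_{i+1}\ } M_i \xrightarrow{\ d_i\ } M_{i-1} \longrightarrow \cdots$$

such that ($\forall i$), $d_i \circ d_{i+1} = 0$. We can just as well use ascending indices (which are
then traditionally written as superscripts),

$$\cdots \longrightarrow M^{i-1} \xrightarrow{\ d^{i-1}\ } M^i \xrightarrow{\ d^i\ } M^{i+1} \longrightarrow \cdots$$

and impose $d^i \circ d^{i-1} = 0$, with no change in the mathematics other than in the
nomenclature. This is a cochain complex $(M^\bullet, d^\bullet)$ (or $M^\bullet$ for short), leading to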

12 In so doing, we will sweep the ‘smallness’ hypothesis under the carpet. This imprecision
should be harmless in any matter concerning only countably many objects of $\mathsf A$, such as constructing
a diagram or checking that a diagram commutes. We will not deal with anything fancier than
this.

13 In fact the reader may even assume that $R$ is a commutative ring; commutativity plays no
essential role in the considerations in this chapter.
cohomology, etc. The morphisms $d^i$ are the differentials of the complex. It is not
uncommon to call ‘$d$’ all differentials within sight, decorating them by the name of
the complex if necessary to avoid confusion. So, $d_{M^\bullet}$ would implicitly denote the
differential of the complex $M^\bullet$.


Whether to work ‘homologically’ or ‘cohomologically’ is essentially an aesthetic
question, until it becomes dictated by the specific context of an application. In §III.7
we chose homology; in this chapter we will choose cohomology: thus, indices will
be increasing upper indices. Setting $M_i = M^{-i}$ reconciles the two conventions.
Thus, a chain complex $(M_\bullet, d_\bullet)$ may be (and usually is) viewed as the cochain
complex $(M^{-\bullet}, d^{-\bullet})$.

The condition $d^i \circ d^{i-1} = 0$ is equivalent to $\operatorname{im} d^{i-1} \subseteq \ker d^i$. The complex is
exact at $M^i$ if $\operatorname{im} d^{i-1} = \ker d^i$; it is ‘exact’ if it is exact everywhere. The $i$-th
cohomology of a complex $(M^\bullet, d^\bullet)$ measures its ‘deviation from exactness’ at $M^i$:

$$H^i(M^\bullet) := \frac{\ker d^i}{\operatorname{im} d^{i-1}} .$$

Setting $H_i(M_\bullet) = H^{-i}(M^\bullet)$ gives homology from cohomology, if desired.
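
As a quick illustration of the definition, here is a small worked computation in Ab. The complex below is an assumed example chosen for this sketch, not one taken from the text.

```latex
% Assumed example: M^i = \mathbb{Z}/8\mathbb{Z} for every i, and every differential is
% multiplication by 4.
\[
\cdots \longrightarrow \mathbb{Z}/8\mathbb{Z} \xrightarrow{\ \cdot 4\ } \mathbb{Z}/8\mathbb{Z}
\xrightarrow{\ \cdot 4\ } \mathbb{Z}/8\mathbb{Z} \longrightarrow \cdots,
\qquad d^i \circ d^{i-1} = (\cdot 16) = 0 .
\]
% Kernel and image of each differential:
\[
\ker d^i = \{\bar 0, \bar 2, \bar 4, \bar 6\}, \qquad
\operatorname{im} d^{i-1} = \{\bar 0, \bar 4\} \subseteq \ker d^i .
\]
% The complex is therefore not exact anywhere, and
\[
H^i(M^\bullet) = \frac{\ker d^i}{\operatorname{im} d^{i-1}} \cong \mathbb{Z}/2\mathbb{Z}
\quad\text{for all } i .
\]
```

Replacing $\mathbb Z/8\mathbb Z$ by $\mathbb Z/4\mathbb Z$ and the differentials by multiplication by 2 gives $\ker d^i = \operatorname{im} d^{i-1} = \{\bar 0, \bar 2\}$, so that variant is exact and all of its cohomology vanishes.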


Remark 3.1. In an abelian category, the definition of cohomology should be parsed
as follows (cf. Remark 1.7). Let $I \to M^i$ be $\operatorname{im} d^{i-1}$, and let $K \to M^i$ be $\ker d^i$;
since $M^\bullet$ is a complex, $\operatorname{im} d^{i-1}$ factors through $\ker d^i$, giving a monomorphism
$I \to K$; then $H^i(M^\bullet)$ is the cokernel of this morphism. We will use the ‘quotient’
notation as a good shorthand for this operation.


The following definition will undergo a few refinements later in the chapter: 

Definition 3.2. (Preliminary) A resolution of an object $A$ is a complex whose
cohomology is concentrated in degree 0 and isomorphic to $A$.


To be more precise, the datum of a resolution of $A$ should include a specific
isomorphism of the cohomology of the complex with $A$; a common abuse of language
glosses over this point. In fact, this isomorphism in cohomology should be induced
by a morphism at the level of complexes, from the given complex to the complex
called $\iota(A)$ later in this section. The notion of quasi-isomorphism (Definition 4.3)
will give us the correct framework in which to talk about resolutions precisely.

It is common to assume that resolutions $M^\bullet$ are ‘bounded’ above or below,
that is, $M^i = 0$ for $i \gg 0$ or $i \ll 0$. For example, a complex

$$\cdots \longrightarrow M^{-2} \xrightarrow{\ d^{-2}\ } M^{-1} \xrightarrow{\ d^{-1}\ } M^0 \longrightarrow 0 \longrightarrow 0 \longrightarrow \cdots$$

is a resolution of $A$ if $\operatorname{coker} d^{-1} \cong A$, and the complex is exact otherwise. Down the
road we will restrict our attention to bounded (-above or -below) resolutions. In
a sense, this is not a very serious restriction: any resolution may be truncated (at
the price of changing the degree-0 term) in order to produce a bounded resolution
(Exercise 3.1). Note that the datum of a bounded-above resolution is equivalent to
that of an exact complex

$$\cdots \longrightarrow M^{-2} \longrightarrow M^{-1} \longrightarrow M^0 \longrightarrow A \longrightarrow 0 ;$$

this is the viewpoint we took in our previous encounter with resolutions, in §VI.4.2.
The complex $\iota(A)$,

$$\cdots \longrightarrow 0 \longrightarrow A \longrightarrow 0 \longrightarrow \cdots ,$$

is a resolution of $A$, but not a very interesting one. Resolutions become useful if
the objects $M^i$ are ‘special’ in some sense (usually, $A$ should not be expected to be
special): for example, in the context of modules over a ring we could require the
objects $M^i$ to be free, flat, projective, etc. We will then say that the resolution
itself is free, flat, projective, etc.

There are many contexts in which the following strategy produces interesting 
invariants: 



Both Ext and Tor are examples of this general plan of attack: start with two
$R$-modules $M$, $N$; find (for example) a free resolution of $M$, and tensor it by $N$,
obtaining a chain complex; then the homology of this complex computes the
modules $\operatorname{Tor}_i^R(M, N)$ (cf. §VIII.2.4). Algebraic topology is a source of many other
examples and was in fact historically the main motivation driving the development
of homological algebra. In the 1950s it became clear that this tool would be
extremely useful in other fields: for example, it had a tremendous impact on algebraic
geometry, through the work of Alexander Grothendieck 14 and others.

Now consider this strategy carefully. In every single useful application, there 
are in fact many ways to associate a complex to a mathematical object: in the 
example of Tor recalled above, there are infinitely many different free resolutions of
a given module; we have in fact claimed (in §VIII.6.4) that resolutions by projective 
modules, or even flat modules, would all yield the same Tor modules. In general, 
there is a gigantic degree of freedom in the choice of the appropriate complex 
associated to the given mathematical object, and one of our main goals here will 
be to show that (for a large class of interesting examples) the (co)homology of the 
complex will not depend on this choice. 

The inquiring reader should not stop there, however. Precisely because of this 
large ambiguity, the informational gap from the complex associated to a mathemati- 
cal entity to its cohomology is large. Could one do better than ‘taking cohomology’? 
The natural approach would be to determine precisely where the ambiguity lies and 
‘mod out’ the choice of a complex by this ambiguity. Whatever one gets from this, 
it must be enough to determine the cohomology; but there may be a large amount 


14 Grothendieck’s 1957 paper “Sur quelques points d’algèbre homologique” was so influential
that you can just refer to it by naming the journal in which it appeared. If you say “This is
proven in Tohoku”, without other qualifiers, most algebraic geometers will understand that you
mean this one paper of the thousands published in this journal in the past 50 years.
of interesting information carried by ‘complexes modulo ambiguity’ which is lost
by going ‘all the way down’ to cohomology.

This is what we are up to: proving that the cohomology of the associated 
complex is indeed independent of the choices, while keeping an eye out to detect 
where the key information is really stored. 

3.2. The category of complexes. To begin this exploration, we assemble cochain
complexes in A into a new category C(A):

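• Obj(C(A)) = {cochain complexes in A};

• for $M^\bullet = (M^\bullet, d^\bullet_{M^\bullet})$, $N^\bullet = (N^\bullet, d^\bullet_{N^\bullet})$ cochain complexes, $\operatorname{Hom}_{\mathsf{C}(\mathsf{A})}(M^\bullet, N^\bullet)$
consists of the commutative 15 diagrams

$$
(*)\qquad
\begin{array}{ccccccccc}
\cdots & \longrightarrow & M^{i-1} & \longrightarrow & M^{i} & \longrightarrow & M^{i+1} & \longrightarrow & \cdots \\
 & & \downarrow{\scriptstyle\alpha^{i-1}} & & \downarrow{\scriptstyle\alpha^{i}} & & \downarrow{\scriptstyle\alpha^{i+1}} & & \\
\cdots & \longrightarrow & N^{i-1} & \longrightarrow & N^{i} & \longrightarrow & N^{i+1} & \longrightarrow & \cdots
\end{array}
$$

in A. We will denote by $\alpha^\bullet$ the morphism determined by the collection $\alpha^i$. Everything
that should be checked in order to prove that C(A) is indeed a category should
be evident. The following is also essentially evident, but a little more interesting:

Lemma 3.3. C(A) is an abelian category.
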


We will leave to the reader the careful verification of this fact (Exercise 3.3). 
In broad terms, morphisms between two given complexes form an abelian group, 
essentially because if $\alpha^i$ and $\beta^i : M^i \to N^i$ are both collections of morphisms
making the appropriate diagram commute, so is the collection of their sums $\alpha^i + \beta^i$.
Finite products and coproducts exist in C(A) as an easy consequence of the fact that 
they do in A. As for kernels and cokernels, the commutativity of each square in (*) 
and the universal properties of kernels and cokernels in A guarantee the existence 
of morphisms making the larger diagrams 


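$$
\begin{array}{ccc}
K^{i} & \dashrightarrow & K^{i+1} \\
\downarrow{\scriptstyle\ker\alpha^{i}} & & \downarrow{\scriptstyle\ker\alpha^{i+1}} \\
M^{i} & \longrightarrow & M^{i+1} \\
\downarrow{\scriptstyle\alpha^{i}} & & \downarrow{\scriptstyle\alpha^{i+1}} \\
N^{i} & \longrightarrow & N^{i+1} \\
\downarrow{\scriptstyle\operatorname{coker}\alpha^{i}} & & \downarrow{\scriptstyle\operatorname{coker}\alpha^{i+1}} \\
C^{i} & \dashrightarrow & C^{i+1}
\end{array}
$$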


15 Note the requirement that such diagrams commute. It will also be important to consider
diagrams that do not satisfy this requirement, but ‘morphisms of cochain complexes’ do. Actually,
a strong case could be made for requiring morphisms of cochain complexes to correspond to
anticommutative diagrams, but conventions are what they are and we will adhere to them.
commute. The resulting sequences $K^\bullet$, $C^\bullet$ are immediately checked to be
complexes, and the collections $\ker\alpha^i : K^i \to M^i$, $\operatorname{coker}\alpha^i : N^i \to C^i$ give morphisms
of complexes $\ker\alpha^\bullet$, $\operatorname{coker}\alpha^\bullet$ satisfying the universal property for kernels and
cokernels in C(A). The reader will check that every monomorphism in C(A) is a kernel
and every epimorphism is a cokernel by using the fact that this is the case in A.

Thus, we have a way to concoct a new abelian category from an old one. The 
new category C(A) is a natural starting place to study resolutions of objects in 
A, and it comes with the same general package of tools available in every abelian 
category: we can talk about exact sequences of complexes, we have a canonical 
decomposition of morphisms of complexes, and so on. A sequence of complexes 

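$$\cdots \longrightarrow L^\bullet \longrightarrow M^\bullet \longrightarrow N^\bullet \longrightarrow \cdots$$

is exact in C(A) if and only if all sequences

$$\cdots \longrightarrow L^i \longrightarrow M^i \longrightarrow N^i \longrightarrow \cdots$$

are simultaneously exact in A. Complexes of complexes will be important later on
(§8.2).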

Popular variations on the definition of C(A) require the complexes to be bounded 
above or below: 

• $\mathsf{C}^+(\mathsf{A})$ denotes the full 16 subcategory of C(A) determined by complexes $L^\bullet$
which are bounded below, i.e., for which $L^i = 0$ for $i \ll 0$.

• $\mathsf{C}^-(\mathsf{A})$ likewise denotes the full subcategory of bounded-above complexes.

These are also abelian categories, and so are further variations such as $\mathsf{C}^{\ge 0}(\mathsf{A})$
(complexes $L^\bullet$ for which $L^i = 0$ for $i < 0$), $\mathsf{C}^{\le 0}(\mathsf{A})$, and so on. These bounded
variants become unavoidable when dealing with resolutions. 

It will also be necessary to consider categories of complexes in which the ob- 
jects are restricted to belonging to a subcategory of A. We will devote particular 
attention to the full subcategories P and I consisting of projective, resp., injective, 
objects in A (see §5.3); thus, $\mathsf{C}^{\le 0}(\mathsf{P})$, $\mathsf{C}^{\ge 0}(\mathsf{I})$ and other variations on the same
themes are fair game 17. In the generalities that follow we will only deal with C(A),
to keep notation under control. The reader should have no difficulty determining 
what extends immediately to the different variants of C(A) and what does not. 

There are several functors and other operations available in this new setting; 
many of them seem almost too simple-minded to mention. For example, for all 
integers r we have a fully faithful, exact functor 

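$$\iota_r : \mathsf{A} \longrightarrow \mathsf{C}(\mathsf{A})$$

sending an object A of A to the complex

$$\cdots \longrightarrow 0 \longrightarrow 0 \longrightarrow \underset{\text{degree } r}{A} \longrightarrow 0 \longrightarrow 0 \longrightarrow \cdots$$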


16 That is, morphisms between two objects in the subcategory agree with morphisms of the
same two objects in the ambient category. Cf. Exercise I.3.8.

17 But note that P and I are not abelian categories! 
placing A in degree r and 0 in all other degrees. It is common to identify A with
its ‘image’ in C(A) via $\iota = \iota_0$. We also have ‘shift’ functors

$$\mathsf{C}(\mathsf{A}) \longrightarrow \mathsf{C}(\mathsf{A}), \qquad M^\bullet \longmapsto M[r]^\bullet$$

defined by setting 18 $M[r]^i = M^{i+r}$, $d^i_{M[r]^\bullet} = (-1)^r d^{i+r}_{M^\bullet}$. Complexes can be
‘truncated’ by replacing all terms of degree > r (or < r) with 0.
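
As a hedged illustration of the shift, here is a minimal Python sketch for complexes whose differentials are given by matrices. The dictionary representation (degree mapped to a matrix, as a list of rows) and the function name `shift` are assumptions made for this example only; they are not notation from the text.

```python
def shift(differentials, r):
    """Return the differentials of M[r], given those of M.

    `differentials` maps a degree i to the matrix (list of rows) of
    d^i : M^i -> M^{i+1}. Since M[r]^i = M^{i+r}, the differential of M
    in degree i becomes the differential of M[r] in degree i - r,
    multiplied by the sign (-1)^r.
    """
    sign = -1 if r % 2 else 1
    return {
        i - r: [[sign * entry for entry in row] for row in matrix]
        for i, matrix in differentials.items()
    }


# Hypothetical example: a complex with d^0 = (2) and d^1 = (0).
M = {0: [[2]], 1: [[0]]}
M_shifted = shift(M, 1)   # differentials of M[1], living in degrees -1 and 0
```

Shifting by r and then by -r returns the original differentials, since the two signs cancel.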

These operations turn out to be more important than they may look at first. 
The importance of the following example is, on the contrary, apparent. We claim 
that cohomology is a functor: 

Lemma 3.4. For every integer i, the assignment 

$$H^i : M^\bullet \longmapsto H^i(M^\bullet)$$

defines an additive covariant functor C(A) — > A. 

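Proof. Of course, the statement means that each $H^i$ induces in a natural (and
functorial) way homomorphisms of abelian groups

$$\operatorname{Hom}_{\mathsf{C}(\mathsf{A})}(M^\bullet, N^\bullet) \longrightarrow \operatorname{Hom}_{\mathsf{A}}(H^i(M^\bullet), H^i(N^\bullet))$$

for all complexes $M^\bullet$, $N^\bullet$. This follows from the commutativity requirement in the
definition of morphisms of complexes. Look at the relevant part of the diagram:

$$
\begin{array}{ccccc}
M^{i-1} & \xrightarrow{\;d^{i-1}_{M^\bullet}\;} & M^{i} & \xrightarrow{\;d^{i}_{M^\bullet}\;} & M^{i+1} \\
\downarrow{\scriptstyle\alpha^{i-1}} & & \downarrow{\scriptstyle\alpha^{i}} & & \downarrow{\scriptstyle\alpha^{i+1}} \\
N^{i-1} & \xrightarrow{\;d^{i-1}_{N^\bullet}\;} & N^{i} & \xrightarrow{\;d^{i}_{N^\bullet}\;} & N^{i+1}
\end{array}
$$

The composition $d^i_{N^\bullet} \circ \alpha^i = \alpha^{i+1} \circ d^i_{M^\bullet}$ is 0 on $\ker d^i_{M^\bullet}$ by the commutativity of the square
on the right, so the restriction of $\alpha^i$ to $\ker d^i_{M^\bullet}$ factors through $\ker d^i_{N^\bullet}$. Composing
with the projection gives a morphism

$$\ker d^i_{M^\bullet} \longrightarrow H^i(N^\bullet).$$

This morphism is 0 when restricted to $\operatorname{im} d^{i-1}_{M^\bullet}$, by the commutativity of the square
on the left. Therefore $\alpha^i$ induces a morphism

$$H^i(M^\bullet) \longrightarrow H^i(N^\bullet)$$

in A, as needed. It is clear that this assignment is covariant. □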

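We can also view cohomology as a functor $\mathsf{C}(\mathsf{A}) \to \mathsf{C}(\mathsf{A})$, by placing each
cohomology object $H^i(M^\bullet)$ in degree i, connected by zero-morphisms, obtaining a
complex $H^\bullet(M^\bullet)$. Note that the cohomology of this complex equals the cohomology
of $M^\bullet$, that is,

$$H^\bullet(H^\bullet(M^\bullet)) = H^\bullet(M^\bullet).$$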

In this sense, taking cohomology feels like performing a ‘projection’. 
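
For a concrete feel of the functors $H^i$, the following Python sketch computes the dimensions of the cohomology of a bounded complex of finite-dimensional vector spaces over Q, using dim H^i = dim M^i − rank d^i − rank d^{i−1}. The restriction to vector spaces, the list-based encoding, and the names `rank` and `cohomology_dims` are assumptions of this illustration; the text itself works in an arbitrary abelian category.

```python
from fractions import Fraction


def rank(matrix):
    """Rank over Q of a matrix given as a list of rows (Gaussian elimination)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    rk = 0
    for col in range(n_cols):
        # Look for a pivot in this column among the rows not used so far.
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(len(rows)):
            if r != rk and rows[r][col] != 0:
                factor = rows[r][col] / rows[rk][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[rk])]
        rk += 1
    return rk


def cohomology_dims(dims, diffs):
    """Dimensions of H^0, ..., H^n of a complex living in degrees 0..n.

    dims[i] is dim M^i; diffs[i] is the matrix of d^i : M^i -> M^{i+1}
    acting on column vectors, so len(diffs) == len(dims) - 1.
    The caller is assumed to supply differentials with d^{i+1} d^i = 0.
    """
    n = len(dims)
    ranks = [rank(d) for d in diffs]
    result = []
    for i in range(n):
        rank_out = ranks[i] if i < n - 1 else 0   # rank of d^i
        rank_in = ranks[i - 1] if i > 0 else 0    # rank of d^{i-1}
        result.append(dims[i] - rank_out - rank_in)
    return result


# Q -> Q^2 -> Q with d^0(x) = (x, x) and d^1(x, y) = x - y: an exact complex.
exact_example = cohomology_dims([1, 2, 1], [[[1], [1]], [[1, -1]]])
# Q -> Q with the zero differential: cohomology equals the complex itself.
zero_example = cohomology_dims([1, 1], [[[0]]])
```

The second example mirrors the ‘projection’ remark: a complex with zero differentials is its own cohomology.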

18 The sign in the differentials is inserted in order to avoid other annoying signs elsewhere;
signs do not change the isomorphism class of the complex (Exercise 3.5).
3.3. The long exact cohomology sequence. The one fact we did explore at
any level of thoroughness in §III.7.1 is the famous snake lemma, Lemma III.7.8,
which has come back to entertain us in §2.2. Written cohomologically and with 
the benefit of the language introduced in §3.2, the snake lemma takes the following 
form. Consider a commutative diagram 


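$$
\begin{array}{ccccccccc}
0 & \longrightarrow & L^1 & \xrightarrow{\;\alpha^1\;} & M^1 & \xrightarrow{\;\beta^1\;} & N^1 & \longrightarrow & 0 \\
 & & \uparrow{\scriptstyle\lambda^0} & & \uparrow{\scriptstyle\mu^0} & & \uparrow{\scriptstyle\nu^0} & & \\
0 & \longrightarrow & L^0 & \xrightarrow{\;\alpha^0\;} & M^0 & \xrightarrow{\;\beta^0\;} & N^0 & \longrightarrow & 0
\end{array}
$$

with exact rows, and view the columns as complexes

$$L^\bullet : \quad \cdots \longrightarrow 0 \longrightarrow L^0 \xrightarrow{\;\lambda^0\;} L^1 \longrightarrow 0 \longrightarrow \cdots,$$

$M^\bullet$, $N^\bullet$. The commutative diagram is then nothing but the datum of a short exact
sequence of (very special) complexes:

$$0 \longrightarrow L^\bullet \xrightarrow{\;\alpha^\bullet\;} M^\bullet \xrightarrow{\;\beta^\bullet\;} N^\bullet \longrightarrow 0.$$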

The snake lemma tells us that there is then an exact sequence in A: 

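$$(**)\qquad 0 \longrightarrow H^0(L^\bullet) \longrightarrow H^0(M^\bullet) \longrightarrow H^0(N^\bullet) \xrightarrow{\;\delta\;} H^1(L^\bullet) \longrightarrow H^1(M^\bullet) \longrightarrow H^1(N^\bullet) \longrightarrow 0.$$

Here, $H^0$, resp., $H^1$, simply stands for the kernel, resp., cokernel, of the corresponding
map: nothing else is going on in these small complexes. The (element-theoretic)
definition of the ‘connecting’ morphism $\delta$ is discussed thoroughly in §III.7.3 and can
be summarized (cohomologically speaking) as follows: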

• Start with $n \in H^0(N^\bullet) = \ker \nu^0 \subseteq N^0$.

• Choose any preimage $m$ of $n$ in $M^0$.

• Map $m$ to $\mu^0(m) \in M^1$.

• It is immediately seen that $\mu^0(m)$ maps to 0 in $N^1$; therefore it determines a
unique element $\ell \in L^1$.

• Set $\delta(n)$ = the class of $\ell$ in $H^1(L^\bullet) = \operatorname{coker}(\lambda^0)$.

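Pictorially,

$$
\begin{array}{ccccc}
\ell & \longmapsto & \mu^0(m) & & \\
\downarrow & & \uparrow & & \\
\delta(n) & & m & \longmapsto & n
\end{array}
$$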



We checked in §III.7.3 that choosing a different preimage for $n$ in $M^0$, while it may
change the element $\ell$, does not change its image in $\operatorname{coker}(\lambda^0)$. Thus $\delta$ is well-defined.
As we have seen in §2.2, it is relatively straightforward to provide an arrow-theoretic
version of this construction; but we have also seen (Theorem 2.9) that we do not
need to do that, so we will stay with elements in what follows.
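
The element-theoretic recipe for the connecting morphism can be traced on a small example. The sketch below is an illustration under assumptions of my own choosing (it is not an example from the text): both rows are 0 → Z → Z → Z/2Z → 0, with first map multiplication by 2 and second map reduction mod 2, and the vertical maps λ^0, μ^0, ν^0 are all multiplication by 2. Then ker ν^0 = Z/2Z and coker λ^0 = Z/2Z, and the function names are hypothetical.

```python
def alpha(l):      # Z -> Z, multiplication by 2 (same map in both rows)
    return 2 * l

def beta(m):       # Z -> Z/2Z, reduction mod 2 (same map in both rows)
    return m % 2

def lam(l):        # lambda^0 : L^0 -> L^1, multiplication by 2
    return 2 * l

def mu(m):         # mu^0 : M^0 -> M^1, multiplication by 2
    return 2 * m

def nu(n):         # nu^0 : N^0 -> N^1, multiplication by 2 on Z/2Z (the zero map)
    return (2 * n) % 2


def connecting(n, lift=0):
    """delta(n) for n in ker(nu^0) = Z/2Z, as a class in coker(lambda^0) = Z/2Z.

    `lift` selects which preimage of n in M^0 is used (n + 2 * lift);
    the result should not depend on it.
    """
    assert nu(n) == 0                  # n lies in H^0(N) = ker nu^0
    m = n + 2 * lift                   # a preimage of n under beta
    assert beta(m) == n % 2
    image = mu(m)                      # mu^0(m) in M^1
    assert beta(image) == 0            # it maps to 0 in N^1 ...
    l, remainder = divmod(image, 2)    # ... so it comes from a unique l in L^1
    assert remainder == 0 and alpha(l) == image
    return l % 2                       # class of l modulo the image of lambda^0
```

Trying different values of `lift` changes the element ℓ but not its class, which is the well-definedness statement recalled above.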

Now we will upgrade this construction, in what should come across as a very 
natural development. In fact, what we are going to do could be absorbed into the 
snake lemma itself (cf. Exercise 3.10); but we have always found the ‘spread out’ 
chase compelling, so here it is. Take any short exact sequence in C(A), 

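$$0 \longrightarrow L^\bullet \xrightarrow{\;\alpha^\bullet\;} M^\bullet \xrightarrow{\;\beta^\bullet\;} N^\bullet \longrightarrow 0,$$

linking three full-blown cochain complexes $L^\bullet$, $M^\bullet$, $N^\bullet$; that is, consider a large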
commutative diagram 


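$$
\begin{array}{ccccccccc}
 & & \vdots & & \vdots & & \vdots & & \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \longrightarrow & L^{i+2} & \xrightarrow{\;\alpha^{i+2}\;} & M^{i+2} & \xrightarrow{\;\beta^{i+2}\;} & N^{i+2} & \longrightarrow & 0 \\
 & & \uparrow{\scriptstyle\lambda^{i+1}} & & \uparrow{\scriptstyle\mu^{i+1}} & & \uparrow{\scriptstyle\nu^{i+1}} & & \\
0 & \longrightarrow & L^{i+1} & \xrightarrow{\;\alpha^{i+1}\;} & M^{i+1} & \xrightarrow{\;\beta^{i+1}\;} & N^{i+1} & \longrightarrow & 0 \\
 & & \uparrow{\scriptstyle\lambda^{i}} & & \uparrow{\scriptstyle\mu^{i}} & & \uparrow{\scriptstyle\nu^{i}} & & \\
0 & \longrightarrow & L^{i} & \xrightarrow{\;\alpha^{i}\;} & M^{i} & \xrightarrow{\;\beta^{i}\;} & N^{i} & \longrightarrow & 0 \\
 & & \uparrow{\scriptstyle\lambda^{i-1}} & & \uparrow{\scriptstyle\mu^{i-1}} & & \uparrow{\scriptstyle\nu^{i-1}} & & \\
0 & \longrightarrow & L^{i-1} & \xrightarrow{\;\alpha^{i-1}\;} & M^{i-1} & \xrightarrow{\;\beta^{i-1}\;} & N^{i-1} & \longrightarrow & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
 & & \vdots & & \vdots & & \vdots & &
\end{array}
$$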

in A, where the rows are exact and the columns are complexes. The connecting 
morphism constructed for the snake lemma still gives an interesting morphism 

$$\delta : \ker \nu^i \longrightarrow \operatorname{coker} \lambda^i = \frac{L^{i+1}}{\operatorname{im} \lambda^i},$$

obtained by ‘climbing the ladder’ in precisely the same way as for the snake lemma: 



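Where does the image $\delta(n)$ really lie? It is represented by an element $\ell$ in $L^{i+1}$,
determined by an element $\mu^i(m) \in \operatorname{im} \mu^i$. The image of $\mu^i(m)$ in $M^{i+2}$ must be 0,
because the central column is a complex:

$$
\begin{array}{ccc}
\lambda^{i+1}(\ell) & \longmapsto & \mu^{i+1}(\mu^i(m)) = 0 \\
\uparrow & & \uparrow \\
\ell & \longmapsto & \mu^i(m)
\end{array}
$$

Since $\alpha^{i+2}$ is a monomorphism, it follows that $\ell \in \ker \lambda^{i+1}$, and therefore we can place $\delta(n)$ in cohomology,

$$\delta(n) \in \frac{\ker \lambda^{i+1}}{\operatorname{im} \lambda^i} = H^{i+1}(L^\bullet),$$

and view $\delta$ as a morphism

$$\delta : \ker \nu^i \longrightarrow H^{i+1}(L^\bullet).$$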

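Next, the kernel of $\nu^i$ contains the image of $\nu^{i-1}$, again because columns are
complexes. What is the restriction of $\delta$ to $\operatorname{im} \nu^{i-1}$? We claim it is 0. Indeed, if
$n = \nu^{i-1}(n')$ comes from $N^{i-1}$, then a preimage of $n$ in $M^i$ may be obtained by taking a
preimage $m'$ of $n'$ in $M^{i-1}$ and mapping it to $M^i$; but since the central column is a
complex, this preimage $\mu^{i-1}(m')$ maps to 0 in $M^{i+1}$. It follows that $\delta(n) = 0$ in this case:

$$
\begin{array}{ccccc}
0 & \longmapsto & \mu^i(\mu^{i-1}(m')) = 0 & & \\
 & & \uparrow & & \\
 & & \mu^{i-1}(m') & \longmapsto & n = \nu^{i-1}(n') \\
 & & \uparrow & & \uparrow \\
 & & m' & \longmapsto & n'
\end{array}
$$

Therefore, $\delta$ factors through the quotient of $\ker \nu^i$ by $\operatorname{im} \nu^{i-1}$. Since $\ker \nu^i / \operatorname{im} \nu^{i-1}$ is nothing but the
cohomology $H^i(N^\bullet)$, we have obtained a morphism

$$\delta^i : H^i(N^\bullet) \longrightarrow H^{i+1}(L^\bullet).$$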


On the other hand, each $H^i$ is a functor (Lemma 3.4); thus there are natural
morphisms

$$H^i(L^\bullet) \longrightarrow H^i(M^\bullet) \longrightarrow H^i(N^\bullet)$$

for each i. Note that completing this diagram with 0 on the left and on the right
does not give in general a short exact sequence in A (Exercise 3.8). In other words,
the i-th cohomology is not an exact functor even if, as we will see, the cohomology
sequence is exact at $H^i(M^\bullet)$. Understanding the exactness and failure of exactness
of the functors $H^i$ is, in a sense, what this discussion is all about.

Together with the connecting morphism, we get a ‘long sequence’ of objects 
and morphisms in A, 

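$$\cdots \longrightarrow H^{i-1}(N^\bullet) \xrightarrow{\;\delta^{i-1}\;} H^i(L^\bullet) \longrightarrow H^i(M^\bullet) \longrightarrow H^i(N^\bullet) \xrightarrow{\;\delta^i\;} H^{i+1}(L^\bullet) \longrightarrow \cdots,$$

generalizing the sequence for $H^0$ and $H^1$ obtained in the snake lemma. The attentive
reader must have seen the next result coming all along: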

Theorem 3.5 (Long exact cohomology sequence). The sequence determined as
above by a short exact sequence of complexes is an exact sequence.

The snake lemma is recovered as a particular case of this statement (or con- 
versely, depending on your taste; cf. Exercise 3.10). 

Proof. The proof is a diagram chase, which everyone should perform once by
oneself in his or her lifetime. So it is mostly left as an exercise for the reader
(Exercise 3.9). But we will stress the extent to which $H^i$ is exact ‘on the nose’:
part of the claim in this theorem is that if

$$0 \longrightarrow L^\bullet \xrightarrow{\;\alpha^\bullet\;} M^\bullet \xrightarrow{\;\beta^\bullet\;} N^\bullet \longrightarrow 0$$

is exact, then the sequence induced by functoriality of $H^i$

$$0 \longrightarrow H^i(L^\bullet) \xrightarrow{\;\alpha\;} H^i(M^\bullet) \xrightarrow{\;\beta\;} H^i(N^\bullet) \longrightarrow 0$$

is exact at $H^i(M^\bullet)$. This sequence is not exact in general at $H^i(L^\bullet)$ and $H^i(N^\bullet)$;
the moral of the long exact cohomology sequence is that the failure of exactness
is precisely measured by other cohomology objects, by means of the connecting
morphisms.

Let’s check exactness at $H^i(M^\bullet)$. The sequence is a complex, simply because
$H^i$ is a functor; thus we only have to verify that $\ker \beta \subseteq \operatorname{im} \alpha$. If $\bar m$ is a class in
$H^i(M^\bullet)$ such that $\beta(\bar m) = 0$ in $H^i(N^\bullet)$, $\bar m$ is represented by an element $m \in \ker \mu^i$
such that $\beta^i(m) \in \operatorname{im} \nu^{i-1}$: $\beta^i(m) = \nu^{i-1}(n)$ for some $n \in N^{i-1}$. There is an
$m' \in M^{i-1}$ such that $\beta^{i-1}(m') = n$, and $m - \mu^{i-1}(m')$ represents the same class
$\bar m$ in $H^i(M^\bullet)$ as $m$. By the commutativity of the diagram,

$$\beta^i(m - \mu^{i-1}(m')) = \beta^i(m) - \nu^{i-1}(n) = 0.$$

Therefore there exists $\ell \in L^i$ such that $\alpha^i(\ell) = m - \mu^{i-1}(m')$. Since
$\alpha^{i+1}(\lambda^i(\ell)) = \mu^i(\alpha^i(\ell)) = 0$ and $\alpha^{i+1}$ is a monomorphism, $\ell \in \ker \lambda^i$; then $\ell$ represents a
class $\bar\ell \in H^i(L^\bullet)$ such that $\alpha(\bar\ell) = \bar m$, concluding the verification.

Verifying the exactness at the other places is similarly uninspiring and is left
to the reader 19. □

3.4. Triangles. There is an aesthetically pleasing way to state what we just proved, 
which turns out to provide a useful viewpoint. Take a short exact sequence of com- 
plexes, 

$$0 \longrightarrow L^\bullet \longrightarrow M^\bullet \longrightarrow N^\bullet \longrightarrow 0\,;$$

the element of surprise in the long exact sequence obtained in §3.3 is the fact that we 
can connect N* to L* in an interesting way, after taking cohomology and allowing 
for a shift. Well, if we are willing to settle for something less interesting, we can 
connect N* to L* right away, by simply mapping N* to 0 within L*: 

$$N^\bullet \xrightarrow{\;0\;} L^\bullet.$$


19 The reader is welcome to look for arrow-theoretic proofs. However, as discussed at length
in §2, element-based proofs suffice to prove the statement in any (small) abelian category.
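The exactness of the original short exact sequence is then equivalent to the exactness
at the center of the three even shorter sequences:

$$N^\bullet \xrightarrow{\;0\;} L^\bullet \longrightarrow M^\bullet, \qquad L^\bullet \longrightarrow M^\bullet \longrightarrow N^\bullet, \qquad M^\bullet \longrightarrow N^\bullet \xrightarrow{\;0\;} L^\bullet.$$

A nice pictorial way to represent this situation is to fold the sequence into a
triangle 20, i.e.,

$$
\begin{array}{ccc}
 & L^\bullet & \\
{\scriptstyle +1}\nearrow & & \searrow \\
N^\bullet & \longleftarrow & M^\bullet
\end{array}
$$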


which we understand to be exact at its three vertices: this is an exact triangle
in C(A). The arrow marked by +1 is in this case simply the zero-morphism; the +1
records the fact that we are going to view this as a morphism from $N^\bullet$ to the shift
$L[1]^\bullet$ of $L^\bullet$ (after all, 0 is 0 in all degrees). Thus, the triangle is shorthand for the
impressive commutative 3D-diagram 



or (understanding the morphisms in the complexes) the single long sequence

$$\cdots \longrightarrow N^{i-1} \xrightarrow{\;0\;} L^i \longrightarrow M^i \longrightarrow N^i \xrightarrow{\;0\;} \cdots,$$

whose exactness is equivalent to the exactness of the original short sequence of 
complexes. In §3.3 we have obtained from this another long exact sequence, 

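$$\cdots \longrightarrow H^{i-1}(N^\bullet) \xrightarrow{\;\delta\;} H^i(L^\bullet) \longrightarrow H^i(M^\bullet) \longrightarrow H^i(N^\bullet) \xrightarrow{\;\delta\;} H^{i+1}(L^\bullet) \longrightarrow \cdots,$$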

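which we fold again into an exact triangle:

$$
\begin{array}{ccc}
 & H^\bullet(L^\bullet) & \\
{\scriptstyle +1}\nearrow & & \searrow \\
H^\bullet(N^\bullet) & \longleftarrow & H^\bullet(M^\bullet)
\end{array}
$$


20 A common and easier to typeset notation is $L^\bullet \to M^\bullet \to N^\bullet \xrightarrow{+1}$.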
This time the morphism marked +1 is the interesting connecting morphism (while
the morphisms in the cohomology complexes are zero; see the end of §3.2). 

Thus, the ‘long exact cohomology sequence’ takes us from a certain exact trian- 
gle to another exact triangle. The first triangle arises from a short exact sequence 
of complexes (the ‘connecting morphism’ is 0), while the second does not (the con- 
necting morphism is in general nonzero). 

Clearly something interesting is going on. Triangles obtained from short exact 
sequences of complexes in C(A) appear to be ‘special’ and give rise to other exact 
triangles through cohomology. 

Actually, the situation may make the reader somewhat uncomfortable. While 
two sides of the cohomology triangle are obtained by simply applying the coho- 
mology functor to the corresponding sides of the original triangle, the third one is 
obtained by a different process, involving the connecting morphism. The reader 
may feel that there should be a mechanism to go from a triangle defined in terms 
of an exact sequence to its counterpart in cohomology by simply applying the co- 
homology functor. 

This is indeed so. There is an important notion of ‘distinguished’ triangle, in a 
category K(A) closely associated with C(A) and that we will encounter later 21 (§5). 
As with the ‘special’ triangles given above, distinguished triangles lead to long exact 
sequences in cohomology, and in a somewhat more direct fashion. Distinguished 
triangles satisfy a number of axioms, which define K(A) as a triangulated category. 
Exploring these concepts at any level of depth is well beyond the scope of this 
book, but towards the end of the chapter we will try to clarify these last cryptic 
remarks (§9.2). 


Exercises 


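3.1. ▷ Let $(M^\bullet, d^\bullet)$ be a resolution of an object A of an abelian category. Verify
that there are exact complexes

$$\cdots \longrightarrow M^{-2} \longrightarrow M^{-1} \longrightarrow \ker d^0 \longrightarrow A \longrightarrow 0,$$

$$0 \longrightarrow A \longrightarrow \operatorname{coker} d^{-1} \longrightarrow M^1 \longrightarrow M^2 \longrightarrow \cdots$$

and that replacing A by 0 in these complexes produces new resolutions of A. [§3.1, §6.1]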

3.2. ¬ Let I be the category whose objects are the integers and where $\operatorname{Hom}_{\mathsf{I}}(m, n)$
consists of a singleton if $m \le n$ and is $\emptyset$ otherwise (cf. Example I.3.2). Show that the
category C(A) of cochain complexes is a full subcategory of the abelian category $\mathsf{A}^{\mathsf{I}}$.
[3.3]

21 Beware that some authors call the distinguished triangles ‘exact’; there is some room for
confusion, since the special exact triangles in the present discussion are not ‘exact’ from this 
viewpoint. 







3.3. ▷ Verify that the category C(A) of cochain complexes in an abelian category,
defined in §3.2, is itself an abelian category. (You can do this either ‘by hand’ or 
by applying Exercises 3.2 and 2.14.) [§3.2] 

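3.4. ¬ Let A be an abelian category. Define a category Seq(A) whose objects are
short exact sequences in A and where morphisms are commutative diagrams

$$
\begin{array}{ccccccccc}
0 & \longrightarrow & A_1 & \longrightarrow & B_1 & \longrightarrow & C_1 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & A_2 & \longrightarrow & B_2 & \longrightarrow & C_2 & \longrightarrow & 0
\end{array}
$$

Thus, Seq(A) may be viewed as a full subcategory of C(A).

• Prove that Seq(A) is an additive category.

• Prove that Seq(A) has kernels and cokernels. (Watch out: these are not the
same as in C(A). Keep the snake lemma in mind.)

• Show that, for every object X of A,

$$
\begin{array}{ccccccccc}
0 & \longrightarrow & 0 & \longrightarrow & X & \xrightarrow{\;1\;} & X & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow{\scriptstyle 1} & & \downarrow & & \\
0 & \longrightarrow & X & \xrightarrow{\;1\;} & X & \longrightarrow & 0 & \longrightarrow & 0
\end{array}
$$

is both a monomorphism and an epimorphism in Seq(A), although it is neither
a monomorphism nor an epimorphism in C(A) if $X \neq 0$.

• Prove that Seq(A) is not abelian if A has nonzero objects.

[§1.3, 3.11]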

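3.5. ▷ Let $(L^\bullet, d^\bullet_{L^\bullet})$ be a cochain complex; define new differentials $\hat d^\bullet_{L^\bullet}$ by arbitrarily
changing the sign of $d^i_{L^\bullet}$: $\hat d^i_{L^\bullet} = \pm d^i_{L^\bullet}$. Show that $(L^\bullet, \hat d^\bullet_{L^\bullet})$ is a cochain
complex isomorphic to $(L^\bullet, d^\bullet_{L^\bullet})$. [§3.2]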

3.6. Provide an arrow-theoretic proof of Lemma 3.4 (cf. Remark 3.1). 

3.7. Let A, B be abelian categories. An additive functor $\mathscr{F} : \mathsf{A} \to \mathsf{B}$ is exact if it
maps short exact sequences in A to short exact sequences in B. Prove that exact
functors commute with cohomology: if $\mathscr{F}$ is exact and $L^\bullet$ is a cochain complex
in A, then $H^\bullet(\mathscr{F}(L^\bullet)) \cong \mathscr{F}(H^\bullet(L^\bullet))$, where $\mathscr{F}(L^\bullet)$ denotes the cochain complex
in B obtained by applying $\mathscr{F}$ to all objects and morphisms in $L^\bullet$.

In particular, the image of an exact complex through an exact functor is still 
an exact complex (cf. Exercise VIII.1.23). 

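3.8. ▷ For any i, give an example of a short exact sequence of complexes

$$0 \longrightarrow L^\bullet \longrightarrow M^\bullet \longrightarrow N^\bullet \longrightarrow 0$$

such that the sequence

$$0 \longrightarrow H^i(L^\bullet) \longrightarrow H^i(M^\bullet) \longrightarrow H^i(N^\bullet) \longrightarrow 0$$

is not exact at either $H^i(L^\bullet)$ or $H^i(N^\bullet)$. [§7.2]
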
3.9. ▷ Provide the gory details for the long exact cohomology sequence (Theorem 3.5).
[§3.3, 3.10]

3.10. ▷ Here is a way to reduce Exercise 3.9 to the snake lemma itself.

Every complex $(L^\bullet, \lambda^\bullet)$ determines a complex $\ker^\bullet$, with the object 22 $\ker \lambda^{i+1}$
in degree i, connected by zero-morphisms, and a complex $\operatorname{coker}^\bullet$, with the object
$\operatorname{coker} \lambda^{i-1}$ in degree i, also connected by zero-morphisms.

Prove that $\lambda^\bullet$ induces a morphism $\bar\lambda : \operatorname{coker}^\bullet \to \ker^\bullet$ in the abelian category
of complexes, such that $\ker \bar\lambda = H^\bullet(L^\bullet)$ and $\operatorname{coker} \bar\lambda = H(L^\bullet)[1]^\bullet$.

Now work out the whole long exact cohomology sequence as a consequence
of the snake lemma. (You will use the version of the snake lemma given in
Remark III.7.11.) [§3.3, 3.13]

3.11. ¬ Prove that the long exact cohomology sequence is functorial, in the sense
that it defines a covariant functor from the category Seq(C(A)) of short exact se- 
quences (cf. Exercise 3.4) of complexes to the category of complexes. [7.13] 

3.12. Redo Exercise III.7.17. 

3.13. The viewpoint in Exercise 3.10 admits the following straightforward gener- 
alization. 

Let A be an abelian category. Define a new category dA, whose objects are pairs
(A, d), where A is an object of A and $d : A \to A$ is a morphism (the ‘differential’)
such that $d^2 = 0$. Morphisms $(A, d_A) \to (B, d_B)$ in dA are morphisms $\varphi : A \to B$
commuting with the differentials: $d_B \varphi = \varphi d_A$.

• Prove that dA is an abelian category.

• For any (A, d) in dA, prove that d induces a morphism $\bar d : \operatorname{coker} d \to \ker d$.

• Define H(A, d) to be $\ker \bar d$. Prove that $\operatorname{coker} \bar d \cong H(A, d)$.

• Show that H defines an additive functor dA → A and that this functor is not
exact in general.

• However, prove that every short exact sequence

$$0 \longrightarrow (A, d_A) \longrightarrow (B, d_B) \longrightarrow (C, d_C) \longrightarrow 0$$

in dA induces an exact triangle (in the sense of §3.4)

$$
\begin{array}{ccc}
 & H(A, d_A) & \\
\nearrow & & \searrow \\
H(C, d_C) & \longleftarrow & H(B, d_B)
\end{array}
$$

3.14. ▷ Prove that if one of the vertices of a ‘special’ triangle arising from a short
exact sequence of complexes (as in §3.4) is exact, then the other two vertices have
isomorphic cohomologies, possibly up to a shift. [§4.1]


22 Of course ker and coker mean the source and target of the arrows ker and coker, respectively.
3.15. ¬ Define a ‘Grothendieck group’ and a ‘universal Euler characteristic’ $\chi$ for
any abelian category A, in the style of the construction given in §VI.3.4.

Extend $\chi$ to all bounded cochain complexes $M^\bullet$ in A, by setting
$\chi(M^\bullet) := \sum_i (-1)^i \chi(M^i)$. Prove that $\chi(M^\bullet) = \chi(H^\bullet(M^\bullet))$. For every short exact sequence
of bounded complexes $0 \to L^\bullet \to M^\bullet \to N^\bullet \to 0$, prove that
$\chi(M^\bullet) = \chi(L^\bullet) + \chi(N^\bullet)$. [4.3, 9.1]


4. Cones and homotopies 

If $\alpha^\bullet : L^\bullet \to M^\bullet$ is a morphism of cochain complexes, it is natural to try to describe
the morphism induced in cohomology:

$$H^\bullet(\alpha^\bullet) : H^\bullet(L^\bullet) \longrightarrow H^\bullet(M^\bullet).$$

However, we have pointed out that $H^\bullet$ is not an exact functor, and this seems to
pose an obstacle to simple-minded strategies aimed at describing (for example) the
kernel or cokernel of $H^\bullet(\alpha^\bullet)$ in terms of the kernel or cokernel of $\alpha^\bullet$. The long
exact cohomology sequence may be used to compensate for this lack of exactness: 
we will see how in §4.1. We will also begin exploring a very important condition 
(homotopy) guaranteeing that two given morphisms of complexes induce the same 
morphism in cohomology. Homotopies will be so important that they will lead us 
later on to change our notion of morphisms between complexes and construct a 
new ‘homotopic category’ of complexes (see §5). 

4.1. The mapping cone of a morphism. Let $\alpha^\bullet : L^\bullet \to M^\bullet$ be a morphism
of cochain complexes. The mapping cone $MC(\alpha)^\bullet$ of $\alpha^\bullet$ will allow us to
recover $H^\bullet(\alpha^\bullet)$ as the connecting morphism in a long exact cohomology sequence,
as constructed in §3.3. In fact, the mapping cone will give us the third vertex of a
‘special’ triangle (cf. the end of §3.4):

$$
(\dagger)\qquad
\begin{array}{ccc}
 & M^\bullet & \\
{\scriptstyle +1}\nearrow & & \searrow \\
L[1]^\bullet & \longleftarrow & MC(\alpha)^\bullet
\end{array}
$$

The degree-increasing morphism is the zero-morphism here (as in the ‘special’ triangles
encountered in §3.4); but taking cohomology

$$
\begin{array}{ccc}
 & H^\bullet(M^\bullet) & \\
{\scriptstyle +1}\nearrow & & \searrow \\
H^\bullet(L[1]^\bullet) & \longleftarrow & H^\bullet(MC(\alpha)^\bullet)
\end{array}
$$

we will obtain morphisms

$$H^{i+1}(L^\bullet) = H^i(L[1]^\bullet) \longrightarrow H^{i+1}(M^\bullet)$$

that, we claim, will be nothing but the morphisms induced in cohomology by the
given morphism $\alpha^\bullet$. Thus, the morphisms induced in cohomology cannot be
extended to exact sequences (that would clearly be asking too much), but they can
be extended to triangles arising by applying cohomology to an appropriate exact
sequence of complexes.

Incidentally, the triangle (f) displayed above (but with the degree-increasing 
morphism replaced by -a*) will be the prototype of the ‘distinguished triangles’ 
mentioned in §3.4 (cf. §9.2).

Construction of the mapping cone. The objects of $MC(\alpha)^\bullet$ are simply direct
sums of objects of $L^\bullet$ and $M^\bullet$:

$$MC(\alpha)^i := L[1]^i \oplus M^i = L^{i+1} \oplus M^i;$$

but the morphisms $d^i_{MC(\alpha)^\bullet} : MC(\alpha)^i \to MC(\alpha)^{i+1}$ are not the ‘obvious’ ones,
but rather

$$d^i_{MC(\alpha)^\bullet}(\ell, m) := \big(-d^{i+1}_{L^\bullet}(\ell),\; \alpha^{i+1}(\ell) + d^i_{M^\bullet}(m)\big).$$

The sign in the first component is inherited from the shifting of $L^\bullet$; thus, the first
component is simply the differential of $L[1]^\bullet$. The second component mixes $\alpha^\bullet$
(or rather its shift $\alpha[1]^\bullet$) and the differential of $M^\bullet$. The reader should verify that
$d^{i+1}_{MC(\alpha)^\bullet} \circ d^i_{MC(\alpha)^\bullet} = 0$, so $MC(\alpha)^\bullet$ is indeed a cochain complex (Exercise 4.1);
this is where the sign comes in. The mapping cone is a special case of the ‘total
complex’ determined by a double complex, to be explored later (§8.2).
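
The role of the sign can be checked by machine on a small case. The sketch below assembles the matrix of the mapping cone differential, acting on column vectors (ℓ, m) in L^{i+1} ⊕ M^i, and multiplies two consecutive differentials. The dictionary encoding and the names `cone_differential`, `matmul`, and `zeros` are assumptions of this illustration, and the example complex (Z with differential multiplication by 2, mapped to itself by the identity) is chosen only for convenience.

```python
def zeros(n_rows, n_cols):
    return [[0] * n_cols for _ in range(n_rows)]


def matmul(A, B, n_cols):
    """Product A*B of matrices given as lists of rows; n_cols = number of columns of B."""
    return [[sum(a * B[k][j] for k, a in enumerate(row)) for j in range(n_cols)]
            for row in A]


def cone_differential(i, dim_L, dim_M, d_L, d_M, alpha):
    """Matrix of d^i : MC^i -> MC^{i+1}, where MC^i = L^{i+1} (+) M^i.

    dim_L, dim_M: dicts degree -> rank (missing degrees mean 0).
    d_L, d_M, alpha: dicts degree -> matrix (missing degrees mean the zero map).
    """
    l_in, l_out = dim_L.get(i + 1, 0), dim_L.get(i + 2, 0)
    m_in, m_out = dim_M.get(i, 0), dim_M.get(i + 1, 0)
    dL = d_L.get(i + 1, zeros(l_out, l_in))
    dM = d_M.get(i, zeros(m_out, m_in))
    a = alpha.get(i + 1, zeros(m_out, l_in))
    top = [[-x for x in row] + [0] * m_in for row in dL]   # (l, m) -> -d_L(l)
    bottom = [ra + rm for ra, rm in zip(a, dM)]            # (l, m) -> alpha(l) + d_M(m)
    return top + bottom


# L = M = (Z --2--> Z) in degrees 0 and 1, with alpha the identity.
dims = {0: 1, 1: 1}
d = {0: [[2]]}
identity = {0: [[1]], 1: [[1]]}
d_minus_1 = cone_differential(-1, dims, dims, d, d, identity)   # MC^{-1} -> MC^0
d_0 = cone_differential(0, dims, dims, d, d, identity)          # MC^0 -> MC^1
composite = matmul(d_0, d_minus_1, 1)
```

In this example the composite is the zero matrix; if the minus sign in the first component were dropped, the same product would be (4) rather than (0).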

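Pictorially, $MC(\alpha)^\bullet$ looks like this:

$$
\begin{array}{ccccccc}
\cdots & \longrightarrow & L^{i+1} & \xrightarrow{\;-d^{i+1}_{L^\bullet}\;} & L^{i+2} & \longrightarrow & \cdots \\
 & & \oplus & \searrow{\scriptstyle\alpha^{i+1}} & \oplus & & \\
\cdots & \longrightarrow & M^{i} & \xrightarrow{\;d^{i}_{M^\bullet}\;} & M^{i+1} & \longrightarrow & \cdots
\end{array}
$$

There are evident morphisms of complexes $M^\bullet \to MC(\alpha)^\bullet$ and $MC(\alpha)^\bullet \to L[1]^\bullet$,
induced by the natural morphisms $M^i \to L^{i+1} \oplus M^i \to L^{i+1}$. Since the sequences

$$0 \longrightarrow M^i \longrightarrow L[1]^i \oplus M^i \longrightarrow L[1]^i \longrightarrow 0$$

are all exact, the sequence of complexes

$$0 \longrightarrow M^\bullet \longrightarrow MC(\alpha)^\bullet \longrightarrow L[1]^\bullet \longrightarrow 0$$

is exact.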

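Proposition 4.1. There is an exact triangle

$$
\begin{array}{ccc}
 & H^\bullet(M^\bullet) & \\
{\scriptstyle \delta}\nearrow & & \searrow \\
H^\bullet(L[1]^\bullet) & \longleftarrow & H^\bullet(MC(\alpha)^\bullet)
\end{array}
$$

where the connecting morphism $\delta$ is the morphism induced by $\alpha^\bullet$ in cohomology.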

Proof. The existence of the triangle is a direct consequence of Theorem 3.5; all we 
have to check is that the connecting morphism indeed agrees with the morphism 
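induced by $\alpha^\bullet$. Chasing the diagram

$$
\begin{array}{ccccccccc}
0 & \longrightarrow & M^{i} & \longrightarrow & L^{i+1} \oplus M^{i} & \longrightarrow & L^{i+1} & \longrightarrow & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \longrightarrow & M^{i-1} & \longrightarrow & L^{i} \oplus M^{i-1} & \longrightarrow & L^{i} & \longrightarrow & 0
\end{array}
$$

with a class in $H^{i-1}(L[1]^\bullet) = H^i(L^\bullet)$ represented by $\ell \in L^i$ (such that $d^i_{L^\bullet}(\ell) = 0$),
we find

$$
\begin{array}{ccccc}
\alpha^i(\ell) & \longmapsto & (0, \alpha^i(\ell)) & & \\
 & & \uparrow & & \\
 & & (\ell, 0) & \longmapsto & \ell
\end{array}
$$

which verifies the claim. □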

Corollary 4.2. Let $\alpha^\bullet : L^\bullet \to M^\bullet$ be a morphism of cochain complexes. Then
the induced morphism $H^\bullet(L^\bullet) \to H^\bullet(M^\bullet)$ is an isomorphism if and only if the
mapping cone $MC(\alpha)^\bullet$ is an exact complex.

(Cf. Exercise 3.14.) 

We should mention that mapping cones arose in topology: the mapping cone 
of a continuous map $f : X \to Y$ is obtained by considering $X \times [0, 1]$, identifying
$X \times 0$ to a point, and stitching $X \times 1$ to $Y$ by means of $f$. Analyzing chains of this
space leads to the (homological version of the) ‘algebraic’ mapping cone considered 
above. 

4.2. Quasi-isomorphisms and derived categories. A component of our main 
strategy consists of understanding when cochain complexes ‘have a good reason’ 
to have the same cohomology. Corollary 4.2 provides us with such a reason. The 
morphisms singled out in that statement have a name: 

Definition 4.3. A morphism $\alpha^\bullet$ of cochain complexes is a quasi-isomorphism if it
induces an isomorphism in cohomology.

By Corollary 4.2, this condition is equivalent to the exactness of the mapping 
cone of a*. 
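
The criterion of Corollary 4.2 lends itself to a mechanical test in the special case of bounded complexes of finite-dimensional vector spaces over Q, where exactness can be read off from ranks. The following sketch is written under that assumption; the list-based encoding (complexes in degrees 0..n, matrices acting on column vectors) and the names `is_quasi_isomorphism`, `entry`, `dim`, `zeros`, and `rank` are hypothetical choices for this example.

```python
from fractions import Fraction


def rank(matrix):
    """Rank over Q of a matrix given as a list of rows (Gaussian elimination)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    rk = 0
    for col in range(n_cols):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(len(rows)):
            if r != rk and rows[r][col] != 0:
                factor = rows[r][col] / rows[rk][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[rk])]
        rk += 1
    return rk


def zeros(n_rows, n_cols):
    return [[0] * n_cols for _ in range(n_rows)]


def dim(dims, i):
    return dims[i] if 0 <= i < len(dims) else 0


def entry(mats, i, n_rows, n_cols):
    """mats[i] if degree i is in range, else the zero matrix of the given shape."""
    return mats[i] if 0 <= i < len(mats) else zeros(n_rows, n_cols)


def is_quasi_isomorphism(dims_L, d_L, dims_M, d_M, alpha):
    """Test exactness of the mapping cone MC^i = L^{i+1} (+) M^i, for i = -1..n-1."""
    n = max(len(dims_L), len(dims_M))
    cone_dims, cone_ranks = [], []
    for i in range(-1, n):
        l_in, l_out = dim(dims_L, i + 1), dim(dims_L, i + 2)
        m_in, m_out = dim(dims_M, i), dim(dims_M, i + 1)
        cone_dims.append(l_in + m_in)
        top = [[-x for x in row] + [0] * m_in
               for row in entry(d_L, i + 1, l_out, l_in)]
        bottom = [a + d for a, d in zip(entry(alpha, i + 1, m_out, l_in),
                                        entry(d_M, i, m_out, m_in))]
        cone_ranks.append(rank(top + bottom))
    for k, total in enumerate(cone_dims):
        rank_in = cone_ranks[k - 1] if k > 0 else 0
        if total - cone_ranks[k] - rank_in != 0:   # dim of cohomology of the cone
            return False
    return True
```

For instance, the identity of the complex Q → Q (multiplication by 2) passes the test, while the zero map from Q to Q, both placed in degree 0, does not.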

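Example 4.4. The datum of a resolution $M^\bullet$ of an object A of an abelian category
A, as in Definition 3.2 and with $M^i = 0$ for $i > 0$, is the same as the datum
of a quasi-isomorphism

$$M^\bullet \xrightarrow{\;\text{q-iso.}\;} \iota(A)$$

where $\iota$ places A in degree 0 and 0 in all other degrees. Here is a larger version of
the same diagram:

$$
\begin{array}{ccccccccccc}
\cdots & \longrightarrow & M^{-2} & \longrightarrow & M^{-1} & \longrightarrow & M^{0} & \longrightarrow & 0 & \longrightarrow & \cdots \\
 & & \downarrow & & \downarrow & & \downarrow & & \downarrow & & \\
\cdots & \longrightarrow & 0 & \longrightarrow & 0 & \longrightarrow & A & \longrightarrow & 0 & \longrightarrow & \cdots
\end{array}
$$

where the only nontrivial vertical map is the cokernel of $d^{-1}_{M^\bullet}$.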


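Thus, quasi-isomorphisms may be viewed as generalizations of more simple-minded
resolutions. Also note that the mapping cone of a resolution as in Example 4.4
is obtained (as the reader should check) by shifting the complex ‘one step
to the left’ and completing it with A, obtaining the exact complex:

$$\cdots \longrightarrow M^{-2} \xrightarrow{\;-d^{-2}_{M^\bullet}\;} M^{-1} \xrightarrow{\;-d^{-1}_{M^\bullet}\;} M^{0} \xrightarrow{\;\operatorname{coker} d^{-1}_{M^\bullet}\;} A \longrightarrow 0 \longrightarrow \cdots$$
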

This is nothing but our other point of view on resolutions (as in §VI.4.2). 


Remark 4.5. Quasi-isomorphisms with fixed target (resp., source) form a category 
in a natural way, which we will leave to the reader to make precise. In particular, 
resolutions in $\mathsf{C}^{\le 0}(\mathsf{A})$ (resp., $\mathsf{C}^{\ge 0}(\mathsf{A})$) of a given object of A, as in Example 4.4,
form a category.


Wouldn't it be nice if quasi-isomorphisms were actual isomorphisms, that is, 
if they were invertible? In general, they are not. The following example will arise 
several times to ward off any such hopes. 

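Example 4.6. In C(Ab), the morphism of complexes

$$
\begin{array}{ccccccccccc}
\cdots & \longrightarrow & 0 & \longrightarrow & \mathbb{Z} & \xrightarrow{\;\cdot 2\;} & \mathbb{Z} & \longrightarrow & 0 & \longrightarrow & \cdots \\
 & & \downarrow & & \downarrow & & \downarrow{\scriptstyle\pi} & & \downarrow & & \\
\cdots & \longrightarrow & 0 & \longrightarrow & 0 & \longrightarrow & \mathbb{Z}/2\mathbb{Z} & \longrightarrow & 0 & \longrightarrow & \cdots
\end{array}
$$

where $\pi$ is the natural projection, is a quasi-isomorphism, but it does not have an
inverse since there are no nontrivial homomorphisms $\mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}$.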

In fact, this example shows that ‘quasi-isomorphism’ is not even an equivalence
relation (it is not symmetric). This is too bad: if it were, then the ‘equivalence
class of a complex’ would be the most natural candidate for the entity carrying the
maximal amount of cohomological information of a complex.

We will stop short of defining a ‘quasi-isomorphism relation’: in this book,
quasi-isomorphism is simply a particular quality of a morphism of cochain complexes,
not a relation. In any case recall that, even in the context of sets, taking
the quotient by an equivalence relation is not the ‘primary’ object of interest: the
quotient is just the solution to the natural universal problem of studying functions 
to other sets with identical behavior on ‘equivalent’ elements. This is the primary 
objective. 
Similarly, the primary objective in the present context is not so much to identify
together all complexes linked by a quasi-isomorphism, as it is to be able to shuttle
information back and forth between such complexes. The natural way to accomplish
this is to study environments in which quasi-isomorphisms do have inverses. More
precisely, it consists of studying additive functors

$$\mathscr{F} : C(A) \to D$$

such that $\mathscr{F}(\rho^\bullet)$ is an isomorphism for every quasi-isomorphism $\rho^\bullet$. (Cohomology
is such a functor.)

This sounds like yet another universal problem, and the natural line of attack
would be to solve it as such, that is, to define (if possible) a category D(A) endowed
with an additive functor $C(A) \to D(A)$ such that quasi-isomorphisms in C(A) are
mapped to isomorphisms in D(A) and through which every $\mathscr{F}$ as above must factor
uniquely:

$$\begin{array}{ccc}
C(A) & \xrightarrow{\ \mathscr{F}\ } & D \\
\downarrow & \nearrow{\scriptstyle \exists!} & \\
D(A) & &
\end{array}$$



To be honest, we should take such a statement with a grain of salt: the natural
environment in which we should pose this universal problem would be a ‘category
of categories’, and we have not defined such a thing in this book. A category D(A)
as above does exist, at least if a certain ‘localization’ process can be carried out[23]:
it is the derived category of A. ‘Bounded’ versions $D^-(A)$, $D^+(A)$, and more, are
solutions to the corresponding universal problems for appropriately bounded
complexes.

These derived categories answer the main question underlying our strategy, of 
understanding ‘what’ in a complex really determines its cohomology: the answer 
amounts to viewing the complex in the appropriate derived category. The con- 
struction of D(A) consists roughly of taking the same objects as C(A) and formally 
inverting the quasi-isomorphisms. The details of this construction are somewhat 
involved and are best left to more advanced texts. But we will be able to capture its 
essence in this book, at least in lucky cases (when the abelian category has ‘enough 
injectives’ or ‘enough projectives’; see Definition 5.7). 

The derived category has the unfortunate reputation of being an overly abstract
notion, and there are fundamental questions about whether it really is the best
approach to an abstract study of cohomology. This is partly because the derived
category of an abelian category is not an abelian category, and simple notions such as
kernel, cokernel, and exact sequence are not available in D(A). This adds a substantial
layer of complication to the theory.

On the other hand, going past these difficulties, one finds that enough structure 
remains to do much homological algebra: objects in D(A) have cohomology, and 
there are ‘distinguished triangles’ (cf. §3.4) abstracting exact sequences and giving 


[23] There may be set-theoretic issues with this process. However, these play no role in the
cases we will consider, in which A has ‘enough projectives’ or ‘enough injectives’.
long exact cohomology sequences. The derived category is a triangulated category,
like the more manageable homotopic category K(A) that we will soon define. Its
uses go beyond the confines of algebra or even mathematics: an approach to the
understanding of ‘D-branes’ in string theory is based on derived categories.

To get a sense of how counterintuitive D(A) must be, note that the zero-morphism
$0 : M^\bullet \to M^\bullet$ may well be a quasi-isomorphism: in fact, this is so
precisely when $M^\bullet$ is exact. Well, in any category D as above (and in particular
in the derived category D(A)) this zero-morphism remains the zero-morphism, but
it comes equipped with an inverse when it is a quasi-isomorphism. Thus, in the
derived category the zero-morphism on a complex involving nonzero objects may
well be invertible! This says that complexes that are nonzero in C(A) may become
zero-objects in the derived category.

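Lemma 4.7. Let D be an additive category, and let $\mathscr{F} : C(A) \to D$ be an additive
functor such that $\mathscr{F}(\rho^\bullet)$ is an isomorphism for every quasi-isomorphism $\rho^\bullet$.

• Let $M^\bullet$ be an exact complex in C(A). Then $\mathscr{F}(M^\bullet)$, the object obtained by
applying $\mathscr{F}$ to $M^\bullet$, is a zero-object in D.

• Let $\alpha^\bullet : L^\bullet \to N^\bullet$ be a morphism in C(A) that factors through an exact
complex,

$$\begin{array}{ccc}
L^\bullet & \xrightarrow{\ \alpha^\bullet\ } & N^\bullet \\
 & \searrow \quad \nearrow & \\
 & M^\bullet &
\end{array}$$

with $M^\bullet$ exact. Then $\mathscr{F}(\alpha^\bullet)$ is the zero-morphism.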

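Proof. The second claim follows from the first, since by applying $\mathscr{F}$ to the given
diagram we obtain

$$\begin{array}{ccc}
\mathscr{F}(L^\bullet) & \xrightarrow{\ \mathscr{F}(\alpha^\bullet)\ } & \mathscr{F}(N^\bullet) \\
 & \searrow \quad \nearrow & \\
 & \mathscr{F}(M^\bullet) &
\end{array}$$

and we see that $\mathscr{F}(\alpha^\bullet)$ factors through a zero-object of D.

To verify the first claim, note that since $M^\bullet$ is exact, the zero-morphism
$M^\bullet \to M^\bullet$ is a quasi-isomorphism; hence it is mapped to an invertible morphism by $\mathscr{F}$:

$$\mathscr{F}(M^\bullet) \xrightarrow{\ \mathscr{F}(0)\ } \mathscr{F}(M^\bullet).$$

Since $\mathscr{F}$ is additive, $\mathscr{F}(0) = 0$. It follows that
$\mathrm{id}_{\mathscr{F}(M^\bullet)} = \mathscr{F}(0)^{-1} \circ \mathscr{F}(0) = 0$ and hence that $\mathscr{F}(M^\bullet)$
is a zero-object of D, by Exercise 1.6. □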

In the next several sections we will chase the notion of derived category, aiming 
to understand it rather concretely in particularly favorable circumstances. We will 
take it for granted that these categories exist. A detailed definition of these objects, 
or a treatment of triangulated categories, is beyond the scope of this book. 
4.3. Homotopy. The foregoing considerations put our strategy into focus: we are
after constructions that determine cochain complexes ‘up to quasi-isomorphism’. 
However, quasi-isomorphisms appear hard to deal with directly. Thus, we look for 
more manageable notions that may work as an effective replacement. 

Like the mapping cone, this line of approach also owes its origins to topology: 
in topology, ‘homeomorphism’ is often too harsh a requirement, while ‘homotopy 
equivalence’ is a more malleable but still adequate notion. For example, homotopy 
equivalent topological spaces have the same homology. Distilling the algebra out of 
this notion leads to an analog at the level of complexes. 

To see how this is done, recall that if $f$, $g$ are homotopic continuous functions
between two topological spaces X and Y, then $f$ and $g$ induce the same map on the
homology of the spaces. To remind yourself of how this works, look at the picture:



This is supposed to represent the action of a homotopy between $f$ and $g$ on a chain $a$
in X: $h(a)$ is obtained by mapping $a \times [0, 1]$ to Y so as to get $f(a)$ when restricting
to $a \times \{0\}$ and to get $g(a)$ when restricting to $a \times \{1\}$. Note that $h(a)$ is a chain
of dimension 1 higher than the dimension of $a$, and that the boundary $\partial h(a)$ of
this chain consists of $f(a)$, $g(a)$ and of the restriction of $h$ to the boundary $\partial a$ of $a$:
taking the boundary ‘counterclockwise’,

$$\partial h(a) = g(a) - \delta_+ - f(a) + \delta_- = g(a) - f(a) - h(\partial a)$$

(or so it would seem from the picture! As presented here, this is of course at best a
plausibility argument: it can be made rigorous, but that is someone else’s business).
That is,

$$g(a) - f(a) = \partial h(a) + h(\partial a).$$

Since boundaries vanish in homology, $f(a)$ and $g(a)$ will agree in homology.

Here is the translation into algebra of this pleasant geometric situation: 


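Definition 4.8. A homotopy $h$ between two morphisms of cochain complexes

$$\alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet$$

is a collection of morphisms

$$h^i : L^i \to M^{i-1}$$

such that $\forall i$

$$\beta^i - \alpha^i = d^{i-1}_{M^\bullet} \circ h^i + h^{i+1} \circ d^i_{L^\bullet}.$$

We say that $\alpha^\bullet$ is homotopic to $\beta^\bullet$ and write $\alpha^\bullet \sim \beta^\bullet$ if there is a homotopy between
$\alpha^\bullet$ and $\beta^\bullet$. ⌟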
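
To make Definition 4.8 concrete, here is a small worked instance. It is only an illustrative sketch: the complex $E^\bullet$ and its degree labels are chosen here for the example and do not appear in the text.

```latex
% E: the complex  ... -> 0 -> Z --id--> Z -> 0 -> ...  with E^{-1} = E^{0} = Z (assumed labeling),
% d^{-1} = id_Z and all other differentials 0.
% Claim: the identity of E is homotopic to the zero morphism.
% Take \alpha = 0, \beta = id_E, and the homotopy
\[
h^{0} = \mathrm{id}_{\mathbb{Z}} : E^{0} \to E^{-1}, \qquad h^{i} = 0 \ \text{for } i \neq 0 .
\]
% Check \beta^i - \alpha^i = d^{i-1} \circ h^i + h^{i+1} \circ d^i in the two nonzero degrees:
\begin{align*}
i = 0:  \quad & d^{-1} \circ h^{0} + h^{1} \circ d^{0} = \mathrm{id} \circ \mathrm{id} + 0 = \mathrm{id}_{E^{0}}, \\
i = -1: \quad & d^{-2} \circ h^{-1} + h^{0} \circ d^{-1} = 0 + \mathrm{id} \circ \mathrm{id} = \mathrm{id}_{E^{-1}} .
\end{align*}
```

So $\mathrm{id}_{E^\bullet} \sim 0$, in agreement with the fact that $E^\bullet$ is exact.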
We are dealing here with cochain complexes; this accounts for the fact that
while in the topological situation (where we were interested in chains rather than
cochains) the homotopy would shift dimensions up, in Definition 4.8 it shifts degrees
down.

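Aside from its topological motivation, homotopy is not too easy to visualize.
The following diagram is not assumed to be commutative:

$$\begin{array}{ccccccccc}
\cdots & \to & L^{i-1} & \to & L^{i} & \to & L^{i+1} & \to & \cdots \\
 & & \downarrow & \swarrow{\scriptstyle h^{i}} & \downarrow & \swarrow{\scriptstyle h^{i+1}} & \downarrow & & \\
\cdots & \to & M^{i-1} & \to & M^{i} & \to & M^{i+1} & \to & \cdots
\end{array}$$

The morphisms $h^i$ do not usually define a morphism of complexes $L^\bullet \to M[-1]^\bullet$:
the lozenges in this diagram are not required to commute.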

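Definition 4.9. A morphism $\alpha^\bullet : L^\bullet \to M^\bullet$ is a homotopy equivalence if there is
a morphism $\beta^\bullet : M^\bullet \to L^\bullet$ such that $\alpha^\bullet \circ \beta^\bullet \sim 1_{M^\bullet}$ and $\beta^\bullet \circ \alpha^\bullet \sim 1_{L^\bullet}$. The
complexes $L^\bullet$, $M^\bullet$ are said to be homotopy equivalent if there is a homotopy equivalence
$L^\bullet \to M^\bullet$. ⌟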

The reader should check that ~ is an equivalence relation and that ‘homotopy 
equivalence of complexes’ is also (Exercise 4.4). Further, these relations are clearly 
compatible with simple operations of morphisms (Exercises 4.5 and 4.6). 

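Proposition 4.10. If $\alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet$ are homotopic morphisms of complexes,
then $\alpha^\bullet$, $\beta^\bullet$ induce the same morphisms on cohomology: $H^\bullet(L^\bullet) \to H^\bullet(M^\bullet)$.

Proof. Let $\bar\ell \in H^i(L^\bullet)$. Then $\bar\ell$ is represented by an element $\ell \in \ker(d^i_{L^\bullet})$, and its
images in $H^i(M^\bullet)$ under the morphisms induced by $\alpha^\bullet$, $\beta^\bullet$ are represented by

$$\alpha^i(\ell), \quad \beta^i(\ell).$$

Since $\alpha^\bullet$, $\beta^\bullet$ are homotopic, according to Definition 4.8 there are morphisms $h^i$
such that

$$\beta^i(\ell) - \alpha^i(\ell) = d^{i-1}_{M^\bullet}(h^i(\ell)) + h^{i+1}(d^i_{L^\bullet}(\ell)).$$

Since $\ell \in \ker d^i_{L^\bullet}$, the last term vanishes. This shows that

$$\beta^i(\ell) - \alpha^i(\ell) \in \operatorname{im} d^{i-1}_{M^\bullet},$$

proving that $\beta^i(\ell) - \alpha^i(\ell)$ vanishes in $H^i(M^\bullet)$, as needed. □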

For example, if $\alpha^\bullet$ is homotopic to the identity, then by Corollary 4.2 the cone
of $\alpha^\bullet$ must be exact. It is a good exercise to verify this fact directly (Exercise 4.8).

Corollary 4.11. Homotopy equivalent complexes have isomorphic cohomology. 
Proof. Indeed, morphisms $\alpha^\bullet : L^\bullet \to M^\bullet$, $\beta^\bullet : M^\bullet \to L^\bullet$ such that $\beta^\bullet \circ \alpha^\bullet$ and
$\alpha^\bullet \circ \beta^\bullet$ are both homotopic to the identity induce inverse morphisms in cohomology,
by Proposition 4.10. □
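
Spelled out, the argument combines functoriality of cohomology with Proposition 4.10. The display below is a sketch of that one-line computation; the notation $H^\bullet(\alpha^\bullet)$ for the morphism induced by $\alpha^\bullet$ on cohomology is assumed here.

```latex
% Functoriality of H, then Proposition 4.10 applied to \beta \circ \alpha \sim id_L:
\[
H^\bullet(\beta^\bullet) \circ H^\bullet(\alpha^\bullet)
  = H^\bullet(\beta^\bullet \circ \alpha^\bullet)
  = H^\bullet(\mathrm{id}_{L^\bullet})
  = \mathrm{id}_{H^\bullet(L^\bullet)} ,
\]
% and symmetrically, from \alpha \circ \beta \sim id_M:
\[
H^\bullet(\alpha^\bullet) \circ H^\bullet(\beta^\bullet)
  = H^\bullet(\alpha^\bullet \circ \beta^\bullet)
  = H^\bullet(\mathrm{id}_{M^\bullet})
  = \mathrm{id}_{H^\bullet(M^\bullet)} .
\]
```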

Remark 4.12. In other words, homotopy equivalences of complexes are quasi-isomorphisms.
This does not yet solve the ‘problem’ that quasi-isomorphisms are
not invertible, since Example 4.6 shows that quasi-isomorphisms may well not be
invertible even up to homotopy. In other words, ‘homotopy equivalence’ is a more
restrictive notion than ‘quasi-isomorphism’. One reason to study complexes of
injective/projective modules, as we will do in §5.3, is precisely that quasi-isomorphism
and homotopy equivalence are equivalent notions for (bounded) complexes of
injective or projective modules. Since homotopy equivalence is a ready-made
equivalence relation on complexes, this will bypass any technicality needed to make sense
of ‘quasi-isomorphism’ as an equivalence relation. ⌟

Corollary 4.11 is the main technical reason that makes the strategy presented
in §3.1 work, in a class of interesting examples. As we will see in due time, in
these examples the cohomology of the relevant cochain complexes will end up being
independent of the choices, because these choices will produce homotopy equivalent complexes.

Another key observation will be that homotopy equivalence is preserved by any
additive functor. Suppose A, B are abelian categories and

$$\mathscr{F} : A \to B$$

is an additive functor (Definition 1.1). Then $\mathscr{F}$ induces an additive functor
preserving the grading

$$C(\mathscr{F}) : C(A) \to C(B).$$

Lemma 4.13. With $\mathscr{F}$ as above, if $\alpha^\bullet \sim \beta^\bullet$ in C(A), then $C(\mathscr{F})(\alpha^\bullet) \sim C(\mathscr{F})(\beta^\bullet)$
in C(B), and if $L^\bullet$ and $M^\bullet$ are homotopy equivalent complexes in C(A), then
$C(\mathscr{F})(L^\bullet)$ and $C(\mathscr{F})(M^\bullet)$ are homotopy equivalent complexes in C(B).

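Proof. The second assertion follows from the first. The first is an immediate
consequence of the fact that $\mathscr{F}$ is additive. Indeed, if $h$ is a homotopy between
$\alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet$, then

$$\beta^i - \alpha^i = d^{i-1}_{M^\bullet} \circ h^i + h^{i+1} \circ d^i_{L^\bullet}.$$

Since $\mathscr{F}$ preserves the additive structure on Hom-sets, this implies

$$\mathscr{F}(\beta^i) - \mathscr{F}(\alpha^i) = \mathscr{F}(d^{i-1}_{M^\bullet}) \circ \mathscr{F}(h^i) + \mathscr{F}(h^{i+1}) \circ \mathscr{F}(d^i_{L^\bullet}),$$

showing that the collection of morphisms $\mathscr{F}(h^i)$ gives a homotopy between $C(\mathscr{F})(\alpha^\bullet)$
and $C(\mathscr{F})(\beta^\bullet)$. □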

Note that the analogous statement does not hold for quasi-isomorphisms
(Exercise 4.15): an additive functor need not preserve quasi-isomorphisms (while an
exact functor does).

Lemma 4.13 implies that the conclusion of Corollary 4.11 holds true after
applying an additive functor:
Theorem 4.14. Let $\mathscr{F} : A \to B$ be an additive functor between two abelian
categories. If $L^\bullet$, $M^\bullet$ are homotopy equivalent complexes in C(A), then the cohomology
complexes

$$H^\bullet(C(\mathscr{F})(L^\bullet)), \quad H^\bullet(C(\mathscr{F})(M^\bullet))$$

are isomorphic.


The proof of this statement is essentially immediate after all our preparatory 
work, so it is left to the reader (Exercise 4.16). 

We are gracing this statement with theorem status because it is at the root of 
the strategy we are pursuing. The content of Theorem 4.14 is that any mechanism 
associating to a mathematical object a cochain complex determined up to homotopy 
will give rise to a slew of interesting invariants: apply your favorite additive functor 
to any such complex, take cohomology, and Theorem 4.14 guarantees that the 
result will be independent of the specific chosen complex. This is (part of) the 
reason underlying the various claims of independence on choices made in §VIII.2.4 
and §VIII.6.4, concerning the definition of the Tor and Ext functors, as we will see 
later in the chapter (§7, especially Examples 7.6 and 7.7). 


Exercises 


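4.1. ▷ Verify that the cone of a morphism of complexes is a complex. [§4.1]

4.2. Let $L^\bullet = \cdots \to 0 \to L \to 0 \to \cdots$ and $M^\bullet = \cdots \to 0 \to M \to 0 \to \cdots$ be two
complexes concentrated in degree 0. Giving a morphism $\alpha^\bullet : L^\bullet \to M^\bullet$ is then the
same as giving a morphism $\alpha : L \to M$. Describe the mapping cone in this case
and its cohomology.

4.3. Let $\alpha^\bullet : L^\bullet \to M^\bullet$ be a morphism of bounded complexes. Note that the mapping
cone $MC(\alpha)^\bullet$ of $\alpha$ is also bounded; thus, the three complexes $L^\bullet$, $M^\bullet$, $MC(\alpha)^\bullet$
have a well-defined universal Euler characteristic (see Exercise 3.15). Prove that
$\chi(MC(\alpha)^\bullet) = \chi(M^\bullet) - \chi(L^\bullet)$.

4.4. ▷ Prove that the homotopy relation between morphisms of complexes and
the relation of homotopy equivalence between complexes are equivalence relations.
[§4.3, §6]

4.5. ▷ Let $\alpha_k^\bullet, \beta_k^\bullet : L_k^\bullet \to M_k^\bullet$, $k = 0, 1$, be morphisms of cochain complexes.
Assume that $\alpha_0^\bullet \sim \beta_0^\bullet$, $\alpha_1^\bullet \sim \beta_1^\bullet$. Prove that $\alpha_0^\bullet \oplus \alpha_1^\bullet \sim \beta_0^\bullet \oplus \beta_1^\bullet$ as morphisms
$L_0^\bullet \oplus L_1^\bullet \to M_0^\bullet \oplus M_1^\bullet$. [§4.3]

4.6. ▷ Let $\alpha^\bullet, \alpha_0^\bullet, \alpha_1^\bullet : L^\bullet \to M^\bullet$ and $\beta^\bullet, \beta_0^\bullet, \beta_1^\bullet : M^\bullet \to N^\bullet$ be morphisms of
cochain complexes.

(i) Assume that $\alpha^\bullet \sim 0$ or $\beta^\bullet \sim 0$. Prove that $\beta^\bullet \circ \alpha^\bullet \sim 0$.

(ii) Assume that $\alpha_0^\bullet \sim \alpha_1^\bullet$ and $\beta_0^\bullet \sim \beta_1^\bullet$. Prove that $\beta_0^\bullet \circ \alpha_0^\bullet \sim \beta_1^\bullet \circ \alpha_1^\bullet$.

(Hint: Use (i), while thinking about ideals.)

[§4.3, §5.2, §5.4, §5.5]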







4.7. Prove that the equivalence class of morphisms of complexes $L^\bullet \to M^\bullet$
homotopic to a given morphism is parametrized by the set of collections of morphisms
$h^i : L^i \to M^{i-1}$, modulo the morphisms of complexes $L[1]^\bullet \to M^\bullet$.

4.8. ▷ Let $\alpha^\bullet$ be a morphism of complexes, homotopic to the identity. Prove
directly (without appealing to Corollary 4.2) that the mapping cone of $\alpha^\bullet$ is exact.
[§4.3]

4.9. ¬ Let $\alpha^\bullet : L^\bullet \to M^\bullet$ be a morphism of cochain complexes. The mapping
cylinder $MCyl(\alpha)^\bullet$ is the cochain complex with objects $L^i \oplus L^{i+1} \oplus M^i$ in degree $i$
and differential $d^i_{MCyl(\alpha)^\bullet}$ defined by

$$d^i_{MCyl(\alpha)^\bullet}(\ell, \ell', m) = \bigl(d^i_{L^\bullet}(\ell) + \ell',\ -d^{i+1}_{L^\bullet}(\ell'),\ \alpha^{i+1}(\ell') + d^i_{M^\bullet}(m)\bigr).$$

Verify that $d^{i+1}_{MCyl(\alpha)^\bullet} \circ d^i_{MCyl(\alpha)^\bullet} = 0$ and hence that $MCyl(\alpha)^\bullet$ is indeed a cochain
complex. [4.10, 4.11, 4.12, 4.13]

4.10. Topologically speaking, the mapping cylinder of a continuous map $f : X \to Y$
is obtained by considering $X \times [0, 1]$ and gluing $X \times 1$ to $Y$ by means of $f$; studying
(co)chains of this object leads to the algebraic mapping cylinder of Exercise 4.9.

As we learn in topology, two continuous maps $X \to Y$ are homotopic if they
may be realized as the restrictions to $X \times 0$, $X \times 1$ of a continuous map from the
cylinder $X \times [0, 1]$ to $Y$. Note that this cylinder is the mapping cylinder of $\mathrm{id}_X$.

Prove that two cochain maps $\alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet$ are homotopic in the sense
of Definition 4.8 if and only if they extend to a cochain morphism

$$MCyl(\mathrm{id}_{L^\bullet})^\bullet \to M^\bullet.$$

4.11. ¬ In topology, the mapping cone may be obtained by contracting $X \times 0$ to a
point in the mapping cylinder (cf. Exercise 4.9 and the description of the mapping
cone given in §4.1).

In the algebraic version, this contraction amounts to mod-ing out the first
component of $MCyl(\alpha)^\bullet$. For a cochain morphism $\alpha^\bullet : L^\bullet \to M^\bullet$, prove that there
is an exact sequence of cochain complexes

$$0 \to L^\bullet \to MCyl(\alpha)^\bullet \to MC(\alpha)^\bullet \to 0.$$

[4.14]

4.12. ¬ With notation as in Exercise 4.9 (in particular, for a cochain morphism
$\alpha^\bullet : L^\bullet \to M^\bullet$), prove that there is an exact sequence of cochain complexes

$$0 \to M^\bullet \to MCyl(\alpha)^\bullet \to MC(\mathrm{id}_{L^\bullet})^\bullet \to 0.$$

Deduce that there is a quasi-isomorphism $M^\bullet \to MCyl(\alpha)^\bullet$. [4.13, 4.14]

4.13. ¬ Still with notation as in Exercise 4.9, define maps $\rho^\bullet : M^\bullet \to MCyl(\alpha)^\bullet$
and $\sigma^\bullet : MCyl(\alpha)^\bullet \to M^\bullet$ by

$$\rho^i(m) = (0, 0, m),$$
$$\sigma^i(\ell, \ell', m) = m - \alpha^i(\ell).$$

Prove that $\rho^\bullet$, $\sigma^\bullet$ are both cochain morphisms. Note that $\sigma^\bullet \rho^\bullet = \mathrm{id}_{M^\bullet}$, and prove
that $\rho^\bullet \sigma^\bullet$ is homotopic to $\mathrm{id}_{MCyl(\alpha)^\bullet}$. (Hint: $(\ell, \ell', m) \mapsto (0, \ell, 0)$.)
Conclude that $M^\bullet$ and $MCyl(\alpha)^\bullet$ are homotopy equivalent. This strengthens
the conclusion of Exercise 4.12. [4.14]

4.14. Combine Exercises 4.11, 4.12, and 4.13 to conclude that, up to homotopy
equivalence, a cochain morphism $\alpha^\bullet : L^\bullet \to M^\bullet$ may be replaced with a
monomorphism $L^\bullet \to MCyl(\alpha)^\bullet$ that induces the same morphism $H^\bullet(\alpha^\bullet)$ in cohomology
(up to the identification $H^\bullet(M^\bullet) = H^\bullet(MCyl(\alpha)^\bullet)$) and whose cokernel is the
mapping cone $MC(\alpha)^\bullet$.

4.15. ▷ Prove that, in general, additive functors do not preserve quasi-isomorphisms.
(Hint: Example 4.6.) Prove that exact functors do. (Hint: Use the
mapping cone.) [§4.3, §6]

4.16. ▷ Prove Theorem 4.14. [§4.3]


5. The homotopic category. Complexes of
projectives and injectives

One message we have tried to convey in §4 is that while quasi-isomorphisms are
(by definition!) the morphisms of cochain complexes preserving cohomology and
therefore may be deemed to ‘capture the cohomology of a complex’, the stronger
notion of homotopy equivalence is in fact more natural from the point of view of 
applications. This is the moral of Theorem 4.14: homotopy equivalent complexes 
have the same cohomology for a much better reason than complexes linked by a 
quasi-isomorphism. If our general aim is to understand what it means to make 
all quasi-isomorphisms invertible (that is, understand the derived category D(A)), 
we may begin by making homotopy equivalences invertible. This produces a new 
category, the ‘homotopic category’ of complexes, that we approach in this section. 

We also examine the privileged position of bounded complexes of projective and 
injective objects regarding homotopy: for these complexes, quasi-isomorphisms are 
necessarily homotopy equivalences. Verifying this requires rather technical consid- 
erations, but it will be necessary in order to gain some understanding of the derived 
category, in §6. 

5.1. Homotopic maps are identified in the derived category. To see that
this ‘homotopic category’ is a necessary stop on the way from C(A) to D(A), go back
to our higher-brow considerations at the end of §4.2. Again consider any additive
functor

$$\mathscr{F} : C(A) \to D$$

transforming quasi-isomorphisms into isomorphisms. The simplest such functor is
cohomology; the universal one is the functor from C(A) to the mystifying derived
category D(A). We have verified in Proposition 4.10 that homotopic maps induce
the same morphism in cohomology. Might this not be the case for any functor $\mathscr{F}$
as above? It is!

Lemma 5.1. Let $\mathscr{F} : C(A) \to D$ be an additive functor such that $\mathscr{F}(\rho^\bullet)$ is an
isomorphism in D for all quasi-isomorphisms $\rho^\bullet$ in C(A).
Let $\alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet$ be homotopic morphisms in C(A). Then $\mathscr{F}(\alpha^\bullet) = \mathscr{F}(\beta^\bullet)$
in D.

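Proof. Equivalently (as $\mathscr{F}$ is additive), we verify that if $\alpha^\bullet \sim 0$, then $\mathscr{F}(\alpha^\bullet) = 0$.
By Lemma 4.7, it suffices to show that $\alpha^\bullet$ factors through an exact complex.

This exact complex is the mapping cone $MC(\mathrm{id}_{L^\bullet})^\bullet$ of the identity
$\mathrm{id}_{L^\bullet} : L^\bullet \to L^\bullet$. Recall (§4.1) that

$$MC(\mathrm{id}_{L^\bullet})^i = L^{i+1} \oplus L^i,$$

with differential defined by

$$d^i_{MC(\mathrm{id}_{L^\bullet})}(a, b) = \bigl(-d^{i+1}_{L^\bullet}(a),\ a + d^i_{L^\bullet}(b)\bigr).$$

Since the identity is trivially a quasi-isomorphism, $MC(\mathrm{id}_{L^\bullet})^\bullet$ is an exact complex
(Corollary 4.2). The morphism

$$L^i \to MC(\mathrm{id}_{L^\bullet})^i, \quad b \mapsto (0, b)$$

always commutes with differentials, so it defines a morphism of complexes
$L^\bullet \to MC(\mathrm{id}_{L^\bullet})^\bullet$. By contrast, a morphism

$$MC(\mathrm{id}_{L^\bullet})^\bullet \to M^\bullet$$

is not always available. Under our hypothesis, however, there is a homotopy $\alpha^\bullet \sim 0$,
that is, morphisms

$$h^i : L^i \to M^{i-1}$$

such that $\alpha^i = d^{i-1}_{M^\bullet} \circ h^i + h^{i+1} \circ d^i_{L^\bullet}$. Define morphisms

$$\pi^i : MC(\mathrm{id}_{L^\bullet})^i \to M^i \quad \text{by} \quad (a, b) \mapsto h^{i+1}(a) + \alpha^i(b).$$

It is clear that the composition

$$L^i \to MC(\mathrm{id}_{L^\bullet})^i \to M^i$$

is $\alpha^i$. All we have to check now is that the collection $\pi^i$ defines a morphism of
complexes: this will show that $\alpha^\bullet$ factors through the exact complex $MC(\mathrm{id}_{L^\bullet})^\bullet$,
hence that $\mathscr{F}(\alpha^\bullet) = 0$ (by Lemma 4.7), concluding the proof of this lemma. That
is, we have to verify that

(*)  $d^{i-1}_{M^\bullet} \circ \pi^{i-1} = \pi^i \circ d^{i-1}_{MC(\mathrm{id}_{L^\bullet})}$

for all $i$. The left-hand side of (*) acts as follows:

$$(a, b) \mapsto h^i(a) + \alpha^{i-1}(b) \mapsto d^{i-1}_{M^\bullet}\bigl(h^i(a) + \alpha^{i-1}(b)\bigr).$$

The right-hand side acts as

$$(a, b) \mapsto \bigl(-d^i_{L^\bullet}(a),\ a + d^{i-1}_{L^\bullet}(b)\bigr) \mapsto -h^{i+1}(d^i_{L^\bullet}(a)) + \alpha^i\bigl(a + d^{i-1}_{L^\bullet}(b)\bigr).$$

Thus the needed equality is

$$d^{i-1}_{M^\bullet}(h^i(a)) + d^{i-1}_{M^\bullet}(\alpha^{i-1}(b)) = -h^{i+1}(d^i_{L^\bullet}(a)) + \alpha^i(a) + \alpha^i(d^{i-1}_{L^\bullet}(b))$$

or, equivalently,

$$\bigl(d^{i-1}_{M^\bullet} \circ h^i + h^{i+1} \circ d^i_{L^\bullet} - \alpha^i\bigr)(a) = \bigl(\alpha^i \circ d^{i-1}_{L^\bullet} - d^{i-1}_{M^\bullet} \circ \alpha^{i-1}\bigr)(b)$$

$\forall a \in L^i$, $\forall b \in L^{i-1}$ (the two components of $MC(\mathrm{id}_{L^\bullet})^{i-1}$). But both sides are 0: the
right-hand side because $\alpha^\bullet$ is a morphism of complexes and the left-hand side because
$h$ is a homotopy $\alpha^\bullet \sim 0$. □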
Lemma 5.1 tells us that every functor transforming quasi-isomorphisms into
isomorphisms must factor through the category obtained from C(A) by identifying 
together homotopic morphisms. It is time to define this category. 

5.2. Definition of the homotopic category of complexes. 

Definition 5.2. Let A be an abelian category. The homotopic category K(A) of
cochain complexes in A is the category whose objects are the cochain complexes in
A (that is, the same objects of C(A)) and whose morphisms are

$$\mathrm{Hom}_{K(A)}(L^\bullet, M^\bullet) := \mathrm{Hom}_{C(A)}(L^\bullet, M^\bullet) / \sim$$

where $\sim$ is the homotopy relation. ⌟

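Yes, but is this a category? Recall that the homotopy relation $\sim$ respects
composition (Exercise 4.6); in other words, the operation of composition

$$\mathrm{Hom}_{C(A)}(L^\bullet, M^\bullet) \times \mathrm{Hom}_{C(A)}(M^\bullet, N^\bullet) \to \mathrm{Hom}_{C(A)}(L^\bullet, N^\bullet)$$

does descend to an operation

$$\mathrm{Hom}_{K(A)}(L^\bullet, M^\bullet) \times \mathrm{Hom}_{K(A)}(M^\bullet, N^\bullet) \to \mathrm{Hom}_{K(A)}(L^\bullet, N^\bullet).$$

Checking the category axioms is routine and is left to the patient reader.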

Bounded variations $K^-(A)$, $K^+(A)$, etc., are obtained likewise from $C^-(A)$,
$C^+(A)$, etc. The considerations which follow will apply to all these versions. The
further variations $K^-(P)$, resp., $K^+(I)$, in which the complexes will be required
to consist of projective, resp., injective, objects from A (cf. §5.3) will also play a
crucially important role.

As to the general structure underlying the homotopic category, 

Lemma 5.3. Let A be an abelian category. Then the homotopic category K(A) of 
complexes is an additive category. 

Proof. Exercise 5.1. □ 

However, note that, in general, the homotopic category is not abelian. Indeed, 
homotopic maps do not have the same kernel or cokernel in general, so defining 
these notions becomes problematic. As we already mentioned in §3.4, K(A) is a 
triangulated category; the ‘distinguished triangles’ are the triangles arising from 
the cones of morphisms, as in §4.1. (We will briefly describe the situation in §9.2.) 

The main motivation for the introduction of the homotopic category is that
homotopy equivalences become isomorphisms in K(A). More precisely, by definition
there is a functor

$$C(A) \to K(A),$$

mapping every object to ‘itself’ and every morphism to its homotopy class.
Homotopy equivalences in C(A) become isomorphisms in K(A) because the relation
$\alpha^\bullet \circ \beta^\bullet \sim \mathrm{id}$ in C(A) becomes $\alpha^\bullet \circ \beta^\bullet = \mathrm{id}$ in K(A). The homotopic category is
obtained from C(A) by making all homotopy equivalences invertible on the nose.
We can then reinterpret Lemma 5.1 as the following ‘factorization’ result:
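Proposition 5.4. Let $\mathscr{F} : C(A) \to D$ be an additive functor such that $\mathscr{F}(\rho^\bullet)$
is an isomorphism in D for all quasi-isomorphisms $\rho^\bullet$ in C(A). Then $\mathscr{F}$ factors
uniquely through K(A):

$$\begin{array}{ccc}
C(A) & \xrightarrow{\ \mathscr{F}\ } & D \\
\downarrow & \nearrow{\scriptstyle \exists!} & \\
K(A) & &
\end{array}$$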


Proof. Exercise 5.2. □ 

In particular, there must be a unique functor from K(A) to the subtler derived
category D(A). The situation is of course analogous for all bounded versions of
these objects: the natural functors from the appropriately bounded complexes to
the corresponding derived categories factor uniquely through the corresponding
homotopic categories:

$$C^-(A) \to K^-(A) \to D^-(A), \qquad C^+(A) \to K^+(A) \to D^+(A).$$


A particular case of Proposition 5.4 is the fact that the cohomology functor on C(A)
(resp., $C^-(A)$, $C^+(A)$) descends to a functor on K(A) (resp., $K^-(A)$, $K^+(A)$). This
fact also follows directly from Proposition 4.10: homotopic maps induce the same
morphism in cohomology.

5.3. Complexes of projective and injective objects. We should now ask ‘how 
far’ K(A) may be from D(A). We have inverted all homotopy equivalences in K(A), 
but not all quasi-isomorphisms are homotopy equivalences: keep Example 4.6 in 
mind. If we want to think of K(A) as a first approximation to D(A), we should 
focus on complexes for which there is essentially no distinction between quasi- 
isomorphisms and homotopy equivalences. 

It turns out that there are such complexes: those consisting of projective or
injective objects.

Recall (Definition VIII.6.1) that an R-module M is ‘projective’ if the functor
$\mathrm{Hom}_R(M, \_)$ is exact and it is ‘injective’ if $\mathrm{Hom}_R(\_, M)$ is exact. We can adopt
these definitions in any abelian category:

Definition 5.5. Let A be an abelian category. An object P of A is projective if the
functor $\mathrm{Hom}_A(P, \_)$ is exact. An object Q is injective if the functor $\mathrm{Hom}_A(\_, Q)$ is
exact. ⌟

General remarks such as Lemma VIII.6.2 or the comments about splitting of
sequences at the end of §VIII.6.1 hold in any abelian category, and a good exercise
for the reader is to collect all such information and verify that it does hold in this
new context.

Also note that, at this level of generality, the parts of the theory dealing with
injective objects are a faithful mirror of the parts dealing with projective objects.
This is of course not a coincidence: the opposite of an abelian category is again
abelian (Exercise 1.10), and projectives in one become injectives in the other. Thus,
it is really only necessary to prove statements for, say, projectives in arbitrary
abelian categories: the corresponding statements for injectives will automatically
follow.

This dual state of affairs should not be taken too far: projectives and injectives
in a fixed abelian category may display very different features. For example, the
characterizations worked out in §VIII.6.2 and §VIII.6.3 use the specific properties of
the categories R-Mod, so they should not be expected to have a simple counterpart
in arbitrary abelian categories.

We will denote by P, resp., I, the full subcategories of A determined by the
projective, resp., injective, objects. Of course these are not abelian categories in
any interesting case. Note that an abelian category may well have no nontrivial
projective or injective objects.

Example 5.6. The category of finite abelian groups is abelian (surprise, surprise),
but contains no nontrivial projective or injective objects (Exercise 5.5). ⌟

Definition 5.7. An abelian category A has enough projectives if for every object A
in A there exists a projective object P in A and an epimorphism $P \twoheadrightarrow A$. The
category has enough injectives if for every object A in A there is an injective object Q
in A and a monomorphism $A \rightarrowtail Q$. ⌟

These definitions are not new to the reader, since we ran across them in §VIII.6:
in particular, we have already observed that, for every commutative ring R, R-Mod
has enough projectives (this is not challenging: free modules and their direct
summands are projective, Proposition VIII.6.4) and enough injectives (this is
challenging; we verified it in Corollary VIII.6.12).

Example 5.6 shows that we should not expect an abelian category A to have
either property. An important case in which one can show that there are enough
injectives is the category of sheaves of abelian groups over a topological space;
this is a key step in the definition of sheaf cohomology as a ‘derived functor’. In
general, categories of sheaves do not have enough projectives. On the other hand,
the category of finitely generated abelian groups has enough projectives but not
enough (indeed, no nontrivial) injectives. So it goes.
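
For orientation, here is what Definition 5.7 asks for in one concrete case. This is a sketch in the category Ab of all abelian groups, with the object $\mathbb{Z}/n\mathbb{Z}$ chosen here purely for illustration; it relies on the standard fact that divisible abelian groups such as $\mathbb{Q}/\mathbb{Z}$ are injective in Ab.

```latex
% Object: A = Z/nZ in Ab, with n >= 1 (illustrative choice).
% Enough projectives: Z is free, hence projective, and the natural projection is an epimorphism.
\[
\mathbb{Z} \twoheadrightarrow \mathbb{Z}/n\mathbb{Z}, \qquad 1 \mapsto 1 + n\mathbb{Z} .
\]
% Enough injectives: Q/Z is divisible, hence injective in Ab, and the map below is a monomorphism.
\[
\mathbb{Z}/n\mathbb{Z} \rightarrowtail \mathbb{Q}/\mathbb{Z}, \qquad 1 + n\mathbb{Z} \mapsto \tfrac{1}{n} + \mathbb{Z} .
\]
```

Note that $\mathbb{Q}/\mathbb{Z}$ is not finitely generated, so this embedding is not available in the category of finitely generated abelian groups.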

5.4. Homotopy equivalences vs. quasi-isomorphisms in K(A). As we al- 
ready mentioned, we will be interested in certain subcategories of K(A) determined 
by complexes of projective or injective objects of A. 

Definition 5.8. We will denote by $K^-(P)$ the full subcategory of K(A) consisting of
bounded-above complexes of projective objects of A, and we will denote by $K^+(I)$
the full subcategory of K(A) consisting of bounded-below complexes of injective
objects of A. ⌟

The main result of the rest of this section is the following:

Theorem 5.9. Let A be an abelian category, and let $\alpha^\bullet : P_0^\bullet \to P_1^\bullet$, resp.,
$\alpha^\bullet : Q_0^\bullet \to Q_1^\bullet$, be a quasi-isomorphism between bounded-above complexes of projectives,
resp., bounded-below complexes of injectives, in A. Then $\alpha^\bullet$ is a homotopy
equivalence.

Corollary 5.10. In $K^-(P)$ and $K^+(I)$, (homotopy classes of) quasi-isomorphisms
are isomorphisms.

That is, all quasi-isomorphisms between bounded complexes of projectives or
injectives are ‘already’ inverted in $K^-(P)$, $K^+(I)$. Thus, these categories are very
close to the corresponding derived categories. In fact, we will see that if A has
enough projectives, then $K^-(P)$ ‘suffices’ to describe $D^-(A)$ (in a sense that will
be made more precise); similarly, $K^+(I)$ acts as a concrete realization of $D^+(A)$ if
A has enough injectives.

The proof of Theorem 5.9 will require a few preliminaries, which further clarify 
the role played by complexes of projective and injective objects with regard to 
homotopies. 

A complicated way of saying that a complex $N^\bullet$ in C(A) is exact is to assert
that the identity map $\mathrm{id}_{N^\bullet}$ and the trivial map 0 induce the same morphism in
cohomology, as they would if they were homotopic to each other. It is however easy to
construct examples of exact complexes for which the identity is not homotopic to 0.
As the reader will verify (Exercise 5.11), this has to do with whether the complex
splits or not; in general, a complex $N^\bullet$ is said to be ‘split exact’ if $\mathrm{id}_{N^\bullet}$ is
homotopic to 0. Keeping in mind that projective or injective objects ‘cause’ sequences
to split, it may not be too surprising that for bounded exact complexes consisting
of projective or injective objects the identity does turn out to be homotopic to 0.
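
One such example, offered as a sketch: the complex below and its degree labels are chosen here for illustration and are not taken from the text.

```latex
% N: the exact complex in C(Ab)
%   0 -> Z/2Z --(.2)--> Z/4Z --(mod 2)--> Z/2Z -> 0,
% placed in degrees 0, 1, 2 (assumed labeling), so d^0 = (.2) and d^1 = reduction mod 2.
% Suppose id_N ~ 0 via morphisms h^i : N^i -> N^{i-1}. In degree 2, since N^3 = 0:
\[
\mathrm{id}_{N^2} = d^{1} \circ h^{2} + h^{3} \circ d^{2} = d^{1} \circ h^{2},
\qquad h^{2} : \mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z} .
\]
% Every homomorphism Z/2Z -> Z/4Z has image inside {0, 2} = \ker d^1, so
\[
d^{1} \circ h^{2} = 0 \neq \mathrm{id}_{\mathbb{Z}/2\mathbb{Z}} ,
\]
% a contradiction: the identity of N is not homotopic to 0, although N is exact.
```

The obstruction is exactly that the surjection $\mathbb{Z}/4\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ has no section, that is, the sequence does not split.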

To verify this, we study special conditions that ensure that morphisms are 
homotopic to 0. 

Lemma 5.11. Let $P^\bullet$ be a complex of projective objects of an abelian category A
such that $P^i = 0$ for $i > 0$, and let $L^\bullet$ be a complex in C(A) such that $H^i(L^\bullet) = 0$
for $i < 0$.

Let $\alpha^\bullet : P^\bullet \to L^\bullet$ be a morphism inducing the zero-morphism in cohomology.
Then $\alpha^\bullet$ is homotopic to 0.

The reader will provide the ‘injective’ version of this statement, dealing with
morphisms from a complex which is exact in positive degree to a complex of
injectives $Q^\bullet$ with $Q^i = 0$ for $i < 0$.

Proof. We have to construct morphisms A’ : P' -> L‘ _1 : 
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such that 

(*) a‘ =d i L - l oh‘+h i+l od i P .. 

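Of course $h^i = 0$ necessarily for $i > 0$. For $i = 0$, use the fact that the morphism
induced by $\alpha^\bullet$ on cohomology is 0; this says that $\alpha^0$ factors through the image
of $d^{-1}_{L^\bullet}$:

$$
\begin{array}{ccccc}
 & & P^0 & & \\
 & & \big\downarrow{\scriptstyle\alpha^0} & & \\
L^{-1} & \xrightarrow{\;d^{-1}_{L^\bullet}\;} & \operatorname{im} d^{-1}_{L^\bullet} & \longrightarrow & 0
\end{array}
$$

Since $P^0$ is projective, there exists a morphism $h^0 : P^0 \to L^{-1}$ such that
$\alpha^0 = d^{-1}_{L^\bullet} \circ h^0$, as needed.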

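To define $h^{i-1}$ for $i \le 0$, proceed inductively and assume that $h^i$ and $h^{i+1}$
satisfying (*) have already been constructed. Note that by (*) (and the fact that
$P^\bullet$ is a complex)

$$\alpha^i \circ d^{i-1}_{P^\bullet} = d^{i-1}_{L^\bullet} \circ h^i \circ d^{i-1}_{P^\bullet} + h^{i+1} \circ d^i_{P^\bullet} \circ d^{i-1}_{P^\bullet} = d^{i-1}_{L^\bullet} \circ h^i \circ d^{i-1}_{P^\bullet}.$$

Therefore

$$d^{i-1}_{L^\bullet} \circ (\alpha^{i-1} - h^i \circ d^{i-1}_{P^\bullet}) = d^{i-1}_{L^\bullet} \circ \alpha^{i-1} - \alpha^i \circ d^{i-1}_{P^\bullet} = 0,$$

since $\alpha^\bullet$ is a morphism of cochain complexes. This tells us that $\alpha^{i-1} - h^i \circ d^{i-1}_{P^\bullet}$
factors through $\ker d^{i-1}_{L^\bullet}$. Since $L^\bullet$ is exact at $L^{i-1}$ for $i \le 0$, $\ker d^{i-1}_{L^\bullet} = \operatorname{im} d^{i-2}_{L^\bullet}$.
Therefore, we again have a factorization

$$
\begin{array}{ccccc}
 & & P^{i-1} & & \\
 & & \big\downarrow{\scriptstyle \alpha^{i-1} - h^i \circ d^{i-1}_{P^\bullet}} & & \\
L^{i-2} & \xrightarrow{\;d^{i-2}_{L^\bullet}\;} & \operatorname{im} d^{i-2}_{L^\bullet} & \longrightarrow & 0
\end{array}
$$

Now $P^{i-1}$ is projective; therefore there exists $h^{i-1} : P^{i-1} \to L^{i-2}$ such that

$$\alpha^{i-1} - h^i \circ d^{i-1}_{P^\bullet} = d^{i-2}_{L^\bullet} \circ h^{i-1},$$

which is precisely what we need for the induction step. □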


Corollary 5.12. Let $P^\bullet$ be a bounded-above cochain complex of projectives of an
abelian category A, and let $L^\bullet$ be an exact complex in $C(A)$.

Then every morphism of complexes $P^\bullet \to L^\bullet$ is homotopic to 0.


This follows immediately from (a harmless shift of) Lemma 5.11, since every 
morphism to an exact complex has no choice but to induce the zero-morphism in 
cohomology. 
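
The ‘harmless shift’ can be spelled out as follows. This is a sketch under the
assumption that $L[n]^i = L^{i+n}$ with differential $(-1)^n d^{i+n}_{L^\bullet}$ (the sign convention
should be checked against §4.1), where $n$ is any degree above which $P^\bullet$ vanishes and
$k^i$ is a name introduced here for the homotopy of the shifted morphism.

```latex
% Assumption: P^i = 0 for i > n. Shift both complexes by n.
\[
P[n]^i = P^{i+n}, \qquad L[n]^i = L^{i+n}, \qquad
d^{\,i}_{P[n]^\bullet} = (-1)^n d^{\,i+n}_{P^\bullet}, \quad
d^{\,i}_{L[n]^\bullet} = (-1)^n d^{\,i+n}_{L^\bullet}.
\]
% Now P[n]^i = 0 for i > 0 and L[n] is exact, so Lemma 5.11 gives
% k^i : P[n]^i \to L[n]^{i-1} with
\[
\alpha^{i+n} = d^{\,i-1}_{L[n]^\bullet}\circ k^i + k^{i+1}\circ d^{\,i}_{P[n]^\bullet}
= (-1)^n\bigl(d^{\,i+n-1}_{L^\bullet}\circ k^i + k^{i+1}\circ d^{\,i+n}_{P^\bullet}\bigr).
\]
% Setting h^{j} := (-1)^n k^{j-n} : P^j \to L^{j-1} gives a homotopy for the unshifted morphism:
\[
\alpha^{j} = d^{\,j-1}_{L^\bullet}\circ h^{j} + h^{j+1}\circ d^{\,j}_{P^\bullet}.
\]
```

The sign $(-1)^n$ is absorbed into the homotopy, which is why the shift costs nothing.
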
The ‘injective’ version of Corollary 5.12 is that morphisms from an exact complex
$L^\bullet$ to a bounded-below complex $Q^\bullet$ of injectives are necessarily homotopic
to 0.

Note that in all these considerations the complexes $P^\bullet$, $Q^\bullet$ are required to
live in the ‘bounded’ versions $C^-(P)$, $C^+(I)$: all objects should vanish in degree
sufficiently high or low. This furnishes the ‘start’ of the inductive construction of
the homotopy in the proof of Lemma 5.11. It is an essentially inescapable feature
of the theory.

Corollary 5.13. Let $P^\bullet$ (resp., $Q^\bullet$) be a bounded-above exact complex of
projectives (resp., a bounded-below exact complex of injectives). Then $P^\bullet$ (resp., $Q^\bullet$) is
homotopy equivalent to the zero-complex.

Proof. Exercise 5.12. □


Corollary 5.13 is the first manifestation of the principle captured more fully 
by Theorem 5.9: we have just verified that, for suitably bounded complexes of 
projectives or injectives, ‘quasi-isomorphic to 0’ is the same as ‘homotopy equivalent
to 0’.

Another consequence of Lemma 5.11 is the following remark, showing that 
quasi-isomorphisms are ‘non-zero-divisors’ up to homotopy, with respect to mor- 
phisms from complexes of projectives. This will be useful in the next section, for 
example to show that certain lifts are uniquely defined up to homotopy. 

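Lemma 5.14. Let A be an abelian category, and let $\rho^\bullet : L^\bullet \to M^\bullet$ be a
quasi-isomorphism in $C(A)$. Let $P^\bullet$ be a bounded-above complex of projectives, and let
$\alpha^\bullet : P^\bullet \to L^\bullet$ be a morphism of cochain complexes such that the composition

$$\rho^\bullet \circ \alpha^\bullet : P^\bullet \to L^\bullet \to M^\bullet$$

is homotopic to the zero-morphism. Then $\alpha^\bullet \sim 0$.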


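Proof. Let $h^i : P^i \to M^{i-1}$ define a homotopy between $\rho^\bullet \circ \alpha^\bullet$ and 0, so that
$-\rho^i \circ \alpha^i = d^{i-1}_{M^\bullet} \circ h^i + h^{i+1} \circ d^i_{P^\bullet}$. Consider the mapping cone $MC(\rho)^\bullet$ of $\rho^\bullet$ (§4.1),
and define morphisms

$$\beta^i = (\alpha^i, h^i) : P^i \to L^i \oplus M^{i-1} = MC(\rho)^{i-1} :$$

$$
\begin{array}{ccccccccc}
\cdots & \longrightarrow & P^{i-1} & \xrightarrow{\;d^{i-1}_{P^\bullet}\;} & P^i & \xrightarrow{\;d^{i}_{P^\bullet}\;} & P^{i+1} & \longrightarrow & \cdots \\
 & & \big\downarrow{\scriptstyle\beta^{i-1}} & & \big\downarrow{\scriptstyle\beta^{i}} & & \big\downarrow{\scriptstyle\beta^{i+1}} & & \\
\cdots & \longrightarrow & L^{i-1}\oplus M^{i-2} & \longrightarrow & L^{i}\oplus M^{i-1} & \longrightarrow & L^{i+1}\oplus M^{i} & \longrightarrow & \cdots
\end{array}
$$

We claim that these give a morphism of cochain complexes $P^\bullet \to MC(\rho)[-1]^\bullet$.
Indeed^24,

$$\beta^{i+1} \circ d^i_{P^\bullet} = (\alpha^{i+1} \circ d^i_{P^\bullet},\; h^{i+1} \circ d^i_{P^\bullet}) \quad\text{and}$$
$$-d^{i-1}_{MC(\rho)^\bullet} \circ \beta^i = (d^i_{L^\bullet} \circ \alpha^i,\; -\rho^i \circ \alpha^i - d^{i-1}_{M^\bullet} \circ h^i);$$

these are equal, by definition of $h^i$ and since $\alpha^\bullet$ is a morphism of complexes.

24 It is time to review how the differential of the mapping cone was defined in §4.1. The
negative sign will now come in very handy.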

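Now, $\rho^\bullet$ is a quasi-isomorphism by hypothesis. Therefore $MC(\rho)^\bullet$ is an exact
complex (Corollary 4.2). Applying Corollary 5.12, it follows that $\beta^\bullet$ is homotopic
to zero. But $\alpha^\bullet$ is the composition

$$P^\bullet \xrightarrow{\;\beta^\bullet\;} MC(\rho)[-1]^\bullet = L^\bullet \oplus M[-1]^\bullet \longrightarrow L^\bullet,$$

so (Exercise 4.6, part (i)) the fact that $\beta^\bullet \sim 0$ implies that $\alpha^\bullet \sim 0$, concluding the
proof. □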

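Example 5.15. To see that $\alpha^\bullet$ may not be zero on the nose even if $\rho^\bullet \circ \alpha^\bullet = 0$,
look back again at Example 4.6:

[Diagram: $P^\bullet : \cdots \to 0 \to \mathbb{Z} \to \mathbb{Z} \to 0 \to \cdots$, together with $\alpha^\bullet : P^\bullet \to L^\bullet$ and $\rho^\bullet : L^\bullet \to M^\bullet$.]

Here $\rho^\bullet$ is a quasi-isomorphism, and $\rho^\bullet \circ \alpha^\bullet = 0$. According to Lemma 5.14, the
(nonzero) morphism $\alpha^\bullet$ is homotopic to 0. (Indeed, a homotopy is immediately
visible. What is it?) ⌟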

5.5. Proof of Theorem 5.9. Theorem 5.9 will follow from a more general obser- 
vation: a quasi-isomorphism to a complex of projectives, resp., from a complex of 
injectives 25 , has a right, resp., left, homotopy inverse. 

Proposition 5.16. Let A be an abelian category, and let $L^\bullet$ be a complex in $C(A)$.
Let $P^\bullet$ in $C^-(P)$ be a bounded-above complex of projectives, and let $\alpha^\bullet : L^\bullet \to P^\bullet$
be a quasi-isomorphism. Then there exists a morphism of complexes $\beta^\bullet : P^\bullet \to L^\bullet$
such that $\alpha^\bullet \circ \beta^\bullet$ is homotopic to $\mathrm{id}_{P^\bullet}$.

(For the injective version, if $Q^\bullet$ is a bounded-below complex of injectives and
$\alpha^\bullet : Q^\bullet \to L^\bullet$ is a quasi-isomorphism, then there exists a morphism $\beta^\bullet : L^\bullet \to Q^\bullet$
such that $\beta^\bullet \circ \alpha^\bullet \sim \mathrm{id}_{Q^\bullet}$.)


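Proof. Since $\alpha^\bullet : L^\bullet \to P^\bullet$ is a quasi-isomorphism, the mapping cone $MC(\alpha)^\bullet$ of
$\alpha^\bullet$ is an exact complex (Corollary 4.2). Let $\rho^\bullet$ be the morphism of complexes

$$\rho^\bullet = (0, \mathrm{id}_{P^\bullet}) : P^\bullet \to L[1]^\bullet \oplus P^\bullet = MC(\alpha)^\bullet.$$

Since $MC(\alpha)^\bullet$ is exact, $\rho^\bullet$ is homotopic to zero by Corollary 5.12. Therefore, there
exist morphisms

$$h^i : P^i \to MC(\alpha)^{i-1} = L^i \oplus P^{i-1}$$

such that

$$(*)\qquad \rho^i = d^{i-1}_{MC(\alpha)^\bullet} \circ h^i + h^{i+1} \circ d^i_{P^\bullet}.$$
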


25 That is, going ‘the wrong way’: projectives like being the sources of morphisms, and 
injectives like to be targets. 
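Now we unravel what this says. Write $h^i$ out in components:

$$h^i = (\beta^i, k^i)$$

with $\beta^i : P^i \to L^i$ and $k^i : P^i \to P^{i-1}$. We claim that

(i) the collection $\{\beta^i\}$ is a morphism of cochain complexes: $\beta^{i+1} \circ d^i_{P^\bullet} = d^i_{L^\bullet} \circ \beta^i$
for all $i$ and

(ii) the morphisms $k^i$ give a homotopy $\alpha^\bullet \circ \beta^\bullet \sim \mathrm{id}_{P^\bullet}$.

Indeed, the left-hand side of (*) is

$$\rho^i = (0, \mathrm{id}_{P^i});$$

the right-hand side is

$$d^{i-1}_{MC(\alpha)^\bullet} \circ (\beta^i, k^i) + (\beta^{i+1}, k^{i+1}) \circ d^i_{P^\bullet}$$
$$= (-d^i_{L^\bullet} \circ \beta^i,\; \alpha^i \circ \beta^i + d^{i-1}_{P^\bullet} \circ k^i) + (\beta^{i+1} \circ d^i_{P^\bullet},\; k^{i+1} \circ d^i_{P^\bullet}).$$

It follows that (*) amounts to

$$
\begin{cases}
\beta^{i+1} \circ d^i_{P^\bullet} - d^i_{L^\bullet} \circ \beta^i = 0, \\
\mathrm{id}_{P^i} - \alpha^i \circ \beta^i = d^{i-1}_{P^\bullet} \circ k^i + k^{i+1} \circ d^i_{P^\bullet},
\end{cases}
$$

that is, precisely (i) and (ii).

The morphism of complexes $\beta^\bullet : P^\bullet \to L^\bullet$ is the needed homotopy right-inverse
of $\alpha^\bullet$. □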

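Taking $L^\bullet$ to be a suitably bounded complex of projectives or injectives finally
establishes Theorem 5.9:

Proof of Theorem 5.9. We will give the argument for projectives.

By Proposition 5.16, since $P_1^\bullet$ is in $C^-(P)$ and $\alpha^\bullet : P_0^\bullet \to P_1^\bullet$ is a
quasi-isomorphism, there exists a morphism of complexes $\beta^\bullet : P_1^\bullet \to P_0^\bullet$ such that

$$\alpha^\bullet \circ \beta^\bullet \sim \mathrm{id}_{P_1^\bullet}.$$

This implies that $H^\bullet(\alpha^\bullet) \circ H^\bullet(\beta^\bullet) = \mathrm{id}$ (Proposition 4.10). Since $H^\bullet(\alpha^\bullet)$ is
invertible, so is $H^\bullet(\beta^\bullet)$: therefore, $\beta^\bullet$ is a quasi-isomorphism. Now $P_0^\bullet$ is also in $C^-(P)$;
hence by Proposition 5.16 there exists a morphism of complexes $\alpha'^\bullet : P_0^\bullet \to P_1^\bullet$
such that $\beta^\bullet \circ \alpha'^\bullet \sim \mathrm{id}_{P_0^\bullet}$. Since

$$\alpha'^\bullet \sim (\alpha^\bullet \circ \beta^\bullet) \circ \alpha'^\bullet = \alpha^\bullet \circ (\beta^\bullet \circ \alpha'^\bullet) \sim \alpha^\bullet$$

(Exercise 4.6), it follows that $\beta^\bullet \circ \alpha^\bullet \sim \mathrm{id}_{P_0^\bullet}$. Thus $\alpha^\bullet$ is a homotopy equivalence,
as stated. □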
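
The last step of the proof packs two homotopies into one line. Written out, with
$\alpha^\bullet : P_0^\bullet \to P_1^\bullet$ the given quasi-isomorphism, $\beta^\bullet$ its right homotopy inverse, and
$\alpha'^\bullet$ the right homotopy inverse of $\beta^\bullet$ (the subscripts 0 and 1 are the naming assumed
here for the two complexes), the chain reads:

```latex
% Each \sim uses that homotopy is compatible with composition (Exercise 4.6).
\begin{align*}
\alpha'^{\bullet} &= \mathrm{id}_{P_1^\bullet}\circ\alpha'^{\bullet}
  \sim (\alpha^\bullet\circ\beta^\bullet)\circ\alpha'^{\bullet}
  = \alpha^\bullet\circ(\beta^\bullet\circ\alpha'^{\bullet})
  \sim \alpha^\bullet\circ\mathrm{id}_{P_0^\bullet} = \alpha^\bullet, \\
\beta^\bullet\circ\alpha^\bullet &\sim \beta^\bullet\circ\alpha'^{\bullet}
  \sim \mathrm{id}_{P_0^\bullet}.
\end{align*}
```

Together with $\alpha^\bullet \circ \beta^\bullet \sim \mathrm{id}_{P_1^\bullet}$, this exhibits $\beta^\bullet$ as a two-sided homotopy inverse
of $\alpha^\bullet$.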
Exercises


5.1. ▷ Prove that the homotopic categories $K(A)$, $K^-(A)$, $K^+(A)$ of an abelian
category A are additive. [§5.2]

5.2. ▷ Prove Proposition 5.4. [§5.2]

5.3. Provide a reasonable definition of ‘projective’ and ‘injective’ objects in any
category, and prove that, according to your definition, every set is both projective
and injective in Set.

5.4. ▷ Upgrade Exercises VIII.6.4 and VIII.6.15 to objects of any abelian category.
[5.9, 6.3, §7.4]

5.5. Let F be a nontrivial finite abelian group. Prove that there are exact
sequences of finite abelian groups

$$0 \to A_1 \to A_2 \to F \to 0, \qquad 0 \to F \to B_1 \to B_2 \to 0$$

which do not split.

Deduce that the category of finite abelian groups has no nontrivial projective
or injective objects. [§5.3]

5.6. ▷ Let A, B be abelian categories, and let $\mathcal F : A \to B$, $\mathcal G : B \to A$ be additive
functors. Assume that $\mathcal F$ is left-adjoint to $\mathcal G$ and that $\mathcal G$ preserves epimorphisms.
Prove that if P is a projective object in A, then $\mathcal F(P)$ is a projective object in B.
Formulate an analogous result for injective objects. [5.8, §7.1]

5.7. ¬ Let A be an abelian category, and let C be a small category. Assume that
A contains all products indexed by any set^26; for example, R-Mod satisfies this
condition for every ring R. For an object A of A and a set S, denote by $A^S$ the
product of A indexed by S. Note that any set-map $S \to T$ determines a morphism
$A^T \to A^S$ in A.

For any object X of C, we will define a functor $\mathcal X : A \to A^C$; a much more
simple-minded functor $A^C \to A$, evaluation at X, was defined in Exercise 1.11.

For every object A of A, $\mathcal X(A)$ must be an object of $A^C$, that is, a functor
$C \to A$. The value of this functor at the object Y of C is prescribed to be

$$\mathcal X(A)(Y) := A^{\mathrm{Hom}_C(Y, X)}.$$

Every morphism $Y \to Z$ in C determines a function $\mathrm{Hom}_C(Z, X) \to \mathrm{Hom}_C(Y, X)$,
and this defines a morphism $\mathcal X(A)(Y) \to \mathcal X(A)(Z)$.

• Prove that this prescription defines $\mathcal X(A)$ as a functor $C \to A$.

• Prove that $\mathcal X$ is a functor $A \to A^C$, with the evident action on morphisms.

• Prove that $\mathcal X$ is right-adjoint to the evaluation functor of Exercise 1.11.

(Hint: You have to identify $\mathrm{Hom}_{A^C}(\mathcal F, \mathcal X(A))$ with $\mathrm{Hom}_A(\mathcal F(X), A)$. Given a
natural transformation $\alpha : \mathcal F \to \mathcal X(A)$, evaluate at X and extract the identity
component to obtain a morphism $\mathcal F(X) \to A$. Then show that this one piece of
information determines the whole natural transformation.) [5.8, 5.9]

26 Recall that ‘infinite’ products may be defined as limits; see Example VIII.1.10.

5.8. ¬ As in Exercise 5.7, let C be a small category and let A be an abelian
category containing all products indexed by sets. Prove that the functor $\mathcal X$ preserves
injectives for all objects X of C: if Q is an injective object of A, then $\mathcal X(Q)$ is
injective in $A^C$. (Hint: Exercise 5.6.) [5.9]

5.9. ¬ Let C be a small category, and let A be an abelian category containing all
products indexed by sets. Assume that A has enough injectives. Let $\mathcal F : C \to A$ be
a functor. For every object X of C, let $Q_X$ be an injective object of A admitting a
monomorphism $\mathcal F(X) \rightarrowtail Q_X$.

• With notation as in Exercise 5.7, let $\mathcal Q := \prod_{X \in \mathrm{Obj}(C)} \mathcal X(Q_X)$ (prove that this
product exists in $A^C$).

• Prove that $\mathcal Q$ is injective in $A^C$. (Exercises 5.4 and 5.8.)

• Define a morphism $\mathcal F \to \mathcal Q$, and prove it is a monomorphism. (Use adjunction.)

Therefore, we can conclude that $A^C$ has enough injectives if A is an abelian category
with enough injectives and A is closed with respect to products indexed by sets.

Prove that if A is an abelian category with enough projectives and A is closed
with respect to coproducts indexed by sets, then $A^C$ has enough projectives. (Hint:
$(A^C)^{op} \cong (A^{op})^{(C^{op})}$.) [5.10]

5.10. Prove that the category of presheaves of abelian groups on a topological 
space has enough injectives and enough projectives 27 . (Use Exercise 5.9.) 

5.11. ▷ Let $N^\bullet$ be the complex

$$\cdots \to 0 \to K \to M \to N \to 0 \to \cdots$$

Prove that $N^\bullet$ is exact and splits (in a sense analogous to the one given for modules
in §III.7.2) if and only if $\mathrm{id}_{N^\bullet}$ is homotopic to 0 (cf. Proposition III.7.5). Deduce
that every additive functor sends split exact sequences to split exact sequences.

More generally, let $N^\bullet$ be any complex. Prove that $\mathrm{id}_{N^\bullet}$ is homotopic to 0
(that is, $N^\bullet$ is ‘split exact’ as defined in §5.4) if and only if $N^\bullet$ is isomorphic to a
complex of the form

$$\cdots \to M^{-2} \oplus M^{-1} \xrightarrow{\;d^{-2}\;} M^{-1} \oplus M^{0} \xrightarrow{\;d^{-1}\;} M^{0} \oplus M^{1} \xrightarrow{\;d^{0}\;} M^{1} \oplus M^{2} \to \cdots$$

where $d^i$ is 0 on the $M^i$ factor, and $(\mathrm{id}_{M^{i+1}}, 0)$ on the $M^{i+1}$ factor. [§5.4, 6.3, §7.4]

5.12. ▷ Prove that every exact, bounded-above complex of projectives is homotopy
equivalent to zero. Prove that every exact, bounded-below complex of injectives is 
homotopy equivalent to zero. [§5.4] 

27 The category of sheaves of abelian groups on a topological space also has enough injectives, 
but in general it does not have enough projectives. 
5.13. Let A be an abelian category, and let $\alpha^\bullet : L^\bullet \to M^\bullet$ be a quasi-isomorphism
in $C(A)$. Assume that $\beta^\bullet$ is a homotopy one-sided inverse of $\alpha^\bullet$. Prove that $\beta^\bullet$ is
also a quasi-isomorphism.

5.14. ¬ Let $P_0^\bullet$, $P_1^\bullet$ be bounded-above complexes of projective objects in an abelian
category A. (You may assume A has enough projectives, if that helps.) Let $L^\bullet$ be
a (not necessarily bounded) complex in $C(A)$.

• Assume that there are quasi-isomorphisms

$$P_0^\bullet \xleftarrow{\;\text{q-iso}\;} L^\bullet \xrightarrow{\;\text{q-iso}\;} P_1^\bullet.$$

Prove that $P_0^\bullet$ and $P_1^\bullet$ are homotopy equivalent.

• Assume that there are quasi-isomorphisms

$$P_0^\bullet \xrightarrow{\;\text{q-iso}\;} L^\bullet \xleftarrow{\;\text{q-iso}\;} P_1^\bullet.$$

Prove that $P_0^\bullet$ and $P_1^\bullet$ are homotopy equivalent. (This may turn out to be
challenging. If so, wait until you have covered §6, in particular Theorem 6.6.)

[6.5]


6. Projective and injective resolutions and the derived category

Theorem 5.9 is very good news. It tells us that if we are willing and able to change 
our viewpoint and adopt bounded complexes of projectives or injectives as our 
natural environment, then we will have an excellent handle on the subtle notion 
of quasi-isomorphism: because quasi-isomorphisms become homotopy equivalences 
in that environment and homotopy equivalences are better behaved than arbitrary 
quasi-isomorphisms. For example, homotopy equivalence of complexes is manifestly 
an equivalence relation (Exercise 4.4), while quasi-isomorphism is not (see the
discussion following Example 4.6). Further, homotopy equivalences are preserved by
every additive functor (Theorem 4.14), while general quasi-isomorphisms are not 
(Exercise 4.15). 
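
A small example in the category of abelian groups makes the last contrast concrete.
It is offered as an illustration under the assumption that the additive functor is
$- \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$; Exercise 4.15 may well use a different one.

```latex
% Both complexes have their rightmost nonzero term in degree 0.
\[
L^\bullet:\ \cdots\to 0\to \mathbb{Z}\xrightarrow{\ \cdot 2\ }\mathbb{Z}\to 0\to\cdots,
\qquad
M^\bullet:\ \cdots\to 0\to 0\longrightarrow \mathbb{Z}/2\mathbb{Z}\to 0\to\cdots
\]
% rho^\bullet : L^\bullet \to M^\bullet is reduction mod 2 in degree 0. It is a quasi-isomorphism:
\[
H^{-1}(L^\bullet)=0=H^{-1}(M^\bullet),\qquad
H^{0}(L^\bullet)=\mathbb{Z}/2\mathbb{Z}\xrightarrow{\ \sim\ }H^{0}(M^\bullet).
\]
% Apply F = - \otimes \mathbb{Z}/2\mathbb{Z}: multiplication by 2 becomes the zero map.
\[
F(L^\bullet):\ \cdots\to 0\to \mathbb{Z}/2\mathbb{Z}\xrightarrow{\ 0\ }\mathbb{Z}/2\mathbb{Z}\to 0\to\cdots,
\qquad
H^{-1}(F(L^\bullet))=\mathbb{Z}/2\mathbb{Z}\neq 0=H^{-1}(F(M^\bullet)).
\]
```

So $F(\rho^\bullet)$ is not a quasi-isomorphism, even though $\rho^\bullet$ is.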

The question is therefore how to switch to a homotopic environment and adopt 
complexes of projectives or injectives as our primary object of study, while keeping 
intact the essential information carried by the objects of A or even the complexes 
in C(A). 

This can be done if A has enough projectives or enough injectives; in this 
situation, we will be able to produce an adequate description of the derived category 
(§6.3). 
6.1. Recovering A. Recall once more that a ‘copy’ of an abelian category A is
available within $C(A)$, by associating with every object A in A the complex $\iota(A)$
having A in degree 0 and 0 elsewhere. Composing with the functor $C(A) \to K(A)$
realizes A as a full subcategory of $K(A)$ (Exercise 6.1). We will look for ‘more
interesting’ copies of A in the homotopic category.

Let us begin by recalling (and sharpening) our definition of projective/injective 
resolutions of an object A. 

Definition 6.1. Let A be an object of an abelian category A.

A projective resolution of A is a quasi-isomorphism $P^\bullet \to \iota(A)$, where $P^\bullet$ is a
complex in $C^{\le 0}(P)$.

An injective resolution of A is a quasi-isomorphism $\iota(A) \to Q^\bullet$, where $Q^\bullet$ is a
complex in $C^{\ge 0}(I)$. ⌟

By common abuse of language, we will often refer to P* or Q* as the resolution, 
leaving the quasi-isomorphism understood. 

Remark 6.2. The terminology is potentially confusing, since it hints that the 
resolutions themselves may be projective/injective as objects of the abelian
category $C(A)$, or of its bounded variations. This is not the case (Exercise 6.3). ⌟

Projective or injective resolutions need not exist: as we pointed out
(Example 5.6), an abelian category may have no nontrivial projective or injective objects.
It is however clear that such resolutions exist if the category has enough
projectives/injectives: if A has enough projectives, then given any object A of A there
is a projective $P^0$ with an epimorphism $\pi : P^0 \to A$; and then a projective $P^{-1}$
with an epimorphism $d^{-1} : P^{-1} \to \ker \pi$; and then a projective $P^{-2}$ with an
epimorphism $d^{-2} : P^{-2} \to \ker d^{-1}$; and so on. Letting $d^i_{P^\bullet}$ be the composition
$P^i \twoheadrightarrow \ker d^{i+1} \rightarrowtail P^{i+1}$ gives

$$\cdots \longrightarrow P^{-2} \xrightarrow{\;d^{-2}_{P^\bullet}\;} P^{-1} \xrightarrow{\;d^{-1}_{P^\bullet}\;} P^{0} \longrightarrow 0 \longrightarrow \cdots,$$

a projective resolution of A. Just as clearly, if A has enough injectives, then every
object A has an injective resolution.
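
As a concrete instance of this recipe (an illustration in the category of abelian
groups, where free groups are projective; the choice $A = \mathbb{Z}/n\mathbb{Z}$ with $n \ge 2$ is made here
for the example), the construction stops after two steps:

```latex
% Step 1: pick a projective mapping onto A.  Step 2: cover ker(pi) = nZ by a projective.
\[
\cdots \longrightarrow 0 \longrightarrow \underset{P^{-1}}{\mathbb{Z}}
\xrightarrow{\ d^{-1} \,=\, \cdot n\ } \underset{P^{0}}{\mathbb{Z}}
\longrightarrow 0 \longrightarrow \cdots,
\qquad \pi : P^0 = \mathbb{Z} \twoheadrightarrow \mathbb{Z}/n\mathbb{Z}.
\]
% ker(pi) = nZ, and multiplication by n maps P^{-1} = Z isomorphically onto it;
% ker(d^{-1}) = 0, so one may take P^{i} = 0 for i <= -2.
\[
H^{0}(P^\bullet) = \mathbb{Z}/n\mathbb{Z}, \qquad H^{i}(P^\bullet) = 0 \ \text{ for } i \neq 0 .
\]
```

Thus $\pi$ induces a quasi-isomorphism from $P^\bullet$ to the complex with $\mathbb{Z}/n\mathbb{Z}$ in degree 0.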

Now consider the category of homotopy classes of quasi-isomorphisms with
target $\iota(A)$ (cf. Remark 4.5): this is a ‘homotopic category of resolutions’ of A.
We claim that projective resolutions, if they exist, are initial in this category.
(Analogously, injective resolutions are final in the homotopic category of
quasi-isomorphisms with source $\iota(A)$.)

As the reader knows, this means that projective resolutions are supposed to 
map uniquely (in K(A)) to every resolution of A. The following key lemma proves 
this and more: 

Lemma 6.3. Let A be an object of an abelian category A. Let $M^\bullet$ be a resolution
of A, and let $P^\bullet$ be any complex in $C^{\le 0}(P)$. Let

$$\varphi : H^0(P^\bullet) \to H^0(M^\bullet) = A$$

be an arbitrary morphism. Then

• there exist morphisms of complexes $\alpha^\bullet : P^\bullet \to M^\bullet$ inducing $\varphi$ at the level
of $H^0$;

• different morphisms satisfying the previous requirement are necessarily
homotopic.

Of course an analogous statement holds for complexes $Q^\bullet$ in $C^{\ge 0}(I)$, giving
morphisms of cochain complexes $M^\bullet \to Q^\bullet$ inducing a given morphism in $H^0$.


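Proof. We have to define $\alpha^i : P^i \to M^i$ for all $i$. Since $\alpha^i = 0$ necessarily for
$i > 0$, we may as well replace $M^\bullet$ with its truncated version (cf. Exercise 3.1) and
then extend both $P^\bullet$ and this complex as follows:

$$
\begin{array}{ccccccccccc}
\cdots & \to & P^{-2} & \to & P^{-1} & \to & P^{0} & \xrightarrow{\;\pi\;} & H^0(P^\bullet) & \to & 0 \\
 & & \big\downarrow{\scriptstyle\alpha^{-2}} & & \big\downarrow{\scriptstyle\alpha^{-1}} & & \big\downarrow{\scriptstyle\alpha^{0}} & & \big\downarrow{\scriptstyle\varphi} & & \\
\cdots & \to & M^{-2} & \to & M^{-1} & \to & \ker d^0_{M^\bullet} & \xrightarrow{\;\rho\;} & H^0(M^\bullet) = A & \to & 0
\end{array}
$$

Note that the bottom complex is then exact. The morphism $\varphi$ is given to us, and
the task is to define the dotted lifts $\alpha^i$, $i \le 0$, so as to obtain a morphism of
complexes.

Now, $P^0$ is projective and maps to A by $\varphi \circ \pi$; this guarantees the existence of
a lifting $\alpha^0$ of $\varphi$. (The map $P^0 \to M^0$ to the degree-0 term of the original complex
is obtained by following with $\ker d^0_{M^\bullet} \rightarrowtail M^0$.) Next, note that

$$\rho \circ \alpha^0 \circ d^{-1}_{P^\bullet} = \varphi \circ \pi \circ d^{-1}_{P^\bullet} = 0;$$

therefore, $\alpha^0 \circ d^{-1}_{P^\bullet}$ factors through $\ker \rho$. Since the bottom complex is exact,
$\ker \rho = \operatorname{im} d^{-1}_{M^\bullet}$. Therefore, we have the diagram

$$
\begin{array}{ccccc}
 & & P^{-1} & & \\
 & & \big\downarrow{\scriptstyle \alpha^0 \circ d^{-1}_{P^\bullet}} & & \\
M^{-1} & \xrightarrow{\;d^{-1}_{M^\bullet}\;} & \operatorname{im} d^{-1}_{M^\bullet} & \longrightarrow & 0
\end{array}
$$

and a lift $\alpha^{-1}$ then exists since $P^{-1}$ is projective.

The construction of the other $\alpha^{-i}$, $i > 1$, proceeds inductively in precisely the
same way, using at each step the fact that the bottom complex is exact.

This proves the existence of $\alpha^\bullet : P^\bullet \to M^\bullet$. Its uniqueness up to homotopy
follows immediately from Lemma 5.11. □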


Summarizing, if A contains enough projectives, then every object A of A admits
a projective resolution, and further this is initial among all resolutions with target
$\iota(A)$. If A has enough injectives, then every object A of A admits an injective
resolution, and this is final among all resolutions with source $\iota(A)$.

Recall that initial, resp., final, objects of a category are always unique up to
isomorphism (Proposition I.5.4). Here, this says the following:

Proposition 6.4. Any two projective (resp., injective) resolutions of an object A
of an abelian category A are homotopy equivalent.

(This is also a direct consequence of Lemma 6.3.)

The moral we extract from these considerations is that if an abelian category A
has enough (say) projectives, then we can associate with each object A of A an
object of $K^{\le 0}(P)$, determined up to homotopy. This can in fact be done functorially,
in the sense that morphisms in A can be lifted to morphisms of corresponding
projective resolutions:

Proposition 6.5. Let $A_0$, $A_1$ be objects of an abelian category A, and let $P_i^\bullet$ be
a projective resolution of $A_i$, $i = 0, 1$. Then every morphism $\varphi : A_0 \to A_1$ in A is
induced by a morphism $\alpha^\bullet : P_0^\bullet \to P_1^\bullet$, uniquely determined up to homotopy.

Of course an analogous statement holds for injective resolutions.

Proof. By hypothesis, $\varphi$ is a morphism $H^0(P_0^\bullet) \to H^0(P_1^\bullet)$. The complex $P_0^\bullet$
consists of projectives, and $P_1^\bullet$ is a resolution of $A_1$; therefore a lift $\alpha^\bullet$ exists and
is unique up to homotopy, as an immediate consequence of Lemma 6.3. □

The assignment provided by Proposition 6.5 is clearly covariant. The bottom
line is that the part of A consisting of objects with (say) projective resolutions
is equivalent to a full subcategory of $K^-(P)$. Dear reader, please formalize this
assertion, for it is a key point (Exercise 6.2). If A has enough projectives, this gives
the promised ‘copy’ of A within $K^-(P)$.

Of course the situation is entirely analogous concerning the subcategory of A
consisting of objects admitting an injective resolution and $K^+(I)$.

6.2. From objects to complexes. Suppose that an abelian category A has
enough projectives. At this point we hopefully agree (Exercise 6.2) that the
homotopic category $K^-(P)$ contains a full subcategory which is equivalent to A: the
(homotopy) category of projective resolutions of objects of A. While a priori each
object of A corresponds to many projective resolutions, these are all homotopy
equivalent to one another (by Proposition 6.4), hence isomorphic in $K^-(P)$;
morphisms in A correspond precisely to homotopy classes of morphisms between
projective resolutions (by Proposition 6.5), hence to morphisms in $K^-(P)$.

In practice, this says that we can use this subcategory of $K^-(P)$ as a
replacement for the original abelian category A. We are very close to achieving our goal of
identifying the ‘essential nature’ of cohomology: since homotopy equivalent
complexes have the same cohomology (by Theorem 4.14) and additive functors preserve
homotopy equivalence, objects of $K^-(P)$ are ideal carriers of cohomology invariants:
while applying an additive functor to objects of A in general destroys
cohomological information, applying ‘the same functor’ to a corresponding object in $K^-(P)$
preserves that information.

We will follow this lead in the next section; the functors $\mathrm{Tor}_i$ and $\mathrm{Ext}^i$
encountered in Chapter VIII will arise precisely in this fashion and will stand as our poster
examples of derived functors.
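
As a preview of how this plays out (a sketch that assumes the usual recipe: apply the
functor to a projective resolution and take cohomology, reading $\mathrm{Tor}_i$ in degree $-i$),
take the abelian group $\mathbb{Z}/n\mathbb{Z}$, its projective resolution $P^\bullet : 0 \to \mathbb{Z} \xrightarrow{\cdot n} \mathbb{Z} \to 0$,
and the additive functor $- \otimes_{\mathbb{Z}} \mathbb{Z}/m\mathbb{Z}$. The integers $m, n \ge 1$ are chosen here for
illustration.

```latex
% Tensoring the resolution with Z/mZ term by term:
\[
P^\bullet\otimes_{\mathbb{Z}}\mathbb{Z}/m\mathbb{Z}:\quad
\cdots\to 0\to \mathbb{Z}/m\mathbb{Z}\xrightarrow{\ \cdot n\ }\mathbb{Z}/m\mathbb{Z}\to 0\to\cdots
\]
% Cohomology in degrees 0 and -1 (cokernel and kernel of multiplication by n on Z/mZ):
\[
H^{0} = \frac{\mathbb{Z}/m\mathbb{Z}}{n\cdot(\mathbb{Z}/m\mathbb{Z})}\cong \mathbb{Z}/\gcd(m,n)\mathbb{Z},
\qquad
H^{-1} = \{x\in\mathbb{Z}/m\mathbb{Z} : nx = 0\}\cong \mathbb{Z}/\gcd(m,n)\mathbb{Z}.
\]
```

Applying the functor to the object $\mathbb{Z}/n\mathbb{Z}$ itself only produces the degree-0 group; the
group in degree $-1$ is the extra information retained by the resolution.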
Of course, exactly the same situation will occur if A has enough injectives, in
which case the appropriate replacement for A is found in $K^+(I)$. We will say a word
about the general situation when projectives or injectives are not available, in §9.1.

The reader should now wonder whether the categories $K^-(P)$, $K^+(I)$ may in fact
be used to extract cohomological information for any (bounded) complex, not just
for resolutions of a given object of A. As observed in §4.2, resolutions (in $C^-(A)$
or $C^+(A)$) are just special cases of quasi-isomorphisms; the natural question is
whether every (suitably bounded) complex has a ‘quasi-isomorphic copy’ in $K^-(P)$
or $K^+(I)$, again uniquely defined up to isomorphism in the homotopy category (that
is, up to homotopy equivalence in $C(A)$).

This is indeed the case. 

Theorem 6.6. Assume the abelian category A has enough projectives, and let $L^\bullet$ be
a complex in $C^-(A)$. Then there exists a bounded-above complex of projectives $P^\bullet$
and a quasi-isomorphism $P^\bullet \to L^\bullet$, and $P^\bullet$ is uniquely defined up to homotopy
equivalence. Further, every morphism $\alpha^\bullet$ of complexes in $C^-(A)$ lifts to a morphism
of the corresponding projective resolutions in $K^-(P)$, also uniquely determined (up
to homotopy).

The lift obtained in the last part of this statement exists in the homotopy
category: any representative in $C^-(P)$ will be a ‘homotopy lift’ of $\alpha^\bullet$, in the sense
that the relevant diagram will only be guaranteed to commute up to homotopy.

We will call a complex $P^\bullet$ as in the statement a ‘projective resolution of $L^\bullet$’;
this is a slight abuse of language, since it forgets the specific quasi-isomorphism
$P^\bullet \to L^\bullet$.

As always, there is a mirror statement to Theorem 6.6 for injectives, which we
will leave to the reader to formulate precisely. It is a particularly good exercise for
the reader to construct a stand-alone proof for the injective case.


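Proof. The proof of this result is admittedly rather technical, as it involves many
of the tools that we have developed.

Construction of $P^\bullet$. We may assume that $L^\bullet$ is in $C^{\le 0}(A)$,

$$\cdots \longrightarrow L^{-2} \xrightarrow{\;d^{-2}_{L^\bullet}\;} L^{-1} \xrightarrow{\;d^{-1}_{L^\bullet}\;} L^{0} \longrightarrow 0 \longrightarrow \cdots,$$

and we are seeking a complex $P^\bullet$ in $C^{\le 0}(P)$ and a quasi-isomorphism $\lambda^\bullet : P^\bullet \to L^\bullet$:

$$
\begin{array}{ccccccccc}
\cdots & \to & P^{-2} & \to & P^{-1} & \to & P^{0} & \to & 0 \\
 & & \big\downarrow{\scriptstyle\lambda^{-2}} & & \big\downarrow{\scriptstyle\lambda^{-1}} & & \big\downarrow{\scriptstyle\lambda^{0}} & & \\
\cdots & \to & L^{-2} & \to & L^{-1} & \to & L^{0} & \to & 0
\end{array}
$$

The construction will give us a little more: we will obtain a projective $P^i$ and
an epimorphism $\lambda^i : P^i \to L^i$ and moreover such that each induced morphism
$\lambda^i : \ker d^i_{P^\bullet} \to \ker d^i_{L^\bullet}$ is also an epimorphism.

For $i > 0$, necessarily $\lambda^i = 0$. Arguing inductively, we may assume we have
already constructed a suitable $\lambda^{i+1}$ and use it to construct $\lambda^i$. Here is the diagram
summarizing this inductive step:

$$
\begin{array}{ccccc}
P^i & \twoheadrightarrow & L^i \times_{\ker d^{i+1}_{L^\bullet}} \ker d^{i+1}_{P^\bullet} & \xrightarrow{\;\pi\;} & \ker d^{i+1}_{P^\bullet} \\
 & & \big\downarrow & & \big\downarrow{\scriptstyle\lambda^{i+1}} \\
 & & L^i & \xrightarrow{\;\bar d^{\,i}_{L^\bullet}\;} & \ker d^{i+1}_{L^\bullet}
\end{array}
$$

Here, $\bar d^{\,i}_{L^\bullet}$ restricts the target of $d^i_{L^\bullet}$. The square is a pull-back (cf. Examples 1.11
and 2.2). The morphism $\lambda^{i+1}$ on the right is an epimorphism by induction, and it
follows that the morphism on the left is also an epimorphism, by Lemma 2.3. A
projective object $P^i$ with an epimorphism to the fibered product exists because A
has enough projectives; the composition $\lambda^i : P^i \to L^i$ is then also an epimorphism,
as claimed. Since the square is a pull-back, the kernel of $\pi$ maps isomorphically
to the kernel of $\bar d^{\,i}_{L^\bullet}$ (Exercise 1.16); it follows easily that the new induced map
$\lambda^i : \ker d^i_{P^\bullet} \to \ker d^i_{L^\bullet}$ is an epimorphism, concluding the inductive step and the
construction of the complex $P^\bullet$ and of the morphism $\lambda^\bullet$.

To see that the morphism induced in cohomology by $\lambda^\bullet$ is an isomorphism,
look again at the pull-back diagram and note that $H^{i+1}(P^\bullet)$, resp., $H^{i+1}(L^\bullet)$, may
be realized as the cokernel of $\pi$, resp., $\bar d^{\,i}_{L^\bullet}$. Since $\lambda^{i+1}$ is an epimorphism, so is the
morphism

$$(\bar d^{\,i}_{L^\bullet}, -\lambda^{i+1}) : L^i \oplus \ker d^{i+1}_{P^\bullet} \to \ker d^{i+1}_{L^\bullet}.$$

As the diagram is a pull-back, this implies (as observed in Example 2.2) that it is
a push-out as well, and (Exercise 1.16 again) it follows that the induced morphism
$\operatorname{coker} \pi \to \operatorname{coker} \bar d^{\,i}_{L^\bullet}$ is an isomorphism. As noted, this is nothing but

$$H^{i+1}(\lambda^\bullet) : H^{i+1}(P^\bullet) \to H^{i+1}(L^\bullet),$$

so we are done.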

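Uniqueness up to homotopy. It suffices to compare an arbitrary $\overline{P}^\bullet$ in $C^{\le 0}(P)$
mapping to $L^\bullet$ by a quasi-isomorphism $\overline{\lambda}^\bullet$ with the complex $P^\bullet$ constructed in the
first part of the proof. Working on the same diagram used above,
we claim that the morphism $\overline{P}^\bullet \to L^\bullet$ lifts to a morphism $\overline{P}^\bullet \to P^\bullet$. Indeed, both
complexes are 0 in degree $\gg 0$; thus, for $i \gg 0$, $\overline{P}^i \to P^i$ is necessarily the
zero-morphism. Arguing inductively once more, we assume the lift has been constructed
in all degrees $> i$, and we construct it in degree $i$.

The composition $\overline{P}^i \to \overline{P}^{i+1} \to P^{i+1} \to P^{i+2}$ agrees with the composition
$\overline{P}^i \to \overline{P}^{i+1} \to \overline{P}^{i+2} \to P^{i+2}$, so it is the zero-morphism. Therefore the composition
$\overline{P}^i \to \overline{P}^{i+1} \to P^{i+1}$ factors through $\ker d^{i+1}_{P^\bullet}$: this gives the dotted morphism
$\overline{P}^i \to \ker d^{i+1}_{P^\bullet}$ in the diagram. By the universal property of fibered products, we
obtain the dashed morphism from $\overline{P}^i$ to the fibered product. Since $P^i$
maps epimorphically to $L^i \times_{\ker d^{i+1}_{L^\bullet}} \ker d^{i+1}_{P^\bullet}$, the needed lift $\Lambda^i : \overline{P}^i \to P^i$ exists as
$\overline{P}^i$ is projective. The ‘outer’ diagram is commutative by construction; hence the
collection $\Lambda^i$ gives a morphism of cochain complexes $\Lambda^\bullet$. Note that so far we have
not used the hypothesis that $\overline{\lambda}^\bullet$ is a quasi-isomorphism.

We now have the following commutative diagram in $C^-(A)$:

$$
\begin{array}{ccc}
\overline{P}^\bullet & \xrightarrow{\;\Lambda^\bullet\;} & P^\bullet \\
 & {\scriptstyle\overline{\lambda}^\bullet}\searrow & \big\downarrow{\scriptstyle\lambda^\bullet} \\
 & & L^\bullet
\end{array}
$$

Both $\lambda^\bullet$ and $\overline{\lambda}^\bullet$ are quasi-isomorphisms; therefore so is $\Lambda^\bullet$. Theorem 5.9 implies
then that $\Lambda^\bullet$ is a homotopy equivalence, as needed.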

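Lifting morphisms. Let $\alpha^\bullet : \overline{L}^\bullet \to L^\bullet$ be any morphism in $C^-(A)$. The
argument we have just given shows that if $\overline{P}^\bullet$ is in $C^-(P)$ and $L^\bullet$ is in $C^-(A)$, then every
morphism $\overline{P}^\bullet \to L^\bullet$ lifts to a morphism $\overline{P}^\bullet \to P^\bullet$, where $P^\bullet$ is the complex
constructed in the first part of the proof. In fact, we have now established that there
exists a homotopy lift^28 to any complex $P^\bullet$ mapping quasi-isomorphically to $L^\bullet$,
since every such complex is homotopy equivalent to the one constructed earlier.

Applying this observation to any complex $\overline{P}^\bullet$ mapping quasi-isomorphically
to $\overline{L}^\bullet$ lifts $\alpha^\bullet$ as needed.

Finally, assume that both $\tilde\alpha_0^\bullet$ and $\tilde\alpha_1^\bullet : \overline{P}^\bullet \to P^\bullet$ lift $\alpha^\bullet$ homotopically.
This means that $\lambda^\bullet \circ (\tilde\alpha_0^\bullet - \tilde\alpha_1^\bullet)$ is homotopic to 0, where $\lambda^\bullet : P^\bullet \to L^\bullet$ is the
given quasi-isomorphism.

It follows that $\tilde\alpha_0^\bullet - \tilde\alpha_1^\bullet \sim 0$, by Lemma 5.14, and this concludes the proof. □

28 That is, the corresponding diagram will commute up to homotopy. In general, a lift
in $C^-(P)$ may not exist; cf. Exercise 6.10.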
6.3. Poor man’s derived category. We are now in a position to close a
relatively large circle of ideas. We claim that if the abelian category A has enough
projectives, resp., injectives, then $K^-(P)$, resp., $K^+(I)$, is a solution ‘up to
isomorphisms’ of the universal problem presented in §4.2. The precise statement is the
following.

By Proposition 5.4, the functor $C(A) \to D(A)$ factors through the homotopic
category; this of course holds for the bounded versions of these categories as well.
Thus, we have functors $K^-(A) \to D^-(A)$ and $K^+(A) \to D^+(A)$. By restriction, we
obtain functors

$$K^-(P) \to D^-(A), \qquad K^+(I) \to D^+(A).$$

Theorem 6.7. Let A be an abelian category with enough projectives. Then the
functor $K^-(P) \to D^-(A)$ is an equivalence of categories.

If A has enough injectives, then the functor $K^+(I) \to D^+(A)$ is an equivalence
of categories.

Theorem 6.7 is proven in full detail in any more complete treatment of
homological algebra. Since we have not actually constructed $D^-(A)$, we cannot really
prove this statement here; but we now know enough to appreciate why Theorem 6.7
should be true, in the sense that $K^-(P)$, $K^+(I)$ satisfy the expected universal
properties. We will deal with the projective side of the story, since the injective side
mirrors it faithfully.

Assume that A has enough projectives. Define a functor $\mathcal P : C^-(A) \to K^-(P)$
by associating with every bounded-above complex $L^\bullet$ any projective
resolution $\mathcal P(L^\bullet)$ of $L^\bullet$ (which exists by Theorem 6.6) and with every morphism
$\alpha^\bullet : L^\bullet \to M^\bullet$ in $C^-(A)$ the morphism $\mathcal P(\alpha^\bullet)$ in $K^-(P)$ lifting $\alpha^\bullet$.

The morphism $\mathcal P(\alpha^\bullet)$ is the homotopy class of a homotopy lift of $\alpha^\bullet$; we have
proved that any two such lifts are homotopic. It also follows that $\mathcal P$ is
indeed a functor: for example, if $\alpha^\bullet : L^\bullet \to M^\bullet$ and $\beta^\bullet : M^\bullet \to N^\bullet$ are morphisms
in $C^-(A)$, then both $\mathcal P(\beta^\bullet) \circ \mathcal P(\alpha^\bullet)$ and $\mathcal P(\beta^\bullet \circ \alpha^\bullet)$ are classes of homotopy lifts
of $\beta^\bullet \circ \alpha^\bullet$; therefore $\mathcal P(\beta^\bullet) \circ \mathcal P(\alpha^\bullet) = \mathcal P(\beta^\bullet \circ \alpha^\bullet)$ by uniqueness, as needed.

Note that there is a very large element of arbitrariness in the choice of $\mathcal P$: we
are not prescribing any particular recipe for choosing a projective resolution rather
than another. But all such resolutions are homotopy equivalent, as proven in
Theorem 6.6; therefore different choices would only move the resolutions in their
isomorphism class in $K^-(P)$.

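Remark 6.8. There is an efficient way to clarify (and defuse) this arbitrariness.
Suppose $\mathcal P$, $\mathcal P'$ are two choices for the resolution functor $C^-(A) \to K^-(P)$. Thus,
we have chosen two projective resolutions $\mathcal P(L^\bullet)$, $\mathcal P'(L^\bullet)$ for each object $L^\bullet$ in
$C^-(A)$. By Theorem 6.6, there is a homotopy equivalence between $\mathcal P(L^\bullet)$ and
$\mathcal P'(L^\bullet)$, that is, an isomorphism $\rho_{L^\bullet} : \mathcal P(L^\bullet) \to \mathcal P'(L^\bullet)$ in $K^-(P)$; this
isomorphism is unique (since it lifts the identity on $L^\bullet$ and lifts are unique up to
homotopy). Further, for every morphism of cochain complexes $\alpha^\bullet : L_0^\bullet \to L_1^\bullet$, the
diagram

$$
\begin{array}{ccc}
\mathcal P(L_0^\bullet) & \xrightarrow{\;\mathcal P(\alpha^\bullet)\;} & \mathcal P(L_1^\bullet) \\
\big\downarrow{\scriptstyle\rho_{L_0^\bullet}} & & \big\downarrow{\scriptstyle\rho_{L_1^\bullet}} \\
\mathcal P'(L_0^\bullet) & \xrightarrow{\;\mathcal P'(\alpha^\bullet)\;} & \mathcal P'(L_1^\bullet)
\end{array}
$$

commutes: this is again by the uniqueness of lifts up to homotopy, since $\rho_{L_1^\bullet} \circ \mathcal P(\alpha^\bullet)$
and $\mathcal P'(\alpha^\bullet) \circ \rho_{L_0^\bullet}$ both correspond to lifts of $\alpha^\bullet$ to $\mathcal P(L_0^\bullet) \to \mathcal P'(L_1^\bullet)$.

This says that there is a unique natural isomorphism^29 between any two choices
$\mathcal P$, $\mathcal P'$ of functors of projective resolutions. In a categorical context, this is
essentially as good as uniqueness. Thus the arbitrariness in the choice of $\mathcal P$ is less
dramatic than it may seem at first. ⌟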


Choose any functor $\mathscr{P}$ as above. We claim that $\mathscr{P} : C^-(A) \to K^-(P)$ is
essentially a solution to the universal problem defining the derived category $D^-(A)$.
Here is a more precise statement:

Theorem 6.9. With notation as above, we have the following. 

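• If $\rho^\bullet$ is a quasi-isomorphism in $C^-(A)$, then $\mathscr{P}(\rho^\bullet)$ is an isomorphism in $K^-(P)$.

• Let $\mathscr{F} : C^-(A) \to D$ be an additive functor such that $\mathscr{F}(\rho^\bullet)$ is an
isomorphism for every quasi-isomorphism $\rho^\bullet$. Then there exists a functor
$\underline{\mathscr{F}} : K^-(P) \to D$, such that the diagram

$$
\begin{array}{ccc}
C^-(A) & \xrightarrow{\ \mathscr{F}\ } & D \\
 & {\scriptstyle \mathscr{P}}\searrow \quad \nearrow{\scriptstyle \underline{\mathscr{F}}} & \\
 & K^-(P) &
\end{array}
$$

commutes up to natural isomorphism. (That is, there is a natural transformation
$\nu : \underline{\mathscr{F}} \circ \mathscr{P} \rightsquigarrow \mathscr{F}$ such that

$$\nu_{L^\bullet} : \underline{\mathscr{F}} \circ \mathscr{P}(L^\bullet) \to \mathscr{F}(L^\bullet)$$

is an isomorphism for every complex $L^\bullet$ in $C^-(A)$.) The functor $\underline{\mathscr{F}}$ is unique
up to natural isomorphism.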


The fact that the diagram does not commute on the nose should not be too
disturbing. It corresponds to the fact (stated in Theorem 6.7) that there is ‘only’
an equivalence of categories between $K^-(P)$ and $D^-(A)$, not a hard and fast
‘isomorphism’. As we already pointed out in §VIII.1.3, this is what should be expected
in a categorical context.


^{29} ‘Natural transformations’ were rapidly defined in §VIII.1.5, Definition VIII.1.15. They are
the most sensible notion of a morphism between functors. A natural isomorphism is a natural
transformation that is an isomorphism on each object.

Proof. For the first point, let $\rho^\bullet : L^\bullet \to M^\bullet$ be any morphism in $C^-(A)$. By
Theorem 6.6, $\rho^\bullet$ lifts to a morphism of resolutions: we have a diagram

$$
\begin{array}{ccc}
\mathscr{P}(L^\bullet) & \xrightarrow{\ \rho'^\bullet\ } & \mathscr{P}(M^\bullet) \\
{\scriptstyle \mu_{L^\bullet}}\downarrow & & \downarrow{\scriptstyle \mu_{M^\bullet}} \\
L^\bullet & \xrightarrow{\ \rho^\bullet\ } & M^\bullet
\end{array}
$$

which commutes up to homotopy; here $\mu_{L^\bullet}$, $\mu_{M^\bullet}$ are quasi-isomorphisms, and
$\mathscr{P}(\rho^\bullet)$ is the homotopy class of $\rho'^\bullet$. If $\rho^\bullet$ is a quasi-isomorphism, then so is $\rho'^\bullet$
(since the diagram commutes up to homotopy; hence it commutes after taking
cohomology); it follows that $\mathscr{P}(\rho^\bullet)$ is an isomorphism in this case, by Corollary 5.10.

To prove the factorization property, define $\underline{\mathscr{F}} : K^-(P) \to D$ by setting

$$\underline{\mathscr{F}}(P^\bullet) := \mathscr{F}(P^\bullet)$$

for any bounded-above complex $P^\bullet$ of projectives, and define $\underline{\mathscr{F}}$ of a morphism in
$K^-(P)$ to be $\mathscr{F}(\alpha^\bullet)$ for any representative $\alpha^\bullet$ of that morphism.

The fact that the action of $\underline{\mathscr{F}}$ on morphisms is well-defined is precisely the
content of Lemma 5.1: homotopic morphisms have the same image in $D$. It is
immediate that $\underline{\mathscr{F}}$ is a functor, and we have to check the statement about
commutativity up to isomorphism of the diagram and the uniqueness of $\underline{\mathscr{F}}$ up to natural
isomorphism.

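Then let $L^\bullet$ be a complex in $C^-(A)$. By construction there is a quasi-isomorphism
$\mu_{L^\bullet} : \mathscr{P}(L^\bullet) \to L^\bullet$. Since $\mathscr{F}$ maps quasi-isomorphisms to isomorphisms, the
induced morphism

$$\nu_{L^\bullet} : \underline{\mathscr{F}}(\mathscr{P}(L^\bullet)) = \mathscr{F}(\mathscr{P}(L^\bullet)) \xrightarrow{\ \mathscr{F}(\mu_{L^\bullet})\ } \mathscr{F}(L^\bullet)$$

is an isomorphism. We just have to verify that the isomorphisms $\nu_{L^\bullet}$ define a
natural transformation: that is, that for all morphisms $\alpha^\bullet : L^\bullet \to M^\bullet$ in $C^-(A)$ the
diagram

$$
\begin{array}{ccc}
\underline{\mathscr{F}}(\mathscr{P}(L^\bullet)) & \xrightarrow{\ \underline{\mathscr{F}}(\mathscr{P}(\alpha^\bullet))\ } & \underline{\mathscr{F}}(\mathscr{P}(M^\bullet)) \\
{\scriptstyle \nu_{L^\bullet}}\downarrow & & \downarrow{\scriptstyle \nu_{M^\bullet}} \\
\mathscr{F}(L^\bullet) & \xrightarrow{\ \mathscr{F}(\alpha^\bullet)\ } & \mathscr{F}(M^\bullet)
\end{array}
$$

commutes. This diagram (in $D$) is obtained by applying $\mathscr{F}$ to the diagram (in
$C(A)$)

$$
\begin{array}{ccc}
\mathscr{P}(L^\bullet) & \xrightarrow{\ \alpha'^\bullet\ } & \mathscr{P}(M^\bullet) \\
{\scriptstyle \mu_{L^\bullet}}\downarrow & & \downarrow{\scriptstyle \mu_{M^\bullet}} \\
L^\bullet & \xrightarrow{\ \alpha^\bullet\ } & M^\bullet
\end{array}
$$

where $\alpha'^\bullet$ is a homotopy lift of $\alpha^\bullet$. This diagram commutes up to homotopy, so
the previous one commutes on the nose, again by Lemma 5.1.

To see that $\underline{\mathscr{F}}$ is unique up to isomorphism, let $\mathscr{G} : K^-(P) \to D$ be another
functor satisfying the same property. Thus there is a natural isomorphism
$\nu' : \mathscr{G} \circ \mathscr{P} \rightsquigarrow \mathscr{F}$. For $P^\bullet$ a complex in $K^-(P)$, consider the composition $\rho_{P^\bullet}$:

$$\mathscr{G}(P^\bullet) \xrightarrow{\ \mathscr{G}(\mu_{P^\bullet})^{-1}\ } \mathscr{G}(\mathscr{P}(P^\bullet)) \xrightarrow{\ \nu'_{P^\bullet}\ } \mathscr{F}(P^\bullet) = \underline{\mathscr{F}}(P^\bullet)$$

(keep in mind that since $\mu_{P^\bullet} : \mathscr{P}(P^\bullet) \to P^\bullet$ is a quasi-isomorphism, $\mathscr{G}(\mu_{P^\bullet})$ is
invertible in $D$). The reader will verify (Exercise 6.12) that this defines $\rho : \mathscr{G} \rightsquigarrow \underline{\mathscr{F}}$
as a natural isomorphism, concluding the proof. □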

If $A$ has enough injectives, we can define a functor $\mathscr{Q} : C^+(A) \to K^+(I)$ by
choosing an injective resolution for every bounded-below complex. The situation
concerning this functor is precisely analogous to the situation just reviewed for the
functor $\mathscr{P}$, modulo reversing arrows as appropriate.

Summary. If an abelian category $A$ has enough projectives (resp., enough
injectives), then the homotopic category $K^-(P)$ (resp., $K^+(I)$) works as a replacement
of the derived category $D^-(A)$ (resp., $D^+(A)$). As we have argued earlier, viewing
a complex in these categories is the most natural way to extract the maximum
amount of ‘cohomological’ information from the complex. The category $A$ itself (or
a category equivalent to $A$) sits as a full subcategory of these categories.

To reiterate the key point, there is a clear advantage in stopping short of taking
cohomology. Resolving a complex $L^\bullet$ by a complex $P^\bullet$ of (say) projectives, thanks
to Theorem 6.6, gives us an entity which certainly captures the cohomological
information of $L^\bullet$ but that can still be manipulated with the entire range of tools
available in dealing with complexes. For example, we can apply to $P^\bullet$ any additive
functor and still obtain a complex whose cohomology only depends on the original
complex $L^\bullet$ (by Theorem 4.14). The cohomology of $L^\bullet$ is only one example of the
cohomological invariants that may be extracted from $L^\bullet$; viewing $L^\bullet$ in the derived
category gives direct access to all these invariants.

We will capitalize on these considerations in the next section. 


Exercises 


6.1. ▷ Prove that the full subcategory of $K(A)$ consisting of complexes concentrated
in degree 0 is a copy of the abelian category $A$. [§6.1]

6.2. ▷ Assuming that $A$ has enough projectives, prove that the full subcategory
$\overline{A}$ of $K^-(P)$ consisting of complexes with cohomology concentrated in degree 0 is
equivalent to $A$. (Prove that $H^0$ is fully faithful on this subcategory.) [§6.1, §6.2,
6.13, §7.1]

6.3. ▷ Let $A$ be an abelian category, and let $P^\bullet$ be a complex that is projective as
an object of $C(A)$.

• Prove that each $P^i$ is projective in $A$.

• Prove that there is a morphism of complexes $P^\bullet \to MC(\mathrm{id}_{P^\bullet})[-1]$, such that
for each $i$ the corresponding morphism $P^i \to P^i \oplus P^{i-1}$ is of the form $(\mathrm{id}, h^i)$.

• Prove that the collection $h^i$ gives a homotopy between $\mathrm{id}_{P^\bullet}$ and 0.

• Conclude that $P^\bullet$ is a split exact complex of projectives.

(For example, the complex $\cdots \to 0 \to \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \to 0 \to \cdots$ is not projective in
$C(\mathrm{Ab})$.)

• Conversely, prove that if $P^\bullet$ is a split exact complex of projectives, then $P^\bullet$
is projective in $C(A)$. (Hint: Use Exercises 5.11 and 5.4 to reduce to the case
$\cdots \to 0 \to P \xrightarrow{\ \mathrm{id}\ } P \to 0 \to \cdots$.)

• If $P^\bullet$ is a bounded-above complex of projectives, prove that $P^\bullet$ is projective as
an object of $C(A)$ if and only if it is exact.

[§6.1]

6.4. ▷ Let $A$ be an abelian category with enough projectives, and let $\mathscr{P} : C^-(A) \to K^-(P)$
be a functor choosing a projective resolution for every bounded-above complex,
as in §6.3. Prove that $\mathscr{P}$ induces a functor $K^-(A) \to K^-(P)$, which we will also
denote by $\mathscr{P}$. (Use Lemma 5.14.) Next, let $\mathscr{I}$ be the embedding $K^-(P) \to K^-(A)$.
Prove that $\mathscr{P} \circ \mathscr{I}$ is naturally isomorphic to the identity functor and that there is
a natural transformation (not an isomorphism in general) $\mathscr{I} \circ \mathscr{P} \rightsquigarrow \mathrm{id}_{K^-(A)}$. [§7.1,
§7.2]

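6.5. (Cf. Exercise 5.14.) Let $A$ be an abelian category with enough projectives,
and suppose there is a chain of quasi-isomorphisms

[Diagram not recoverable from the source: a zigzag of quasi-isomorphisms joining two
complexes of projectives through intermediate complexes $L_i^\bullet$.]

in $C^-(A)$, with the two end complexes in $C^-(P)$. Prove that the two end complexes are
homotopy equivalent.

Formulate and prove a statement in which the morphisms point to (rather than
from) the complexes $L_i^\bullet$ that are sources in the chain above.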

6.6. Let $A$ be an abelian category, and assume for simplicity that $A$ has enough
projectives. Let $\mu^\bullet : M^\bullet \to N^\bullet$ be a morphism in $C^-(A)$. Prove that $\mu^\bullet$ induces
the zero-morphism in $D^-(A)$ if and only if there exists a quasi-isomorphism
$\lambda^\bullet : L^\bullet \to M^\bullet$ in $C^-(A)$ such that $\mu^\bullet \circ \lambda^\bullet$ is homotopic to zero.

6.7. For any integer $m > 1$, view the short exact sequence

$$0 \to \mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z} \to \mathbb{Z}/m\mathbb{Z} \to 0$$

as a complex in $C(\mathrm{Ab})$. Prove that the identity morphism on this complex is not
homotopic to zero, but it induces the zero-morphism in the derived category $D^-(\mathrm{Ab})$.

6.8. For any integer $m > 1$, view the short exact sequence

$$0 \to \mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z} \to \mathbb{Z}/m\mathbb{Z} \to 0$$

as a complex in $C(\mathrm{Ab})$. Find explicitly a projective resolution of this complex
produced by the argument at the beginning of the proof of Theorem 6.6.

6.9. ¬ Show that the morphism induced in $D^-(\mathrm{Ab})$ by the cochain map

0 tZ — - — tZ to 

i I 

0 t Z t Z/3Z t o 

is the zero-morphism, while the morphism induced by 
0 tZ — - — tZ >0 

id 2 

0 1 z 1 Z/3Z 1 o 

is not the zero-morphism. Note that both morphisms induce 0 in cohomology.
Thus, nonzero morphisms in the derived category may induce the zero-morphism
in cohomology. (This is another indication that the derived category carries more
information than ‘just’ cohomology.) [6.10, 6.11]

6.10. ▷ Consider the second morphism given in Exercise 6.9, together with a
projective resolution of the bottom complex:



Prove that a ‘true’ lift as indicated by the dashed arrows does not exist but that 
one exists up to homotopy (as prescribed by Theorem 6.6). [§6.2] 

6.11. Let $R$ be an integral domain, and let $r$ be a nonzero, nonunit element of $R$.
Construct a nonzero morphism in $D^-(R\text{-Mod})$ between the complexes

$$
\begin{array}{ccccccccccc}
\cdots & \to & 0 & \to & 0 & \to & R/(r) & \to & 0 & \to & \cdots \\
\cdots & \to & 0 & \to & R & \to & 0 & \to & 0 & \to & \cdots
\end{array}
$$

matching degrees as indicated. (This will be particularly easy if you have worked
out Exercise 6.9.)

6.12. ▷ Complete the proof that the functor $\underline{\mathscr{F}}$ in Theorem 6.9 is unique up to
natural isomorphism. [§6.3]

6.13. ▷ (Cf. Exercise 6.2.) Let $A$ be an abelian category with enough projectives,
and let $\overline{A}$ be the subcategory of $K^-(P)$ whose objects have cohomology concentrated
in degree 0. Choosing (arbitrarily) a projective resolution for every object of $A$
and lifting morphisms as in Proposition 6.5, we obtain a functor $A \to \overline{A}$; with
notation as in §6.3, this is $\mathscr{P} \circ \iota$. It is clear that $H^0 \circ \mathscr{P} \circ \iota$ is the identity functor
on $A$. Prove that there is a natural transformation from the identity functor on $\overline{A}$
to $\mathscr{P} \circ \iota \circ H^0$. [§7.1]


7. Derived functors 

We will now adopt the blanket assumption that $A$ is an abelian category with
enough projectives, so that, as explained in §6.3, the homotopic category $K^-(P)$
is a concrete realization (up to equivalence of categories) of the derived category
of the bounded-above complexes in $A$. Of course everything would go through,
with appropriate changes, for bounded-below complexes in the presence of enough
injectives. These assumptions are not necessary in order to develop this material,
but they simplify it substantially.


7.1. Viewpoint shift. After deriving categories, it should not seem too far-fetched 
to derive functors. 

We have seen (Exercises 6.2 and 6.13) that, if $A$ has enough projectives, then
the full subcategory $\overline{A}$ of $K^-(P)$ whose objects are complexes with cohomology
concentrated in degree 0 is equivalent to $A$: $H^0$ is a fully faithful, surjective functor
$\overline{A} \to A$. With notation as in §6.3, the functor $\mathscr{P} \circ \iota$ associating with every object
of $A$ a projective resolution gives a ‘weak’ inverse to $H^0$. As pointed out in §6.3,
there are many different possible $\mathscr{P}$; each of them will determine a copy $\mathscr{P} \circ \iota(A)$
of $A$ in $K^-(P)$.

It may be argued that any such realization of (a category equivalent to) $A$ is
‘better’ than $A$ itself. Indeed, we have repeatedly stressed that all the cohomological
information carried by a complex is ideally captured by viewing that complex in
the derived category, and for the complex $\iota(M)$ determined in the simplest way by
an object $M$ of $A$, this simply means replacing $\iota(M)$ with any projective resolution
of $M$, viewed as an object of $K^-(P)$.

This, however, raises a question: if $\mathscr{F} : A \to B$ is a functor and we are serious
about replacing $A$ with its counterpart(s) in $D^-(A)$, then there should be a way to
‘reinterpret’ $\mathscr{F}$ in this new context: induce in some natural way a ‘derived functor’
$D^-(A) \to D^-(B)$. One would hope that this functor should carry at least as much
information as $\mathscr{F}$ and satisfy better properties.

Now we claim that there indeed is an evident way to induce a functor between
the realizations of the derived categories in terms of $K^-(P)$. It is a little less clear
what question this functor answers, that is, what kind of universal property it
satisfies. The reader is invited to sort all of this out before we do it.

Say that $\mathscr{F} : A \to B$ is an additive functor between two abelian categories. It
is clear that $\mathscr{F}$ extends to a functor

$$C(\mathscr{F}) : C(A) \to C(B)$$

(we briefly encountered this functor in §4): the complex $L^\bullet$ is sent to the complex
$C(\mathscr{F})(L^\bullet)$ whose objects are $\mathscr{F}(L^i)$ and whose differentials are $\mathscr{F}(d^i_{L^\bullet})$. Note that
the functoriality and additivity of $\mathscr{F}$ imply that

$$\mathscr{F}(d^{i+1}_{L^\bullet}) \circ \mathscr{F}(d^i_{L^\bullet}) = \mathscr{F}(d^{i+1}_{L^\bullet} \circ d^i_{L^\bullet}) = \mathscr{F}(0) = 0,$$

so that $C(\mathscr{F})(L^\bullet)$ is indeed a complex. It is equally clear that if $\alpha^\bullet$ is a morphism of
cochain complexes in $C(A)$, then the collection $\mathscr{F}(\alpha^i)$ defines a morphism in $C(B)$,
and that this assignment is functorial. Now, we already checked (Lemma 4.13) that
homotopic morphisms in $C(A)$ are sent to homotopic morphisms in $C(B)$; it follows
that $\mathscr{F}$ induces a functor $K(\mathscr{F}) : K(A) \to K(B)$. The diagram

$$
\begin{array}{ccc}
A & \xrightarrow{\ \mathscr{F}\ } & B \\
\downarrow & & \downarrow \\
K(A) & \xrightarrow{\ K(\mathscr{F})\ } & K(B)
\end{array}
$$

commutes. Also, this operation clearly preserves boundedness conditions.
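
The computation behind the appeal to Lemma 4.13 is short enough to write out. What follows is only a sketch: it assumes the usual convention for a homotopy (a family $h^i : L^i \to M^{i-1}$), which is not restated in this section, and the signs may differ from the convention of §4.

```latex
% Assumed convention: a homotopy between \alpha^\bullet, \beta^\bullet : L^\bullet \to M^\bullet
% is a family of morphisms h^i : L^i \to M^{i-1} such that
\beta^i - \alpha^i \;=\; d^{\,i-1}_{M^\bullet} \circ h^i \;+\; h^{i+1} \circ d^{\,i}_{L^\bullet} .

% Applying the additive functor \mathscr{F} term by term (additivity is what
% lets \mathscr{F} pass through the sum and the difference):
\mathscr{F}(\beta^i) - \mathscr{F}(\alpha^i)
  \;=\; \mathscr{F}(d^{\,i-1}_{M^\bullet}) \circ \mathscr{F}(h^i)
  \;+\; \mathscr{F}(h^{i+1}) \circ \mathscr{F}(d^{\,i}_{L^\bullet}) .

% Hence the family \mathscr{F}(h^i) is a homotopy between
% C(\mathscr{F})(\alpha^\bullet) and C(\mathscr{F})(\beta^\bullet).
```

This is the only place where additivity of $\mathscr{F}$ is used to pass from $C(\mathscr{F})$ to $K(\mathscr{F})$; functoriality alone would not be enough.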

If now it were the case that additive functors send projective objects to
projective objects, then we could restrict $K(\mathscr{F})$ to $K^-(P(A))$ and land in $K^-(P(B))$ (where
$P(A)$ and $P(B)$ denote the classes of projective objects in $A$, resp., $B$); given our
discussion in §6.3, this would give us a natural candidate for the ‘derived functor’
of $\mathscr{F}$.

However, additive functors do not preserve projectivity or injectivity in general.
Why should they? For example, ‘tensor’ does not preserve projectives^{30}
(Exercise 7.1). Assuming that $B$ has enough projectives, the natural way to ‘fix’ this
problem is to apply a corresponding functor $\mathscr{P}_B$ constructed as in §6.3, associating
with each complex a projective resolution. By Lemma 5.14, $\mathscr{P}_B$ sends homotopic
morphisms to homotopic morphisms; hence $\mathscr{P}_B$ also descends to a functor defined
on the homotopic category (cf. Exercise 6.4).

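Definition 7.1. The left-derived functor of $\mathscr{F}$ is the composition

$$L\mathscr{F} :\ K^-(P(A)) \xrightarrow{\ \mathscr{I}_A\ } K^-(A) \xrightarrow{\ K(\mathscr{F})\ } K^-(B) \xrightarrow{\ \mathscr{P}_B\ } K^-(P(B))$$

where $\mathscr{I}_A$ is the ‘inclusion’ of $K^-(P(A))$ as a full subcategory of $K^-(A)$. ⌟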


It would seem sensible to denote this functor as $D^-(\mathscr{F})$, but the notation ‘$L\mathscr{F}$’
is well established, so we will adopt it.

Of course a parallel discussion can be carried out if $A$, $B$ have enough injectives,
and it leads to a ‘right-derived functor’ $D^+(\mathscr{F})$, pardon me, $R\mathscr{F}$.

^{30} Note that tensor is right-exact, and it is so because it is left-adjoint to another functor; see
Corollary VIII.2.5. Functors that are left-adjoints to right-exact functors do preserve projectives:
Exercise 5.6.

7.2. Universal property of the derived functor. The definition we have given
for the derived functor depends on the chosen resolution functor $\mathscr{P}_B$, so that
$L\mathscr{F} : K^-(P(A)) \to K^-(P(B))$ is really only defined up to a natural isomorphism. In any
case, the derived functor should be thought of as the natural counterpart of $\mathscr{F}$ at
the level of the derived categories of $A$ and $B$: up to equivalences of categories, $L\mathscr{F}$
acts as $D^-(A) \to D^-(B)$.

Still, it is not too clear exactly what question this definition answers. For
example, the derived functor does not fit into a commutative diagram analogous to
the one displayed above:

$$
\begin{array}{ccc}
\overline{A} & \xrightarrow{\ ??\ } & \overline{B} \\
\downarrow & & \downarrow \\
K^-(P(A)) & \xrightarrow{\ L\mathscr{F}\ } & K^-(P(B))
\end{array}
$$

Here $\overline{A}$ and $\overline{B}$ are the replacements for $A$, $B$ in the corresponding derived categories,
discussed at the beginning of §7.1, that is, the subcategories whose objects have
cohomology concentrated in degree 0. The point is that there is no reason why
applying $L\mathscr{F}$ to a complex whose cohomology is concentrated in degree 0 should
yield a complex with the same property, unless $\mathscr{F}$ is very special to begin with.

This may be viewed as a nuisance. On the contrary, it is one of the main values
of deriving categories and functors. Recasting an additive functor $\mathscr{F} : A \to B$ at
the level of derived categories, one gets access to interesting new invariants even
when starting from (the equivalent copy in the derived category of) $A$ itself.

Exactness plays an important role in these considerations:

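Example 7.2. Suppose that $\mathscr{F}$ is (additive and) exact. Then we claim that $\overline{A}$ is
sent to $\overline{B}$ by $L\mathscr{F}$.

Indeed, let $P^\bullet$ be a complex in $\overline{A}$: $H^i(P^\bullet) = 0$ for $i \neq 0$. The image $L\mathscr{F}(P^\bullet)$
is obtained by choosing a projective resolution $\mathscr{P}_B(\mathscr{F}(P^\bullet))$ of $\mathscr{F}(P^\bullet)$. Since
quasi-isomorphisms preserve cohomology and exact functors commute with cohomology
(Exercise 3.7),

$$H^i(L\mathscr{F}(P^\bullet)) = H^i(\mathscr{P}_B(\mathscr{F}(P^\bullet))) = H^i(\mathscr{F}(P^\bullet)) \cong \mathscr{F}(H^i(P^\bullet)) = 0$$

for $i \neq 0$. Therefore $L\mathscr{F}(P^\bullet)$ is an object of $\overline{B}$.

We also see that $H^0(L\mathscr{F}(P^\bullet)) = \mathscr{F}(H^0(P^\bullet))$ in this case: this says that the
restriction of $L\mathscr{F}$ to $\overline{A} \to \overline{B}$ agrees with $\mathscr{F}$ if $\mathscr{F}$ is exact, modulo the equivalences
of categories between $A$, resp., $B$, and $\overline{A}$, resp., $\overline{B}$. ⌟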

The moral of Example 7.2 is that deriving exact functors is relatively
straightforward, and we cannot expect to learn anything new about an exact functor by
deriving it. We will mostly be interested in deriving functors that are not exact on
the nose but preserve a certain amount of exactness.

Another diagram that may look promising is

$$
\begin{array}{ccc}
K^-(A) & \xrightarrow{\ K(\mathscr{F})\ } & K^-(B) \\
{\scriptstyle \mathscr{P}_A}\downarrow & & \downarrow{\scriptstyle \mathscr{P}_B} \\
K^-(P(A)) & \xrightarrow{\ L\mathscr{F}\ } & K^-(P(B))
\end{array}
$$

This also should not be expected to commute. Note that since most of the functors
appearing here are only defined up to natural isomorphism, the best we could hope
for is that this diagram commutes up to natural isomorphism: that is, conceivably
there could be a natural isomorphism

$$L\mathscr{F} \circ \mathscr{P}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F}).$$

Well, in general there isn’t (Exercise 7.3). What is $L\mathscr{F}$ good for, then?

Proposition 7.3. The left-derived functor $L\mathscr{F}$ satisfies the following universal
property:

• There is a natural transformation

$$L\mathscr{F} \circ \mathscr{P}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F});$$

• for every functor $\mathscr{G} : K^-(P(A)) \to K^-(P(B))$ and every natural transformation
$\gamma : \mathscr{G} \circ \mathscr{P}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F})$, there is a unique (up to natural isomorphism) natural
transformation $\mathscr{G} \rightsquigarrow L\mathscr{F}$ inducing a factorization of $\gamma$:

$$\mathscr{G} \circ \mathscr{P}_A \rightsquigarrow L\mathscr{F} \circ \mathscr{P}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F}).$$

Thus, excuse our poor diagram, for it is doing its best to commute, and any
choice of a bottom side other than $L\mathscr{F}$ would make it commute even less.

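Proof. If we have done our homework (and in particular Exercise 6.4), then we
know that there is a natural transformation

$$\mathscr{I}_A \circ \mathscr{P}_A \rightsquigarrow \mathrm{id}_{K^-(A)};$$

composing on the left by $\mathscr{P}_B \circ K(\mathscr{F})$ gives the first point, since
$L\mathscr{F} = \mathscr{P}_B \circ K(\mathscr{F}) \circ \mathscr{I}_A$, by definition.

To verify the second point, note that every natural transformation $\mathscr{G} \rightsquigarrow L\mathscr{F}$
will induce a natural transformation $\gamma$ as in the statement, by the first point; on
the other hand, every factorization of $\gamma$,

$$\mathscr{G} \circ \mathscr{P}_A \rightsquigarrow L\mathscr{F} \circ \mathscr{P}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F}),$$

comes from one and only one (up to natural isomorphism) natural transformation
$\mathscr{G} \rightsquigarrow L\mathscr{F}$, because this can be recovered by composing on the right by $\mathscr{I}_A$ (and
again using Exercise 6.4):

$$\mathscr{G} \overset{\sim}{\rightsquigarrow} \mathscr{G} \circ \mathscr{P}_A \circ \mathscr{I}_A \rightsquigarrow \mathscr{P}_B \circ K(\mathscr{F}) \circ \mathscr{I}_A = L\mathscr{F}.$$

This concludes the proof. □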

The reader should formulate the universal property satisfied by the
right-derived functor. In this case, not surprisingly, all natural transformations go
‘backwards’.

Remark 7.4. The fact that there is a natural transformation $\mathscr{I} \circ \mathscr{P} \rightsquigarrow \mathrm{id}$
(Exercise 6.4), used in the proof of Proposition 7.3, has another interesting consequence.
Let $A$, $B$, $C$ be abelian categories, and let $\mathscr{F} : A \to B$, $\mathscr{G} : B \to C$ be additive
functors. Assume that $A$ and $B$ have enough projectives, so that the derived functors
$L\mathscr{F}$ and $L\mathscr{G}$ are defined, as well as the derived functor $L(\mathscr{G} \circ \mathscr{F})$. Then there is a
natural transformation

$$L\mathscr{G} \circ L\mathscr{F} \rightsquigarrow L(\mathscr{G} \circ \mathscr{F}).$$

Indeed, in the same notational style as above, $L\mathscr{G} \circ L\mathscr{F}$ expands to

$$\mathscr{P}_C \circ K(\mathscr{G}) \circ \mathscr{I}_B \circ \mathscr{P}_B \circ K(\mathscr{F}) \circ \mathscr{I}_A,$$

and contracting $\mathscr{I}_B \circ \mathscr{P}_B$ to $\mathrm{id}_{K^-(B)}$ takes this to

$$\mathscr{P}_C \circ K(\mathscr{G}) \circ K(\mathscr{F}) \circ \mathscr{I}_A = L(\mathscr{G} \circ \mathscr{F}).$$

If additional conditions are satisfied (for example, if $\mathscr{F}$ preserves projectives), then
this natural transformation is a natural isomorphism, so $L$ ‘distributes’ through
compositions in this case (Exercise 7.4).

Towards the end of the chapter the reader will look again at the relation between
$L\mathscr{G} \circ L\mathscr{F}$ and $L(\mathscr{G} \circ \mathscr{F})$ from a different point of view, using spectral sequences
(Exercises 8.8 and 9.10). ⌟

7.3. Taking cohomology. The content of Proposition 7.3 is again in line with our
main strategy: in moving from $A$ to $B$, the left-derived functor $L\mathscr{F}$ is the functor that
preserves ‘as much cohomological information as possible’ regarding bounded-above
complexes, in the sense that it is the closest one can get to extending $\mathscr{F} : A \to B$
to a functor $D^-(A) \to D^-(B)$.

Applying cohomology extracts this information; actually, when deriving on the
left, the indexing works better if we take homology rather than cohomology. The
i-th left-derived functor of $\mathscr{F}$ is the functor $L_i\mathscr{F} := H^{-i} \circ L\mathscr{F}$. Note that

$$L_i\mathscr{F}(P^\bullet) = H^{-i}(\mathscr{P}_B(C(\mathscr{F})(P^\bullet))) \cong H^{-i}(C(\mathscr{F})(P^\bullet)),$$

since $\mathscr{P}_B(C(\mathscr{F})(P^\bullet))$ is quasi-isomorphic to $C(\mathscr{F})(P^\bullet)$. Thus, if cohomology is all
we are interested in, it is not necessary to find a projective resolution of $C(\mathscr{F})(P^\bullet)$.
(In particular, we can define $L_i\mathscr{F}$ even if $B$ does not have enough projectives.)

At this point we can come down to earth and define each $L_i\mathscr{F}$ as a functor
$A \to B$: we know that this is a somewhat limited scope (because $L\mathscr{F}$ can properly be
defined on the much subtler $D^-(A)$), but it will be the source of the only applications
we will really consider at any level of depth, and historically this is how derived
functors arose originally.

Definition 7.5. Let $A$, $B$ be abelian categories, and assume that $A$ has enough
projectives. Let $\mathscr{F} : A \to B$ be an additive functor. The i-th left-derived functor
$L_i\mathscr{F}$ of $\mathscr{F}$ is the functor $A \to B$ given by

$$L_i\mathscr{F} := H^{-i} \circ L\mathscr{F} \circ \mathscr{P}_A \circ \iota_A.$$

For an object $M$ in $A$, the complex in $C(B)$ with $L_i\mathscr{F}(M)$ in degree $-i$ and with
vanishing differentials is denoted by $L_\bullet\mathscr{F}(M)$. ⌟

Let’s spell this out. Given an object $M$ of $A$, $L_i\mathscr{F}(M)$ is obtained by finding
any projective resolution $P_M^\bullet$ of $M$, applying the functor $C(\mathscr{F})$ to $P_M^\bullet$, to obtain
a complex in $C(B)$, and taking the $(-i)$-th cohomology of this complex^{31}. Up to
isomorphism, the result does not depend on the choice of the projective resolution:
this is clear from the path of concepts that led us here, and it is directly implied by

• Proposition 6.4, showing that any two projective resolutions are homotopy
equivalent, in conjunction with

• Theorem 4.14, showing that the images of homotopy equivalent complexes via
an additive functor have isomorphic cohomology.
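
As a quick check of this recipe in degree 0, here is a sketch under one extra hypothesis that Definition 7.5 does not impose: that $\mathscr{F}$ is right-exact. The name $P_M^\bullet$ stands for any chosen projective resolution of $M$, and $d^{-1}$ for its last differential.

```latex
% P_M^\bullet is a projective resolution of M, so the sequence
P_M^{-1} \xrightarrow{\ d^{-1}\ } P_M^{0} \longrightarrow M \longrightarrow 0
% is exact. A right-exact functor \mathscr{F} (extra assumption) preserves
% the exactness of this piece:
\mathscr{F}(P_M^{-1}) \xrightarrow{\ \mathscr{F}(d^{-1})\ } \mathscr{F}(P_M^{0})
  \longrightarrow \mathscr{F}(M) \longrightarrow 0 .

% The complex C(\mathscr{F})(P_M^\bullet) vanishes in positive degrees, so its
% cohomology in degree 0 is the cokernel of \mathscr{F}(d^{-1}):
L_0\mathscr{F}(M) \;=\; H^{0}\bigl(C(\mathscr{F})(P_M^\bullet)\bigr)
  \;=\; \operatorname{coker}\,\mathscr{F}(d^{-1}) \;\cong\; \mathscr{F}(M).
```

So for a right-exact functor the degree-0 derived functor recovers the functor itself, and the higher $L_i\mathscr{F}$ measure the failure of exactness on the left.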

Of course we can similarly define the i-th right-derived functor $R^i\mathscr{F}$ of an
additive functor $\mathscr{F} : A \to B$ and complexes $R^\bullet\mathscr{F}(M)$ provided that $A$ has enough
injectives: concretely, $R^i\mathscr{F}(M)$ is the cohomology of the image of an injective
resolution of $M$, i.e.,

$$R^i\mathscr{F} = H^i \circ R\mathscr{F} \circ \mathscr{Q}_A \circ \iota_A.$$

So far, we have always implicitly assumed that $\mathscr{F}$ was a covariant functor.
Contravariant functors $A \to B$ should be viewed as covariant functors $A^{op} \to B$
(Definition VIII.1.1); the roles of injectives and projectives should therefore be
swapped. Thus, the right-derived functors of an additive contravariant functor
$A \to B$ will be defined if $A$ has enough projectives: $A^{op}$ will then have enough
injectives, as needed.

The attentive reader already knows two families of examples of derived functors,
both defined from the category $R$-Mod of modules over a (commutative) ring
to itself. Recall that $R$-Mod has both enough projectives and injectives (as seen
in §VIII.6.2 and §VIII.6.3); thus every covariant/contravariant functor $R$-Mod $\to$
$R$-Mod can be derived on the left and on the right.

Example 7.6. Every $R$-module $N$ determines a functor $\_ \otimes_R N : M \mapsto M \otimes_R N$
(see §VIII.2.2). The left-derived functor of $\_ \otimes_R N$ is denoted $\_ \otimes^L_R N$ and acts
$D^-(R\text{-Mod}) \to D^-(R\text{-Mod})$. The i-th left-derived functor of $\_ \otimes_R N$, viewed as a
functor $R$-Mod $\to$ $R$-Mod, is $\mathrm{Tor}_i^R(\_, N)$: indeed, the construction of $\mathrm{Tor}_i^R(M, N)$
given in §VIII.2.4 matches precisely the ‘concrete’ interpretation of the i-th
left-derived functor given above. The reader may note that in §VIII.2.4 we used a
free resolution of $M$; free modules are projective, so this was simply a convenient
way to choose a projective resolution. The fact that we could use any projective
resolution of $M$ to compute $\mathrm{Tor}_i^R(M, N)$ was mentioned at the end of §VIII.6.2 and
was attributed there to the ‘magic of homological algebra’. This piece of magic has
now been explained in gory detail.

However, we also mentioned that flat resolutions may be used in place of
projective resolutions, and this piece of magic still needs to be explained. The same
applies to the fact, mentioned in §VIII.2.4, that $\mathrm{Tor}_i^R(M, N)$ may in fact be
computed by using a projective resolution of $N$ rather than $M$. Both mysteries will be
dispelled in §8. ⌟
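
To see the recipe at work in the smallest nontrivial case, here is a worked sketch. The choices $R = \mathbb{Z}$, $M = \mathbb{Z}/m\mathbb{Z}$, $N = \mathbb{Z}/n\mathbb{Z}$ (with $m, n \ge 1$) and the abbreviation $d = \gcd(m, n)$ are illustrative assumptions, not taken from the text.

```latex
% Illustrative assumptions: R = \mathbb{Z}, M = \mathbb{Z}/m\mathbb{Z},
% N = \mathbb{Z}/n\mathbb{Z}, d = \gcd(m,n).
% A free (hence projective) resolution of M, placed in degrees -1 and 0:
0 \longrightarrow \mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z} \longrightarrow 0 .

% Apply \_ \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z} term by term,
% using \mathbb{Z} \otimes_{\mathbb{Z}} N \cong N:
0 \longrightarrow \mathbb{Z}/n\mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z}/n\mathbb{Z} \longrightarrow 0 .

% Take cohomology in degrees 0 and -1:
\mathrm{Tor}_0^{\mathbb{Z}}(M,N) \;=\; \operatorname{coker}(\cdot m) \;\cong\; \mathbb{Z}/d\mathbb{Z},
\qquad
\mathrm{Tor}_1^{\mathbb{Z}}(M,N) \;=\; \ker(\cdot m) \;\cong\; \mathbb{Z}/d\mathbb{Z},
\qquad
\mathrm{Tor}_i^{\mathbb{Z}}(M,N) \;=\; 0 \ \text{ for } i \ge 2 .
```

The degree-0 answer agrees with $M \otimes_{\mathbb{Z}} N \cong \mathbb{Z}/d\mathbb{Z}$, as it must since tensor is right-exact.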

^{31} Projective resolutions of an object are complexes with nonzero objects only in degree $\le 0$;
hence $L_i\mathscr{F} = 0$ for negative $i$.

Example 7.7. Similarly, $\mathrm{Hom}_R$ admits a right-derived functor $R\mathrm{Hom}_R$, and its
manifestations as the right-derived functors of $\mathrm{Hom}_R(M, \_)$ are the Ext modules:
$\mathrm{Ext}^i_R(M, N)$ is (isomorphic to) the i-th cohomology of $\mathrm{Hom}_R(M, N^\bullet)$, where $N^\bullet$ is
an injective resolution of $N$. As mentioned in §VIII.6.4, it is also the i-th cohomology
of $\mathrm{Hom}_R(M^\bullet, N)$, where $M^\bullet$ is a projective resolution of $M$. This projective
resolution should really be viewed as an injective resolution in the opposite category
$R\text{-Mod}^{op}$, since the functor $\mathrm{Hom}_R(\_, N)$ is contravariant.

At this point we understand why the results of these operations are independent
of the chosen injective/projective resolution. We are, however, not quite ready to
verify that the two strategies lead to the same Ext functor; this last point will also
be clarified in §8.

Also note that we could define the Ext functors as functors to Ab on any
abelian category with enough injectives and/or projectives: any abelian category
has left-exact Hom functors^{32} to Ab. ⌟
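
The second strategy (a projective resolution of the first argument) is easy to carry out by hand in a small case. As before, the choices $R = \mathbb{Z}$, $M = \mathbb{Z}/m\mathbb{Z}$, $N = \mathbb{Z}/n\mathbb{Z}$ and $d = \gcd(m, n)$ are illustrative assumptions made only for this sketch.

```latex
% Illustrative assumptions: R = \mathbb{Z}, M = \mathbb{Z}/m\mathbb{Z},
% N = \mathbb{Z}/n\mathbb{Z}, d = \gcd(m,n).
% Projective resolution of M (degrees -1 and 0):
0 \longrightarrow \mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z} \longrightarrow 0 .

% Apply the contravariant functor \mathrm{Hom}_{\mathbb{Z}}(\_, N); the arrow is
% reversed, degrees become 0 and 1, and \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}, N) \cong N:
0 \longrightarrow \mathbb{Z}/n\mathbb{Z} \xrightarrow{\ \cdot m\ } \mathbb{Z}/n\mathbb{Z} \longrightarrow 0 .

% Take cohomology in degrees 0 and 1:
\mathrm{Ext}^0_{\mathbb{Z}}(M,N) \;=\; \ker(\cdot m) \;\cong\; \mathbb{Z}/d\mathbb{Z}
  \;\cong\; \mathrm{Hom}_{\mathbb{Z}}(M,N),
\qquad
\mathrm{Ext}^1_{\mathbb{Z}}(M,N) \;=\; \operatorname{coker}(\cdot m) \;\cong\; \mathbb{Z}/d\mathbb{Z},
\qquad
\mathrm{Ext}^i_{\mathbb{Z}}(M,N) \;=\; 0 \ \text{ for } i \ge 2 .
```

Compared with the Tor computation for the same pair of modules, the kernel and cokernel trade places because Hom in the first variable reverses arrows.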

7.4. Long exact sequence of derived functors. The most remarkable property
of the functors $\mathrm{Tor}_i$ and $\mathrm{Ext}^i$ mentioned in Chapter VIII is probably that they
‘repair’ the lack of exactness of $\otimes$, Hom, respectively, in the sense that they agree
with these functors in degree 0 and they fit into long exact sequences. We stated the
existence of these exact sequences in §VIII.2.4 and §VIII.6.4, without proof (save
for indications in case the base ring $R$ is a PID); now we are ready to understand
fully why these sequences exist, in the general context of derived functors.

We will keep assuming that $A$ has enough projectives. We will prove that every
short exact sequence

$$0 \to L \to M \to N \to 0$$

in $A$ induces a ‘long exact sequence of derived functors’; the sequences for Tor and
Ext encountered in Chapter VIII will be particular cases. Surely the reader expects
this general fact to follow one way or the other from the long exact cohomology
sequence (Theorem 3.5); that reader will not be disappointed.

From a more sophisticated perspective, what happens is that derived functors
fit the vertices of a ‘distinguished triangle’ in the derived category: as we mentioned
in passing in §4.2, these triangles play the role of exact sequences in the homotopic
and derived categories, which do not happen to be abelian. Distinguished triangles
give rise to long exact sequences, in much the same way as do exact sequences of
complexes in the abelian case explored in §3.3.

Since we do not have the machinery of triangulated categories at our disposal,
we have to resort to bringing the action back to the ordinary category of complexes,
with the aim of using Theorem 3.5. The key point is therefore the following:
assuming that $A$ has enough projectives and that

$$0 \to L \to M \to N \to 0$$

is an exact sequence in $A$, can we arrange for projective resolutions of $L$, $M$, $N$ to
form an exact sequence in $C(A)$?

^{32} In fact, the presence of injectives or projectives may be bypassed if we adopt the ‘Yoneda’
viewpoint on Ext.

Yes. This is often called the ‘horseshoe lemma’, after the shape of the main
diagram appearing in its proof.

Lemma 7.8. Let

$$(*) \qquad 0 \to L \to M \to N \to 0$$

be an exact sequence in an abelian category $A$ with enough projectives. Assume
$P_L^\bullet$, $P_N^\bullet$ are projective resolutions of $L$, $N$, respectively. Then there exists an exact
sequence

$$(**) \qquad 0 \to P_L^\bullet \to P_M^\bullet \to P_N^\bullet \to 0$$

where $P_M^\bullet$ is a projective resolution of $M$, inducing (*) in cohomology.

By ‘inducing (*) in cohomology’ we mean that the (not too) long exact
cohomology sequence induced by (**) reduces to the $H^0$ part, since all other cohomology
objects of a resolution vanish; identifying the zero-th cohomology of the resolutions
with the corresponding objects, this part is nothing but the original short exact
sequence (*).

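Proof. The hypotheses give us the solid part of the diagram

$$
\begin{array}{ccccccccc}
0 & \to & L & \to & M & \to & N & \to & 0 \\
 & & \uparrow & & & & \uparrow & & \\
0 & \to & P_L^{0} & & & & P_N^{0} & \to & 0 \\
 & & \uparrow & & & & \uparrow & & \\
0 & \to & P_L^{-1} & & & & P_N^{-1} & \to & 0 \\
 & & \uparrow & & & & \uparrow & & \\
 & & \vdots & & & & \vdots & &
\end{array}
$$

and our task is to fill in the blanks with projective objects and morphisms so that
all rows are exact, and the middle column is a resolution of $M$. We claim that
we can let $P_M^i := P_L^i \oplus P_N^i$; this is projective (Exercise 5.4), and the standard
morphisms make each sequence

$$0 \to P_L^i \to P_L^i \oplus P_N^i \to P_N^i \to 0$$

exact, so all we have to produce are morphisms giving an exact complex

$$\cdots \to P_L^{-2} \oplus P_N^{-2} \to P_L^{-1} \oplus P_N^{-1} \to P_L^{0} \oplus P_N^{0} \to M \to 0$$

fitting in the commutative diagram displayed above.

As always, the construction follows an inductive pattern. Since $P_N^0$ is projective
and $M \to N$ is an epimorphism, the morphism $P_N^0 \to N$ lifts to a morphism
$\beta : P_N^0 \to M$. On the other hand, $P_L^0$ maps to $M$ via $\alpha : P_L^0 \to L \to M$. Thus,
there is a morphism

$$\pi_M := \alpha \oplus \beta : P_L^0 \oplus P_N^0 \to M,$$

and the diagram

$$
\begin{array}{ccccccccc}
 & & 0 & & & & 0 & & \\
 & & \uparrow & & & & \uparrow & & \\
0 & \to & L & \to & M & \to & N & \to & 0 \\
 & & \uparrow{\scriptstyle \pi_L} & & \uparrow{\scriptstyle \pi_M} & & \uparrow{\scriptstyle \pi_N} & & \\
0 & \to & P_L^0 & \to & P_L^0 \oplus P_N^0 & \to & P_N^0 & \to & 0
\end{array}
$$

commutes by construction (note that the composition $P_L^0 \oplus P_N^0 \to M \to N$ is
0 on the $P_L^0$ factor, by the exactness of the top row). The morphism $\pi_M$ is an
epimorphism, by an immediate application of the snake lemma.

The snake lemma and the fact that $\pi_L$ is an epimorphism also imply that the
kernels of the vertical maps form an exact sequence, and we have epimorphisms
from $P_L^{-1}$ and $P_N^{-1}$ to the corresponding kernels since the columns of the original
diagram are exact:

$$
\begin{array}{ccccccccc}
0 & \to & \ker \pi_L & \to & \ker \pi_M & \to & \ker \pi_N & \to & 0 \\
 & & \uparrow & & & & \uparrow & & \\
 & & P_L^{-1} & & & & P_N^{-1} & &
\end{array}
$$

We let $P_M^{-1} = P_L^{-1} \oplus P_N^{-1}$ and define a morphism
$P_L^{-1} \oplus P_N^{-1} \to \ker \pi_M \to P_L^0 \oplus P_N^0$ by
the same mechanism used above:

$$
\begin{array}{ccccccccc}
 & & 0 & & 0 & & 0 & & \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \to & L & \to & M & \to & N & \to & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \to & P_L^0 & \to & P_L^0 \oplus P_N^0 & \to & P_N^0 & \to & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \to & P_L^{-1} & \to & P_L^{-1} \oplus P_N^{-1} & \to & P_N^{-1} & \to & 0
\end{array}
$$

By the snake lemma, $P_L^{-1} \oplus P_N^{-1} \to \ker \pi_M$ is an epimorphism, and this implies
exactness of the central column at $P_L^0 \oplus P_N^0$. Continuing inductively constructs $P_M^\bullet$
as required. □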
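
One way to read the outcome of this construction, offered as a sketch rather than as part of the proof: relative to the decomposition $P_M^i = P_L^i \oplus P_N^i$, the differential of $P_M^\bullet$ is ‘upper triangular’, because the inclusion of $P_L^\bullet$ and the projection to $P_N^\bullet$ must both be morphisms of complexes. The morphisms $\theta^i$ below are hypothetical names for the off-diagonal components; they are not notation from the text.

```latex
% Hypothetical notation: \theta^i : P_N^i \to P_L^{i+1} is the off-diagonal
% component of the differential produced by the inductive lifting.
d_M^{\,i} \;=\;
\begin{pmatrix} d_L^{\,i} & \theta^{i} \\ 0 & d_N^{\,i} \end{pmatrix}
\;:\; P_L^{i} \oplus P_N^{i} \longrightarrow P_L^{i+1} \oplus P_N^{i+1},
\qquad (x, y) \longmapsto \bigl(d_L^{\,i}x + \theta^{i}y,\ d_N^{\,i}y\bigr).

% Multiplying two consecutive matrices and using d_L \circ d_L = 0 and
% d_N \circ d_N = 0, the condition d_M^{\,i+1} \circ d_M^{\,i} = 0 reduces to
d_L^{\,i+1} \circ \theta^{i} \;+\; \theta^{i+1} \circ d_N^{\,i} \;=\; 0 .
```

In general the $\theta^i$ cannot all be taken to be zero: $P_M^\bullet$ is a direct sum of $P_L^\bullet$ and $P_N^\bullet$ degree by degree, but usually not as a complex.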


Lemma 7.8 tells us that we can lift exact sequences of objects to exact sequences
of projective resolutions. Morally, we would like to say that the functor assigning
to every object of $A$ a projective resolution in $K(A)$ is ‘exact’; but as $K(A)$ is not
abelian, this is simply not an option. Lemma 7.8 gets as close as possible to such a
statement. In fact, from the construction of $P_M^\bullet$ given in the proof, it is easy to show
(Exercise 7.5) that $P_M^\bullet$ is the mapping cone of a morphism $\rho^\bullet : P_N[-1]^\bullet \to P_L^\bullet$, so
that we obtain a ‘distinguished triangle’

$$P_L^\bullet \longrightarrow MC(\rho)^\bullet = P_M^\bullet \longrightarrow P_N^\bullet \xrightarrow{\ +1\ } P_L^\bullet$$

in the sense mentioned in §4.1. These are the triangles giving the homotopic
category the structure of ‘triangulated category’. Thus, the functor $\mathscr{P}$ allows us to
construct a distinguished triangle in $K(P)$ starting from a ‘special’ exact triangle

$$\iota(L) \longrightarrow \iota(M) \longrightarrow \iota(N) \xrightarrow{\ +1\ } \iota(L)$$

in the sense of §3.4. The horseshoe lemma shows that this functor preserves as
much exactness as is allowed by the context.

We should also note that once short exact sequences are lifted, we can in fact 
lift every complex; this will be needed later. Here is the precise statement: 

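Corollary 7.9. Let

\[ M^\bullet : \quad \cdots \to M^{-3} \to M^{-2} \to M^{-1} \to M^{0} \to 0 \]

be a complex in an abelian category A with enough projectives. Then there is a
complex of complexes:

\[ P^\bullet_{M^\bullet} : \quad \cdots \to P^\bullet_{M^{-3}} \to P^\bullet_{M^{-2}} \to P^\bullet_{M^{-1}} \to P^\bullet_{M^{0}} \to 0 \]

such that each $P^\bullet_{M^i}$ is a projective resolution of $M^i$ and inducing $M^\bullet$ in cohomology.
If $M^\bullet$ is exact, then $P^\bullet_{M^\bullet}$ may be chosen to be exact.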

Proof. Break up $M^\bullet$ into short exact sequences

\[ 0 \to K^i \to M^i \to I^{i+1} \to 0 \]

together with exact sequences

\[ 0 \to I^i \to K^i \to H^i \to 0 \]

where $K^i$ is the kernel of $M^i \to M^{i+1}$, $I^{i+1}$ is its image, and $H^i$ is the cohomology
at $M^i$. Choose arbitrary projective resolutions $P^\bullet_{I^i}$, $P^\bullet_{H^i}$ of $I^i$ and $H^i$, for all i.
Then the horseshoe lemma (Lemma 7.8) yields projective resolutions of $K^i$ and
short exact sequences

\[ 0 \to P^\bullet_{I^i} \to P^\bullet_{K^i} \to P^\bullet_{H^i} \to 0 \]

with evident notation, and then the horseshoe lemma again and the previous
sequences give projective resolutions $P^\bullet_{M^i}$ of $M^i$ and short exact sequences

\[ 0 \to P^\bullet_{K^i} \to P^\bullet_{M^i} \to P^\bullet_{I^{i+1}} \to 0 . \]

The $P^\bullet_{M^i}$ can now be assembled into a sequence, by using differentials obtained by
the compositions

\[ P^\bullet_{M^i} \twoheadrightarrow P^\bullet_{I^{i+1}} \hookrightarrow P^\bullet_{K^{i+1}} \hookrightarrow P^\bullet_{M^{i+1}} . \]

It is clear that one obtains a complex and that $P^\bullet_{K^i}$, resp., $P^\bullet_{I^{i+1}}$, are the kernel,
resp., the image, of $P^\bullet_{M^i} \to P^\bullet_{M^{i+1}}$ (Exercise 1.21). The statements follow. (If
$M^\bullet$ is exact, then $I^i = K^i$, and the construction gives $P^\bullet_{I^i} = P^\bullet_{K^i}$, so that $P^\bullet_{M^\bullet}$ is
exact.) □


Remark 7.10. The proof of Corollary 7.9 shows that the resolution

\[ \cdots \to P^\bullet_{M^{-2}} \xrightarrow{d^{-2}} P^\bullet_{M^{-1}} \xrightarrow{d^{-1}} P^\bullet_{M^{0}} \to 0 \]

can in fact be chosen so that $\operatorname{im} d^i$, $\ker d^i$, and the cohomology $\ker d^i / \operatorname{im} d^{i-1}$ are
all projective resolutions of the corresponding objects from $M^\bullet$. This is useful in
some applications. Resolutions satisfying this property are called Cartan-Eilenberg
(or ‘fully projective’) resolutions.


Coming back to the issue at hand, we are one step away from the promised 
long exact sequence of derived functors. The last ingredient is the following: 

Lemma 7.11. Let A be an abelian category, and let

\[ (*) \qquad 0 \to L^\bullet \to M^\bullet \to P^\bullet \to 0 \]

be an exact sequence of complexes in A, where $P^i$ is projective for all i. Let $\mathscr{F} :$
A $\to$ B be any additive functor of abelian categories. Then the sequence obtained
by applying $\mathscr{F}$ to (*),

\[ 0 \to \mathscr{F}(L^\bullet) \to \mathscr{F}(M^\bullet) \to \mathscr{F}(P^\bullet) \to 0 , \]

is exact.


Note that we are not asking $\mathscr{F}$ to be exact in any sense.

Proof. Since $P^i$ is projective, the sequence

\[ 0 \to L^i \to M^i \to P^i \to 0 \]

splits (see the end of §VIII.6.1). It follows that

\[ 0 \to \mathscr{F}(L^i) \to \mathscr{F}(M^i) \to \mathscr{F}(P^i) \to 0 \]

is (split and) exact for all i (Exercise 5.11). Exactness of a sequence of complexes
is determined by exactness at each degree, so this proves the statement. □
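
To see why the projectivity hypothesis in Lemma 7.11 cannot simply be dropped, here is a standard illustration. It is not taken from the text above: it assumes the categories are both Ab, the functor is tensoring with $\mathbb{Z}/2\mathbb{Z}$ (as in Exercise 7.1), and the complexes are concentrated in degree 0.

```latex
% Assumption: F = _ \otimes_{\mathbb Z} \mathbb Z/2\mathbb Z on Ab; the third term Z/2Z is NOT projective.
\[
0 \to \mathbb{Z} \xrightarrow{\ \cdot 2\ } \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0
\qquad\leadsto\qquad
0 \to \mathbb{Z}/2\mathbb{Z} \xrightarrow{\ 0\ } \mathbb{Z}/2\mathbb{Z} \xrightarrow{\ \cong\ } \mathbb{Z}/2\mathbb{Z} \to 0 .
\]
% Multiplication by 2 becomes the zero map on Z/2Z, so the image sequence
% is not exact on the left: the original sequence does not split.
```

In the situation of the lemma this cannot happen, because a sequence ending in a projective object splits and additive functors preserve split exact sequences.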

Now the long exact sequence of left-derived functors is an immediate
consequence of the long exact cohomology sequence.

Theorem 7.12. Let $\mathscr{F} :$ A $\to$ B be an additive functor of abelian categories, and
assume A has enough projectives. Every exact sequence

\[ 0 \to L \to M \to N \to 0 \]

in A induces a long exact sequence

\[
\begin{array}{l}
\cdots \to L_2\mathscr{F}(L) \to L_2\mathscr{F}(M) \to L_2\mathscr{F}(N) \\
\quad \xrightarrow{\ \delta_2\ } L_1\mathscr{F}(L) \to L_1\mathscr{F}(M) \to L_1\mathscr{F}(N) \\
\quad \xrightarrow{\ \delta_1\ } L_0\mathscr{F}(L) \to L_0\mathscr{F}(M) \to L_0\mathscr{F}(N) \to 0
\end{array}
\]

in B. Further, this assignment is covariantly functorial.


The last sentence means that a morphism of short exact sequences will induce 
a morphism of the corresponding long exact sequences, compatibly with compo- 
sitions. We will leave the diagram chases necessary to prove this to the reader 
(Exercise 7.7). 


Proof. By Lemma 7.8, the given exact sequence is induced by an exact sequence
of projective resolutions

\[ 0 \to P_L^\bullet \to P_M^\bullet \to P_N^\bullet \to 0 . \]

By Lemma 7.11, the corresponding sequence

\[ 0 \to \mathscr{F}(P_L^\bullet) \to \mathscr{F}(P_M^\bullet) \to \mathscr{F}(P_N^\bullet) \to 0 \]

in C(B) is exact. As the cohomology objects of these complexes are precisely the
left-derived functors of $\mathscr{F}$, the long exact cohomology sequence determined by this
sequence (by Theorem 3.5) gives a long exact sequence as stated. □

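Once more in the style of §3.4, Theorem 7.12 tells us that the triangle

\[
\begin{array}{ccc}
 & \iota(L) & \\
 {\scriptstyle +1}\nearrow & & \searrow \\
 \iota(N) & \longleftarrow & \iota(M)
\end{array}
\]

induces a triangle (33)

\[
\begin{array}{ccc}
 & L_\bullet\mathscr{F}(L) & \\
 {\scriptstyle +1}\nearrow & & \searrow \\
 L_\bullet\mathscr{F}(N) & \longleftarrow & L_\bullet\mathscr{F}(M)
\end{array}
\]

The vertices of this triangle are complexes in $C^{\le 0}$(B).


(33) The indexing is potentially confusing: the lower $\bullet$ indicates ‘homological’ indexing, which
is opposite to the degree of the corresponding cochain complexes. So $L_i\mathscr{F}(N)$ corresponds to
degree $-i$, which is sent by the northeast arrow to cohomological degree $-i + 1$, corresponding to
$L_{i-1}\mathscr{F}(L)$, as it should.

A completely similar situation occurs for right-derived functors: if A has enough
injectives, an exact sequence as above induces a triangle

\[
\begin{array}{ccc}
 & R^\bullet\mathscr{F}(L) & \\
 {\scriptstyle +1}\nearrow & & \searrow \\
 R^\bullet\mathscr{F}(N) & \longleftarrow & R^\bullet\mathscr{F}(M)
\end{array}
\]

where now the vertices are complexes in $C^{\ge 0}$(B).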


7.5. Relating $\mathscr{F}$, $L_i\mathscr{F}$, $R^i\mathscr{F}$. The reader may have noticed that $\otimes$ was derived
to the left to obtain Tor, while Hom was derived to the right to obtain Ext. The
asymmetry mandating this choice lies in the fact that $\otimes$ is right-exact, while Hom is
left-exact: any additive functor can be derived to the left or to the right (in the
presence of enough projectives, resp., enough injectives), and the derived functors will
fit in long exact sequences as proven in Theorem 7.12; but only functors satisfying
a measure of exactness can be recovered directly from their derived versions.

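To state this fact more precisely, we go back to general considerations for the
left-derived functor of an additive functor $\mathscr{F} :$ A $\to$ B, where A and B are abelian
categories and A has enough projectives, and we now assume that $\mathscr{F}$ is right-exact.

Proposition 7.13. Let $\mathscr{F} :$ A $\to$ B be a right-exact additive functor. Then $L_i\mathscr{F} = 0$
for $i < 0$, and $L_0\mathscr{F}$ is naturally isomorphic to $\mathscr{F}$.


Proof. Projective resolutions $P^\bullet$ of an object M of A are in $C^{\le 0}$(A): it follows
that $C(\mathscr{F})(P^\bullet)$ is 0 in positive degree, hence so is its cohomology. Since $H_i = H^{-i}$
(cf. §3.1), the first claim follows.

As for the second, since the sequence

\[ P^{-1} \to P^{0} \to M \to 0 \]

is exact by construction and since $\mathscr{F}$ is right-exact, the sequence

\[ \mathscr{F}(P^{-1}) \to \mathscr{F}(P^{0}) \to \mathscr{F}(M) \to 0 \]

is exact. This induces an isomorphism $\nu_M : L_0\mathscr{F}(M) = H^0(C(\mathscr{F})(P^\bullet)) \xrightarrow{\ \sim\ } \mathscr{F}(M)$.

If $\varphi : M_0 \to M_1$ is a morphism in A, by Proposition 6.5 there exists a morphism of
projective resolutions $\alpha^\bullet : P_0^\bullet \to P_1^\bullet$ inducing $\varphi$: there is a commutative diagram

\[
\begin{array}{ccccccccc}
\cdots & \to & P_0^{-1} & \to & P_0^{0} & \to & M_0 & \to & 0 \\
 & & \downarrow{\scriptstyle \alpha^{-1}} & & \downarrow{\scriptstyle \alpha^{0}} & & \downarrow{\scriptstyle \varphi} & & \\
\cdots & \to & P_1^{-1} & \to & P_1^{0} & \to & M_1 & \to & 0
\end{array}
\]

and $H^0(\alpha^\bullet)$ agrees with $\varphi$. Applying $\mathscr{F}$ and truncating, we get a commutative
diagram

\[
\begin{array}{ccccccc}
\mathscr{F}(P_0^{-1}) & \to & \mathscr{F}(P_0^{0}) & \to & \mathscr{F}(M_0) & \to & 0 \\
\downarrow{\scriptstyle \mathscr{F}(\alpha^{-1})} & & \downarrow{\scriptstyle \mathscr{F}(\alpha^{0})} & & \downarrow{\scriptstyle \mathscr{F}(\varphi)} & & \\
\mathscr{F}(P_1^{-1}) & \to & \mathscr{F}(P_1^{0}) & \to & \mathscr{F}(M_1) & \to & 0
\end{array}
\]

with exact rows since $\mathscr{F}$ is right-exact. It follows that the diagram

\[
\begin{array}{ccc}
L_0\mathscr{F}(M_0) & \xrightarrow{\ \nu_{M_0}\ } & \mathscr{F}(M_0) \\
\downarrow{\scriptstyle L_0\mathscr{F}(\varphi)} & & \downarrow{\scriptstyle \mathscr{F}(\varphi)} \\
L_0\mathscr{F}(M_1) & \xrightarrow{\ \nu_{M_1}\ } & \mathscr{F}(M_1)
\end{array}
\]

commutes, proving that $\nu : L_0\mathscr{F} \leadsto \mathscr{F}$ is a natural isomorphism. □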


Going back to Theorem 7.12, we see that if $\mathscr{F}$ is right-exact, then the tail end
of the long exact sequence of left-derived functors for $\mathscr{F}$ consists of an application
of $\mathscr{F}$ itself. Thus, the situation in this case is the following: starting from a short
exact sequence

\[ 0 \to L \to M \to N \to 0 \]

in A, we apply $\mathscr{F}$ to obtain an exact sequence

\[ \mathscr{F}(L) \to \mathscr{F}(M) \to \mathscr{F}(N) \to 0 \]

in which we lost ‘the 0 on the left’. The long exact sequence saves the day, continuing
the new sequence into an exact complex:

\[
\begin{array}{l}
\cdots \to L_2\mathscr{F}(L) \to L_2\mathscr{F}(M) \to L_2\mathscr{F}(N) \\
\quad \xrightarrow{\ \delta_2\ } L_1\mathscr{F}(L) \to L_1\mathscr{F}(M) \to L_1\mathscr{F}(N) \\
\quad \xrightarrow{\ \delta_1\ } \mathscr{F}(L) \to \mathscr{F}(M) \to \mathscr{F}(N) \to 0
\end{array}
\]

That is, $L_1\mathscr{F}$ measures the extent to which $\mathscr{F}$ fails to be left-exact, and the
higher $L_i\mathscr{F}$ give further measures of this failure.
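
As a small worked instance of this principle, the following computation reads off a first derived functor from the tail of the long exact sequence. It is an illustration under stated assumptions rather than part of the text: the functor is taken to be tensoring with $\mathbb{Z}/2\mathbb{Z}$ on Ab (which is right-exact), and the short exact sequence is multiplication by 2 on $\mathbb{Z}$.

```latex
% Assumption: F = _ \otimes_{\mathbb Z} \mathbb Z/2\mathbb Z, applied to 0 -> Z --2--> Z -> Z/2Z -> 0.
% Z is projective, so L_1 F(Z) = 0; also F(Z) = Z/2Z and F(.2) = 0.
\[
\underbrace{L_1\mathscr{F}(\mathbb{Z})}_{=\,0}
\to L_1\mathscr{F}(\mathbb{Z}/2\mathbb{Z})
\xrightarrow{\ \delta_1\ } \mathbb{Z}/2\mathbb{Z}
\xrightarrow{\ 0\ } \mathbb{Z}/2\mathbb{Z}
\to \mathbb{Z}/2\mathbb{Z} \to 0
\]
% Exactness forces \delta_1 to be injective with image ker(0) = Z/2Z, hence
\[
L_1\mathscr{F}(\mathbb{Z}/2\mathbb{Z}) \cong \mathbb{Z}/2\mathbb{Z}
\;=\; \operatorname{Tor}_1^{\mathbb{Z}}(\mathbb{Z}/2\mathbb{Z}, \mathbb{Z}/2\mathbb{Z}).
\]
```

Here the nonzero $L_1\mathscr{F}$ records exactly the injectivity that was lost when the functor was applied to multiplication by 2.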

Similarly, $R^0\mathscr{F}$ is naturally isomorphic to $\mathscr{F}$ if $\mathscr{F}$ is left-exact. In this case,
applying $\mathscr{F}$ to the original sequence gives the exact sequence

\[ 0 \to \mathscr{F}(L) \to \mathscr{F}(M) \to \mathscr{F}(N) \]

where we lost the 0 on the right; the job of the right-derived functors of $\mathscr{F}$ is to
extend this sequence to an exact complex

\[
\begin{array}{l}
0 \to \mathscr{F}(L) \to \mathscr{F}(M) \to \mathscr{F}(N) \\
\quad \xrightarrow{\ \delta^0\ } R^1\mathscr{F}(L) \to R^1\mathscr{F}(M) \to R^1\mathscr{F}(N) \\
\quad \xrightarrow{\ \delta^1\ } R^2\mathscr{F}(L) \to R^2\mathscr{F}(M) \to R^2\mathscr{F}(N) \to \cdots
\end{array}
\]

No identification of $\mathscr{F}$ with any of its derived functors should be expected if $\mathscr{F}$
does not satisfy some exactness property. If $\mathscr{F}$ is exact on the nose, then both $L_0\mathscr{F}$
and $R^0\mathscr{F}$ agree with $\mathscr{F}$ (up to natural isomorphism); but this is no cause for great
excitement, since all other $L_i\mathscr{F}$, $R^i\mathscr{F}$ vanish in this case (Exercise 7.8). As already
pointed out in Example 7.2, one should not expect to learn anything new from an
exact functor by deriving it.


7.6. Example: A little group cohomology. In the beginning of this chapter
we pointed out that the general strategy informing the development of homological
algebra has numerous applications and we mentioned group and sheaf cohomology
as examples. Here are a few words on group cohomology (sheaf cohomology would
take us too far).

How do we extract ‘cohomological invariants’ from a group G? Consider the
category G-Mod of abelian groups endowed with a left-G-action, equivalently, the
category of left-$\mathbb{Z}[G]$-modules, where $\mathbb{Z}[G]$ is the group ring briefly encountered
in §III.1.4. Objects of G-Mod may be called G-modules.

For a G-module M, $M^G$ denotes the set of elements that are fixed under the
action of G: these are reasonably called the invariants of the action. Note that
$M^G$ is an abelian group carrying a trivial action of G; it is clear that setting
$M \mapsto M^G$ defines a covariant functor $\_^G$ : G-Mod $\to$ Ab. Both G-Mod and Ab
are abelian categories, and it takes a moment to realize that $\_^G$ is a left-exact
functor: the reader should either check this directly or do Exercise 7.16 and then
remember Claim VIII.1.19. The reader should in fact contemplate why this functor
is not right-exact: if G acts trivially on a coset [m] of a quotient M/L, there is no
reason a priori why G should act trivially on a representative m.

We would have drawn this conclusion without difficulty before delving into ho- 
mological algebra. Homological algebra tells us how to address this type of situation 
and ‘quantify’ precisely the failure of exactness, by means of an appropriate derived 
functor. 

The i-th right-derived functor of $\_^G$ is denoted $H^i(G, \_)$. Therefore, $H^0(G, M) = M^G$,
and for every short exact sequence of G-modules

\[ 0 \to L \to M \to N \to 0 \]

we obtain (from Theorem 7.12) a long exact sequence of abelian groups:

\[ 0 \to L^G \to M^G \to N^G \to H^1(G, L) \to H^1(G, M) \to \cdots . \]

Note that $M^G = \operatorname{Hom}_{\mathbb{Z}[G]}(\mathbb{Z}, M)$, where $\mathbb{Z}$ is given the trivial module structure.
Thus, $H^i(G, M) = \operatorname{Ext}^i_{\mathbb{Z}[G]}(\mathbb{Z}, M)$; as we know already (and as will finally be verified
in §8), this can be computed by an injective resolution of M or by a projective
resolution of $\mathbb{Z}$.

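Example 7.14. Take $G = \mathbb{Z}$. Then $\mathbb{Z}[G]$ is the ring $\mathbb{Z}[x, x^{-1}]$ of Laurent
polynomials. As $\mathbb{Z}[x, x^{-1}]/(1 - x) \cong \mathbb{Z}$, with the trivial action (check this!), the complex

\[ \cdots \to 0 \to \mathbb{Z}[x, x^{-1}] \xrightarrow{\ \cdot(1-x)\ } \mathbb{Z}[x, x^{-1}] \to 0 \to \cdots \]

is a free, hence projective, resolution of $\mathbb{Z}$, endowed with the trivial $\mathbb{Z}$-action.
Applying the (contravariant) $\operatorname{Hom}_{\mathbb{Z}[x, x^{-1}]}(\_, M)$, we see that $H^\bullet(\mathbb{Z}, M)$ is computed
by the cohomology of

\[ \cdots \to 0 \to M \xrightarrow{\ \cdot(1-x)\ } M \to 0 \to \cdots . \]

‘Multiplication by x’ on M is nothing but the action of the generator of $G = \mathbb{Z}$.
We conclude that $H^0(\mathbb{Z}, M) = \ker(M \xrightarrow{\cdot(1-x)} M) = M^{\mathbb{Z}}$ (as we knew already),

\[ H^1(\mathbb{Z}, M) = \frac{M}{(1 - x)M} , \]

and $H^i(\mathbb{Z}, M) = 0$ for all $i < 0$ and $i \ge 2$.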
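
For a finite module the two groups of Example 7.14 can be found by direct enumeration. The following is a minimal sketch, assuming M = Z/n with the generator of G = Z acting as multiplication by a unit u modulo n; the function name and this particular family of modules are illustrative choices, not part of the text.

```python
from math import gcd


def orders_h0_h1_of_Z(n, u):
    """Return (|H^0(Z, M)|, |H^1(Z, M)|) for M = Z/n, generator acting as m -> u*m.

    Assumption (hypothetical setup): u must be a unit mod n so that
    x -> (multiplication by u) really defines an action of the group Z.
    """
    if gcd(u, n) != 1:
        raise ValueError("u must be invertible modulo n")
    elements = range(n)
    # H^0 = ker(1 - x) = invariants M^Z
    invariants = [m for m in elements if ((1 - u) * m) % n == 0]
    # H^1 = M / (1 - x)M, so its order is n divided by the size of the image
    image = {((1 - u) * m) % n for m in elements}
    return len(invariants), n // len(image)


print(orders_h0_h1_of_Z(12, 5))  # (4, 4)
print(orders_h0_h1_of_Z(7, 1))   # (7, 7): trivial action, H^0 = H^1 = M
```

For finite M the two orders always agree, since kernel and cokernel of an endomorphism of a finite group have the same size; the groups themselves are what the example describes.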


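Example 7.15 (Finite cyclic groups). Let $G = C_m$ be a cyclic group of order
m; then $\mathbb{Z}[G] = \mathbb{Z}[x]/(x^m - 1)$. Again it is not difficult to produce a projective
resolution of $\mathbb{Z}$ (with trivial action) in the category of $\mathbb{Z}[C_m]$-modules: letting
$N = 1 + x + \cdots + x^{m-1} = (1 - x^m)/(1 - x)$, the reader will verify that the complex

\[ \cdots \xrightarrow{\ \cdot N\ } \mathbb{Z}[C_m] \xrightarrow{\ \cdot(1-x)\ } \mathbb{Z}[C_m] \xrightarrow{\ \cdot N\ } \mathbb{Z}[C_m] \xrightarrow{\ \cdot(1-x)\ } \mathbb{Z}[C_m] \to 0 \to \cdots \]

has cohomology $\cong \mathbb{Z}$, concentrated in degree 0 (i.e., on the last nonzero term).
Applying $\operatorname{Hom}_{\mathbb{Z}[C_m]}(\_, M)$, we get the complex

\[ \cdots \to 0 \to M \xrightarrow{\ \cdot(1-x)\ } M \xrightarrow{\ \cdot N\ } M \xrightarrow{\ \cdot(1-x)\ } M \xrightarrow{\ \cdot N\ } \cdots \]

and deduce that $H^i(C_m, M) = 0$ for $i < 0$, $H^0(C_m, M) = M^{C_m}$, and

\[
H^{2i+1}(C_m, M) \cong \frac{\ker(M \xrightarrow{\cdot N} M)}{(1 - x)M} \quad (i \ge 0), \qquad
H^{2i}(C_m, M) \cong \frac{M^{C_m}}{N M} \quad (i > 0).
\]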
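
The periodic pattern of Example 7.15 is easy to tabulate for small finite modules. Below is a minimal sketch, assuming M = Z/n with the generator x of C_m acting as multiplication by u, where u^m is 1 modulo n; the function name and the choice of module are hypothetical and serve only as an illustration of the formulas for odd and even degree.

```python
def cyclic_cohomology_orders(m, n, u):
    """Orders of H^0, H^odd and H^even (degree > 0) of C_m acting on Z/n.

    Assumption (illustrative setup): the generator x acts as a -> u*a mod n,
    which is a C_m-action exactly when u**m == 1 (mod n).
    """
    if pow(u, m, n) != 1 % n:
        raise ValueError("u must satisfy u**m == 1 (mod n)")
    elements = range(n)
    # N = 1 + x + ... + x^(m-1) acts as multiplication by this residue
    norm = sum(pow(u, k, n) for k in range(m)) % n
    invariants = [a for a in elements if ((u - 1) * a) % n == 0]   # M^{C_m}
    ker_norm = [a for a in elements if (norm * a) % n == 0]        # ker(N)
    im_norm = {(norm * a) % n for a in elements}                   # N M
    im_one_minus_x = {((1 - u) * a) % n for a in elements}         # (1 - x)M
    h0 = len(invariants)
    h_odd = len(ker_norm) // len(im_one_minus_x)
    h_even = len(invariants) // len(im_norm)
    return h0, h_odd, h_even


print(cyclic_cohomology_orders(2, 4, 3))  # (2, 2, 2): C_2 acting on Z/4 by -1
print(cyclic_cohomology_orders(3, 9, 1))  # (9, 3, 3): trivial action of C_3 on Z/9
```

Only the orders are computed; the divisions are exact because (1 - x)M lies in ker N and NM lies in the invariants.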


There is a standard free resolution of $\mathbb{Z}$ over a group ring $\mathbb{Z}[G]$, which leads to
a concrete description of group cohomology. For a finite group G, of order m, the
beginning of this resolution goes as follows. Consider the complex

\[ (\dagger) \qquad \cdots \to \mathbb{Z}[G]^{m^2} \xrightarrow{\ d^{-2}\ } \mathbb{Z}[G]^{m} \xrightarrow{\ d^{-1}\ } \mathbb{Z}[G] \xrightarrow{\ d^{0}\ } \mathbb{Z} \to 0 \to \cdots \]

Here, a basis of $\mathbb{Z}[G]^{m^k}$ over $\mathbb{Z}[G]$ consists of k-tuples $[g_1, \dots, g_k]$ of elements of G.
The morphisms in (†) are defined by setting

\[ d^{-1}([g]) = 1 \cdot g - 1 \cdot e_G \in \mathbb{Z}[G], \]

\[ d^{-2}([g_1, g_2]) = g_1 [g_2] - [g_1 g_2] + [g_1] \]

on the bases and extending by $\mathbb{Z}[G]$-linearity, and $d^0$ sends every $g \in G$ to 1, so
that $d^0(\sum_{g \in G} a_g\, g) = \sum_{g \in G} a_g$. (Needless to say, $d^{-k}$ may be defined for all k.) The
reader will check that (†) is exact (Exercise 7.17), at least for the part relevant to
the discussion that follows. Therefore,

\[ \cdots \to \mathbb{Z}[G]^{m^2} \xrightarrow{\ d^{-2}\ } \mathbb{Z}[G]^{m} \xrightarrow{\ d^{-1}\ } \mathbb{Z}[G] \to 0 \to \cdots \]

is (the beginning of) a free $\mathbb{Z}[G]$-resolution of $\mathbb{Z}$ (endowed with the trivial G-action),
and group cohomology can be computed as the cohomology of the complex obtained
by applying $\operatorname{Hom}_{\mathbb{Z}[G]}(\_, M)$ to this resolution:

\[ 0 \to \operatorname{Hom}_{\mathbb{Z}[G]}(\mathbb{Z}[G], M) \to \operatorname{Hom}_{\mathbb{Z}[G]}(\mathbb{Z}[G]^{m}, M) \to \operatorname{Hom}_{\mathbb{Z}[G]}(\mathbb{Z}[G]^{m^2}, M) \to \cdots . \]

Here, $\operatorname{Hom}_{\mathbb{Z}[G]}(\mathbb{Z}[G]^{m^k}, M) \cong M^{m^k}$ may be identified with the abelian group of
functions $G^k \to M$, since every $\mathbb{Z}[G]$-linear map $\mathbb{Z}[G]^{m^k} \to M$ is determined by
its action on a basis. Denote this abelian group by $C^k(G, M)$: this is the group of
k-cochains of G with values in M.

We can now summarize what we have found as follows: 

Proposition 7.16. Let G be a finite group, and let M be a G-module. Then the
group cohomology $H^\bullet(G, M)$ is the cohomology of the cochain complex

\[ 0 \to C^0(G, M) \xrightarrow{\ d_G^0\ } C^1(G, M) \xrightarrow{\ d_G^1\ } C^2(G, M) \to \cdots \]

induced by (†).

Tracing definitions, we see that for $a \in C^0(G, M) = M$ and $\alpha : G \to M$
in $C^1(G, M)$,

\[ d_G^0(a)(g) = g a - a, \]

\[ d_G^1(\alpha)(g_1, g_2) = g_1 \alpha(g_2) - \alpha(g_1 g_2) + \alpha(g_1). \]
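
The two cochain formulas can be tested by brute force on a small example. The sketch below assumes G = C_m written additively as the integers modulo m, acting on M = Z/n by letting the generator multiply by u (with u^m equal to 1 modulo n); these choices and the function name are hypothetical. It enumerates all 1-cochains, keeps the cocycles, checks that every coboundary is a cocycle, and returns the order of the first cohomology group.

```python
from itertools import product


def h1_order_bruteforce(m, n, u):
    """Order of H^1(C_m, Z/n) computed directly from 1-cocycles and 1-coboundaries.

    Assumption (illustrative setup): G = {0, ..., m-1} under addition mod m,
    and the element g acts on a in Z/n as a -> u**g * a mod n.
    """
    if pow(u, m, n) != 1 % n:
        raise ValueError("u must satisfy u**m == 1 (mod n)")
    group = range(m)

    def act(g, a):
        return (pow(u, g, n) * a) % n

    def d0(a):
        # (d^0 a)(g) = g a - a, stored as a tuple indexed by g
        return tuple((act(g, a) - a) % n for g in group)

    def is_cocycle(alpha):
        # (d^1 alpha)(g, h) = g alpha(h) - alpha(g h) + alpha(g) must vanish
        return all(
            (act(g, alpha[h]) - alpha[(g + h) % m] + alpha[g]) % n == 0
            for g in group
            for h in group
        )

    cocycles = [alpha for alpha in product(range(n), repeat=m) if is_cocycle(alpha)]
    coboundaries = {d0(a) for a in range(n)}
    # d^1 composed with d^0 is zero: every coboundary is a cocycle
    assert all(is_cocycle(beta) for beta in coboundaries)
    return len(cocycles) // len(coboundaries)


print(h1_order_bruteforce(2, 4, 3))  # 2
print(h1_order_bruteforce(3, 9, 1))  # 3
```

The search space has n^m elements, so this is only meant for very small cases; its value is that it uses the cochain description alone, independently of any special resolution.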

Example 7.17. Let G be the Galois group of a finite Galois extension $k \subseteq F$. Then
G acts on the multiplicative group $F^*$ of F, and we can view $F^*$ as a G-module.

Claim 7.18. $H^1(G, F^*) = 0$.

Indeed, with notation as above we have $H^1(G, F^*) \cong \ker d_G^1 / \operatorname{im} d_G^0$, and we
can compute this quotient explicitly. Let $\alpha \in C^1(G, F^*)$; denote by $\alpha_g$ the image
of g in $F^*$ by $\alpha$. The ‘cocycle condition’ $\alpha \in \ker d_G^1$ translates into

\[ \alpha_{gh} = \alpha_g \cdot g(\alpha_h) \]

for all g, h in G. (We are now writing the operation in $F^*$ multiplicatively and
viewing elements of G as automorphisms of F.)

The elements $h \in G$ are pairwise distinct as automorphisms of F, so by
Exercise VII.6.14 they must be linearly independent. Therefore, there exists a $\gamma \in F$
such that

\[ \beta := \sum_{h \in G} \alpha_h \cdot h(\gamma) \ne 0 . \]

Thus $\beta \in F^*$, and for all $g \in G$

\[ g(\beta) = \sum_{h \in G} g(\alpha_h) \cdot (gh)(\gamma) \overset{!}{=} \sum_{h \in G} \alpha_g^{-1} \alpha_{gh} \cdot (gh)(\gamma) = \alpha_g^{-1} \beta \]

where the equality marked $\overset{!}{=}$ holds by the cocycle condition (and the last equality
holds because gh ranges over all of G as h does). That is, we have
found that there exists a $\beta \in F^*$ such that

\[ \alpha_g = \beta \cdot g(\beta)^{-1} \]

for all $g \in G$. But this says precisely that $\alpha$ is in the image of $d_G^0$, proving that
$\ker d_G^1 / \operatorname{im} d_G^0 = 0$, as claimed.

Claim 7.18 is significant: it goes under the name of Hilbert's theorem 90,
because in the case in which G is the Galois group of a finite cyclic extension, it
recovers precisely the classical result with this name. The reader had the
opportunity to prove this particular case in Exercise VII.6.16 and now will enjoy the chance
to verify that indeed Claim 7.18 implies Hilbert’s result (Exercise 7.18).


Group cohomology is a useful tool in algebraic number theory, algebraic
topology, and other fields, and so is the companion ‘group homology’ $H_\bullet(G, M)$, defined
analogously by the left-derived functors of the functor $\__G$ associating with every
G-module M the module

\[ M_G = \frac{M}{\langle g m - m \rangle_{g \in G}} \]

(just as the elements of $M^G$ are called invariants, elements of $M_G$ are called
coinvariants of the action). For example, in Example 7.14 we verified that $H^1(\mathbb{Z}, M) \cong
M_{\mathbb{Z}} = H_0(\mathbb{Z}, M)$. The reader should now have no difficulty computing simple
examples, such as the homology $H_\bullet(\mathbb{Z}, M)$ (Exercise 7.19).


Exercises 


7.1. ▷ Let $\mathscr{F}$ : Ab $\to$ Ab be the additive functor defined by tensoring by $\mathbb{Z}/2\mathbb{Z}$:
$\mathscr{F}(A) := A \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$. Prove that no nontrivial projective abelian groups can be
written as $\mathscr{F}(P)$, for any abelian group P. [§7.1]

7.2. Let A, B be abelian categories, let $\mathscr{F}$ : A $\to$ B be an additive functor, and let
$C(\mathscr{F})$ : C(A) $\to$ C(B) be the corresponding functor of complexes. If $\alpha^\bullet : L^\bullet \to M^\bullet$
is a morphism in C(A), prove that $C(\mathscr{F})$ sends the mapping cone of $\alpha^\bullet$ to the
mapping cone of $C(\mathscr{F})(\alpha^\bullet)$.
7.3. ▷ With notation as in Proposition 7.3, prove that the natural transformation

L& o ° K(^) is not necessarily an isomorphism. (Hint: View the 

quasi-isomorphism in Example 4.6 as a projective resolution in $\mathbb{Z}$-Mod, and take
$\mathscr{F}$ : $\mathbb{Z}$-Mod $\to$ $(\mathbb{Z}/2\mathbb{Z})$-Mod to be $\_ \otimes_{\mathbb{Z}} (\mathbb{Z}/2\mathbb{Z})$.) [§7.2]

7.4. ▷ (Cf. Remark 7.4.) Let A, B, C be abelian categories, and let $\mathscr{F}$ : A $\to$ B,
$\mathscr{G}$ : B $\to$ C be additive functors. Assume that A and B have enough projectives,
so that the derived functors $L\mathscr{F}$ and $L\mathscr{G}$ are defined, as well as the derived functor
$L(\mathscr{G} \circ \mathscr{F})$. Further, assume that $\mathscr{F}$ maps projective objects to projective objects.
Prove that $L\mathscr{G} \circ L\mathscr{F}$ and $L(\mathscr{G} \circ \mathscr{F})$ are naturally isomorphic. [§7.2]

7.5. ▷ With notation as in Lemma 7.8, show that there is a morphism $\rho^\bullet :
P_N[-1]^\bullet \to P_L^\bullet$ such that $P_M^\bullet$ is homotopy equivalent to the mapping cone $MC(\rho)^\bullet$.
(Hint: Look at the proof of the lemma; use the differentials $P_L^i \oplus P_N^i \to P_L^{i+1} \oplus P_N^{i+1}$
to define morphisms $P_N^i \to P_L^{i+1}$.) [§7.5]

7.6. ¬ Let B, C be abelian categories, and assume B has enough projectives, so
that Cartan-Eilenberg resolutions can be constructed, as in Corollary 7.9 (cf. also
Remark 7.10). Let $\mathscr{G}$ : B $\to$ C be an additive functor, and let $M^\bullet : \cdots \to M^{-2} \to
M^{-1} \to M^0 \to 0$ be a complex in B. Let $P^\bullet_{M^\bullet} : \cdots \to P^\bullet_{M^{-2}} \to P^\bullet_{M^{-1}} \to P^\bullet_{M^0} \to 0$
be a Cartan-Eilenberg resolution of $M^\bullet$. Prove that $\mathscr{G}$ maps the cohomology of
this complex isomorphically to the cohomology of the complex $\cdots \to \mathscr{G}(P^\bullet_{M^{-2}}) \to
\mathscr{G}(P^\bullet_{M^{-1}}) \to \mathscr{G}(P^\bullet_{M^0}) \to 0$. (Hint: With notation as in the proof of Corollary 7.9,
for all i and j we have exact sequences $0 \to P^j_{K^i} \to P^j_{M^i} \to P^j_{I^{i+1}} \to 0$ and
$0 \to P^j_{I^i} \to P^j_{K^i} \to P^j_{H^i} \to 0$. Note that these necessarily split.)

Stenographically, $\mathscr{G}(H^q(P^\bullet_{M^\bullet})) \cong H^q(\mathscr{G}(P^\bullet_{M^\bullet}))$, where cohomology is computed
with respect to the M-degree. Also note that, by construction, $H^q(P^\bullet_{M^\bullet})$ is a
projective resolution of $H^q(M^\bullet)$; cf. Remark 7.10. This will be an ingredient in
the comparison of derived functors for the composition of two functors by means
of spectral sequences; cf. Exercises 8.8 and 9.10. [9.10]

7.7. ▷ Complete the proof of Theorem 7.12 by showing that morphisms of short
exact sequences induce morphisms of the corresponding long exact sequences of
derived functors. [§7.4]

7.8. ▷ Let A, B be abelian categories, and let $\mathscr{F}$ : A $\to$ B be an exact functor.
Assume A has both enough projectives and enough injectives, so that $L_i\mathscr{F}$ and
$R^i\mathscr{F}$ are both defined. Prove that $L_i\mathscr{F}$ and $R^i\mathscr{F}$ are both the zero functor for
$i \ne 0$. [§7.5]

7.9. ▷ Let A, B be abelian categories, and let $\mathscr{F}$ : A $\to$ B be an additive functor.
Assume A has enough projectives, so that $L_i\mathscr{F}$ is defined. Prove that if P is
projective, then $L_0\mathscr{F}(P) = \mathscr{F}(P)$ and $L_i\mathscr{F}(P) = 0$ for $i \ne 0$. [§8.1]

7.10. Let A, B be abelian categories, and let $\mathscr{F}$ : A $\to$ B be an additive functor.
Assume A has enough projectives, so that $L_i\mathscr{F}$ is defined. Prove that $L_0(\mathscr{F})$ is a
right-exact functor, determining the same higher derived functors $L_i$, $i > 0$, as $\mathscr{F}$.

7.11. ¬ Let A, B be abelian categories, and let $\mathscr{F}$ : A $\to$ B be an additive functor;
assume A has enough projectives. Let

\[ 0 \to K \to P \to A \to 0 \]

be a short exact sequence in A, with P projective. Prove that $L_i\mathscr{F}(A) \cong L_{i-1}\mathscr{F}(K)$
for all $i > 1$. [7.12]

7.12. ▷ Generalize Exercise 7.11 as follows: with the same notation, assume

\[ 0 \to K \to B^{-k} \to \cdots \to B^{-1} \to A \to 0 \]

is an exact sequence in A, such that $L_i\mathscr{F}(B^{-j}) = 0$ for $i > 0$, $1 \le j \le k$. Prove
that $L_i\mathscr{F}(A) \cong L_{i-k}\mathscr{F}(K)$, for all $i > k$. [§8.1, 8.11]

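7.13. ¬ Let A, B be abelian categories. A collection of functors $\mathscr{T}^i$ : A $\to$ B, $i \ge 0$,
is called a (‘cohomological’) δ-functor if

• every short exact sequence

\[ (\dagger) \qquad 0 \to L \to M \to N \to 0 \]

in A determines ‘connecting morphisms’

\[ \delta^i : \mathscr{T}^i(N) \to \mathscr{T}^{i+1}(L) \]

such that the induced sequence

\[
(\ddagger) \qquad
\begin{array}{l}
0 \to \mathscr{T}^0(L) \to \mathscr{T}^0(M) \to \mathscr{T}^0(N) \\
\quad \xrightarrow{\ \delta^0\ } \mathscr{T}^1(L) \to \mathscr{T}^1(M) \to \mathscr{T}^1(N) \xrightarrow{\ \delta^1\ } \cdots
\end{array}
\]

is exact;

• the assignment of a complex (‡) for every short exact sequence (†) is functorial
(in the sense specified in Exercise 3.11).

Prove that for every additive functor $\mathscr{F}$ : A $\to$ B, the derived functors $R^i\mathscr{F}$, $i \ge 0$,
form a cohomological δ-functor. Prove that cohomology itself is a δ-functor from
$C^{\ge 0}$(A) to A.

Define a notion of ‘homological’ δ-functor $\{\mathscr{T}_i\}$, and prove that left-derived
functors give an example. [7.14, 8.14]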

7.14. ¬ ‘Morphisms’ $\{\mathscr{S}^i\} \to \{\mathscr{T}^i\}$ of δ-functors (Exercise 7.13) are natural
transformations of the individual functors $\mathscr{S}^i \leadsto \mathscr{T}^i$ that induce morphisms of the
corresponding long exact sequences (‡), for every short exact sequence (†) (in other
words, that preserve the connecting morphisms δ).

A δ-functor $\{\mathscr{S}^i\}$ is universal if for every δ-functor $\{\mathscr{T}^i\}$, every natural
transformation $\mathscr{S}^0 \leadsto \mathscr{T}^0$ extends to a unique morphism of δ-functors $\{\mathscr{S}^i\} \to \{\mathscr{T}^i\}$.

Universal δ-functors are of course unique up to isomorphism.

It is not hard (but it involves a fair number of diagram chases) to show that a
δ-functor $\{\mathscr{S}^i\}$ is universal if for every $i > 0$ and every object A of A there exists a
monomorphism $\iota : A \to B$ for some B, such that $\mathscr{S}^i(\iota) = 0$. This makes each $\mathscr{S}^i$,
$i > 0$, effaceable. Figure out how the proof begins: assume that $\{\mathscr{S}^i\}$ and $\{\mathscr{T}^i\}$ are
δ-functors and that $\mathscr{S}^1$ is effaceable; show how to extend a natural transformation
$\mathscr{S}^0 \leadsto \mathscr{T}^0$ to a natural transformation $\mathscr{S}^1 \leadsto \mathscr{T}^1$, compatibly with the connecting
morphisms. (Hint: If $\mathscr{S}^1$ is effaced by $\iota : A \to B$, let $C = \operatorname{coker} \iota$ and consider the
exact sequence $0 \to A \to B \to C \to 0$.) [7.15]

7.15. ¬ Assume the result stated in Exercise 7.14. Prove that right-derived
functors are universal cohomological δ-functors. (Equivalently, left-derived functors are
universal homological δ-functors.) For an abelian category A, prove that
cohomology, viewed as a collection of functors from $C^{\ge 0}$(A) to A, is a universal δ-functor.
(Hint: To efface $R^i\mathscr{F}$ for $i > 0$, use injectives. To efface $H^i$ for $i > 0$, stare at the
sequence of complexes mentioned right before the statement of Proposition 4.1.)

The upshot is that in order to verify that a given collection of functors agrees
with the derived functors of a given (say) left-exact functor, it suffices to verify that
they form a cohomological δ-functor and that the effaceability condition holds. This
is used, for example, to obtain concrete realizations of sheaf cohomology. [8.14]

7.16. ▷ Prove that the functor $\_^G$ defined in §7.6 is right-adjoint to the functor
Ab $\to$ G-Mod associating with an abelian group A the same group, endowed with
the trivial G-action. Give an analogous interpretation for the functor $\__G$. [§7.6]

7.17. ▷ Consider the complex (†) defined in §7.6. Define maps $h^{-1}$, $h^0$, $h$ in the
diagram



by 

h(a) := a - e c , h°(g) := [$], h -, ([fl]) := [«g, <?] 

(and extending by $\mathbb{Z}[G]$-linearity). Use these maps to verify that the complex (†)
is exact at $\mathbb{Z}$, $\mathbb{Z}[G]$, $\mathbb{Z}[G]^m$. [§7.6]

7.18. > Deduce Hilbert’s theorem 90 (as stated in Exercise VII.6.16) as a conse- 
quence of the vanishing of Claim 7.18. (Hint: Use the result of Example 7.15). 
[§7.6] 

7.19. ▷ Compute the group homology $H_\bullet(\mathbb{Z}, M)$ for any abelian group M carrying
a $\mathbb{Z}$-action. [§7.6]


8. Double complexes 

A few mysteries remain on the table concerning Tor and Ext: the fact that, e.g., 
flat resolutions may be used in place of projective ones in order to compute Tor and 
the fact that resolving ‘either argument’ leads to the same functors. For example, 
$\operatorname{Ext}^i_R(M, N)$ may be computed by using a projective resolution of M or an injective
resolution of N.

Both mysteries are (of course) instances of general features of the theory, which
we examine in this section. The first can be dispelled with a pleasant inductive
argument that seems worth presenting (§8.1). But everything is clarified
substantially by going one step further and considering double complexes, as we do in the
remaining subsections.


8.1. Resolution by acyclic objects. Let A be an abelian category with enough
projectives, and let $\mathscr{F}$ : A $\to$ B be an additive functor to another abelian category.
At this point we know how to construct left-derived functors $L_i\mathscr{F}$ : A $\to$ B of $\mathscr{F}$:
in two words, $L_i\mathscr{F}(M)$ is computed by applying $\mathscr{F}$ to a projective resolution of M
and taking (co)homology of the resulting complex.

It follows in particular that $L_i\mathscr{F}(P) = 0$ for $i \ge 1$ if P is projective (cf.
Exercise 7.9): if P is projective, then $\iota(P)$:

\[ \cdots \to 0 \to 0 \to P \to 0 \to \cdots \]

is a projective resolution of P, so $L_\bullet\mathscr{F}(P)$ is the cohomology of

\[ \cdots \to 0 \to 0 \to \mathscr{F}(P) \to 0 \to \cdots ; \]

that is,

\[
L_i\mathscr{F}(P) =
\begin{cases}
\mathscr{F}(P), & i = 0, \\
0, & i \ne 0.
\end{cases}
\]

This says that projective objects are ‘$\mathscr{F}$-acyclic’ with respect to left-derived
functors:


Definition 8.1. Let $\mathscr{F}$ be an additive functor. An object M of A is $\mathscr{F}$-acyclic
(w.r.t. left-derived functors) if $L_i\mathscr{F}(M) = 0$ for $i \ne 0$.


An object M is $\mathscr{F}$-acyclic with respect to right-derived functors if $R^i\mathscr{F}(M) = 0$
for $i \ne 0$. In practice, this notion is really useful only for right-exact or left-exact
functors; the first are derived on the left, and the second on the right (cf. §7.5), so
in context we can just talk about ‘$\mathscr{F}$-acyclic’ objects.

While projectives are acyclic for every right-exact functor, a given functor $\mathscr{F}$
may admit other $\mathscr{F}$-acyclic objects.

Example 8.2. Let R be a commutative ring. Recall (Definition VIII.2.13) that an
R-module M is flat if $\_ \otimes_R M$ is exact, or equivalently (by the symmetry of $\otimes$) if
$M \otimes_R \_$ is exact. Flat modules are acyclic with respect to tensor products, in a very
strong sense: if N is flat, then $\operatorname{Tor}_i(M, N) = 0$ for $i \ne 0$ and all M; see §VIII.2.4 to
be reminded of why, or, better, prove it anew. Further, since (as we will finally
prove in this section) Tor functors may be computed by resolving either argument,
we also know that, ‘symmetrically’, if M is flat, then $\operatorname{Tor}_i(M, N) = 0$ for all $i \ne 0$
and all modules N. Thus, a flat module is $\mathscr{F}$-acyclic for every functor $\mathscr{F}$ defined
by $\_ \otimes_R N$, for all modules N.


Here is the punch line. In §VIII.6.4 we had claimed that flat resolutions could
be used in place of free or projective resolutions, in order to compute Tor. We are
going to verify that $\mathscr{F}$-acyclic resolutions suffice in order to compute $L_i\mathscr{F}$.
Theorem 8.3. Let $\mathscr{F}$ : A $\to$ B be a right-exact functor of abelian categories, and
assume A has enough projectives. Let

\[ A^\bullet : \quad \cdots \to A^{-2} \xrightarrow{\ d^{-2}\ } A^{-1} \xrightarrow{\ d^{-1}\ } A^{0} \to 0 \to \cdots \]

be a resolution of an object M of A, such that every object $A^i$ is $\mathscr{F}$-acyclic. Then
for all i

\[ L_i\mathscr{F}(M) \cong H^{-i}(C(\mathscr{F})(A^\bullet)). \]

If the objects $A^i$ are projective, the statement is the very definition of $L_i\mathscr{F}$. The
theorem states that $\mathscr{F}$-acyclic objects may be used in place of projective objects
in the computation of $L_i\mathscr{F}$.

Proof. The first two cases follow from explicit computations. To begin with, the
sequence

\[ A^{-1} \to A^{0} \to M \to 0 \]

is exact by assumption, and $\mathscr{F}$ is right-exact; thus

\[ \mathscr{F}(A^{-1}) \to \mathscr{F}(A^{0}) \to \mathscr{F}(M) \to 0 \]

is exact, and it follows that

\[ H^0(C(\mathscr{F})(A^\bullet)) \cong \mathscr{F}(M) \cong L_0\mathscr{F}(M). \]

Next, let K be the kernel of $A^0 \to M$; so we have the short exact sequence

\[ (*) \qquad 0 \to K \to A^{0} \to M \to 0 . \]

Applying Theorem 7.12 (and Proposition 7.13), we obtain a long exact sequence
ending with

\[ L_1\mathscr{F}(A^0) \to L_1\mathscr{F}(M) \to \mathscr{F}(K) \to \mathscr{F}(A^0) \to \mathscr{F}(M) \to 0 . \]

The leftmost term is 0 since $A^0$ is $\mathscr{F}$-acyclic. It follows that

\[ L_1\mathscr{F}(M) \cong \ker(\mathscr{F}(K) \to \mathscr{F}(A^0)) \cong H^{-1}(C(\mathscr{F})(A^\bullet)) : \]

for the second $\cong$, apply the result of (the straightforward) Exercise 8.1 to the exact
complex

\[ \cdots \to A^{-2} \to A^{-1} \to A^{0} \to M \to 0 . \]

Finally, use induction on i. We have just verified that the theorem holds for all
objects M and for $i = 0, 1$; given $i > 1$, assume the statement is known for all
objects of A and all indices $< i$. Truncating/shifting $A^\bullet$,

\[ \cdots \to A^{-3} \to A^{-2} \to A^{-1} \to 0 \to \cdots \]

(so that $A^{-1}$ is placed in degree 0), gives an $\mathscr{F}$-acyclic resolution of K; by the
induction hypothesis, $L_{i-1}\mathscr{F}(K)$ may be computed by applying $C(\mathscr{F})$ to this complex
and taking cohomology:

\[ L_{i-1}\mathscr{F}(K) \cong H^{-i}(C(\mathscr{F})(A^\bullet)). \]

On the other hand, the other terms in the long exact sequence obtained by applying
Theorem 7.12 to (*) give

\[ \cdots \to L_i\mathscr{F}(A^0) \to L_i\mathscr{F}(M) \to L_{i-1}\mathscr{F}(K) \to L_{i-1}\mathscr{F}(A^0) \to \cdots ; \]

since $A^0$ is $\mathscr{F}$-acyclic and $i > 1$, this shows that

\[ L_i\mathscr{F}(M) \cong L_{i-1}\mathscr{F}(K) \cong H^{-i}(C(\mathscr{F})(A^\bullet)) \]

(also cf. Exercise 7.12) and concludes the proof. □

Example 8.4. Let R be a commutative ring, and let

\[ F_\bullet : \quad \cdots \to F_3 \to F_2 \to F_1 \to F_0 \to 0 \]

be a resolution of an R-module M by flat R-modules. Then for every R-module N,

\[ \operatorname{Tor}_i^R(M, N) \cong H_i(F_\bullet \otimes_R N). \]

Indeed, flat modules are acyclic with respect to $\_ \otimes N$ (Example 8.2), so this is now
a consequence of Theorem 8.3.
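
A quick illustration of how a flat, non-projective resolution can be convenient. The example below is a standard computation that is not part of the text above: it assumes $R = \mathbb{Z}$ and $M = \mathbb{Q}/\mathbb{Z}$, and uses the fact that $\mathbb{Q}$ is a flat (but not projective) $\mathbb{Z}$-module.

```latex
% Assumption: R = Z, M = Q/Z, with the flat resolution F_1 = Z, F_0 = Q.
\[
F_\bullet : \quad 0 \to \mathbb{Z} \to \mathbb{Q} \to 0
\qquad\text{resolves}\qquad \mathbb{Q}/\mathbb{Z}.
\]
% Tensoring with an abelian group N gives the two-term complex N -> Q \otimes N, so
\[
\operatorname{Tor}_1^{\mathbb{Z}}(\mathbb{Q}/\mathbb{Z}, N)
\cong H_1(F_\bullet \otimes_{\mathbb{Z}} N)
= \ker\bigl(N \to \mathbb{Q} \otimes_{\mathbb{Z}} N\bigr),
\qquad
\operatorname{Tor}_i^{\mathbb{Z}}(\mathbb{Q}/\mathbb{Z}, N) = 0 \ \text{ for } i \ge 2 .
\]
% The kernel of N -> Q \otimes N is the torsion subgroup of N.
```

So the first Tor group against $\mathbb{Q}/\mathbb{Z}$ recovers the torsion subgroup of N, which is one way to remember where the name Tor comes from.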


Theorem 8.3 raises an interesting possibility: since $\mathscr{F}$-acyclic objects suffice in
order to compute the derived functors of $\mathscr{F}$ (at least when $\mathscr{F}$ is right-exact), the
reader can imagine that there may be situations in which A does not have enough
projectives, and yet left-derived functors of a functor $\mathscr{F}$ may be defined because A
has enough $\mathscr{F}$-acyclic objects (in a suitable sense). This is indeed the case. The
reader will run into such examples in more advanced treatments of the subject.
(See Exercise 8.14 for a typical situation.)

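There is an alternative viewpoint on the question addressed by Theorem 8.3.
Let $A^\bullet$ be an $\mathscr{F}$-acyclic resolution of an object M, and let $P^\bullet$ be a projective
resolution. We will apply $C(\mathscr{F})$ and place the resulting complexes as sides of an
array:

\[
\begin{array}{ccccccc}
\cdots & \to & \mathscr{F}(A^{-1}) & \to & \mathscr{F}(A^{0}) & \to & \mathscr{F}(M) \\
 & & \vdots & & \vdots & & \uparrow \\
 & & \cdots & & \cdots & & \mathscr{F}(P^{0}) \\
 & & \vdots & & \vdots & & \uparrow \\
 & & \cdots & & \cdots & & \mathscr{F}(P^{-1}) \\
 & & & & & & \uparrow \\
 & & & & & & \vdots
\end{array}
\]

The result of Theorem 8.3 is that taking cohomology of the top row of this diagram
gives the same result as taking cohomology of the rightmost column. Might there
not be a way to ‘interpolate’ between these two cohomologies, by cleverly filling in
the dotted portion of this diagram?

Double complexes may be used to this effect, as we will see in a moment.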
8.2. Complexes of complexes. We warned the reader as far back as §3.2 that we
would examine complexes of complexes, that is, objects of C(C(A)), for an abelian 
category A. We have run across particular cases: the exact sequences of complexes 
leading to the long exact cohomology sequence (§3.3) and the construction of the 
mapping cone (§4.1). The latter case generalizes in a straightforward way, as we 
will see now. 

Keeping with the tradition established earlier in the chapter, we will concentrate 
on the ‘bounded-above’ case; the reader should have no difficulty reproducing the 
bounded- below version of what we will do and should be aware of the fact that the 
material can be developed to a large extent without boundedness conditions. 

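An object of $C^{\le 0}(C^{\le 0}(\mathsf{A}))$ is a complex

\[
M^{\bullet,\bullet}:\quad \cdots \longrightarrow M^{-2,\bullet} \xrightarrow{\,d_h^{-2,\bullet}\,} M^{-1,\bullet} \xrightarrow{\,d_h^{-1,\bullet}\,} M^{0,\bullet} \longrightarrow 0 \longrightarrow \cdots
\]

in which the object $M^{i,\bullet}$ in degree $i$ is itself a complex in $C^{\le 0}(\mathsf{A})$. The subscript
$h$ stands, rather unimaginatively, for ‘horizontal’; we have $d_h^{i+1,\bullet} \circ d_h^{i,\bullet} = 0$. An
example that will motivate the next construction is the particular case in which
only two objects are nonzero:

\[
(*)\qquad \cdots \longrightarrow 0 \longrightarrow L^\bullet \xrightarrow{\,\alpha^\bullet\,} M^\bullet \longrightarrow 0 \longrightarrow \cdots
\]

(and we are considering the particular case in which $L^\bullet$, $M^\bullet$ are bounded above).
We will (arbitrarily) place $M^\bullet$ in degree $0$ and $L^\bullet$ in degree $-1$. We can view the
information carried by (*) as a commutative diagram:

\[
\begin{array}{ccccccc}
 & & \vdots & & \vdots & & \\
 & & \uparrow & & \uparrow & & \\
0 & \longrightarrow & L^{i+1} & \xrightarrow{\,\alpha^{i+1}\,} & M^{i+1} & \longrightarrow & 0 \\
 & & \uparrow{\scriptstyle d_L^{i}} & & \uparrow{\scriptstyle d_M^{i}} & & \\
0 & \longrightarrow & L^{i} & \xrightarrow{\,\alpha^{i}\,} & M^{i} & \longrightarrow & 0 \\
 & & \uparrow & & \uparrow & & \\
 & & \vdots & & \vdots & &
\end{array}
\]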

The mapping cone $MC(\alpha)^\bullet$ constructed in §4.1 is obtained by

• shifting the complex in degree $-i$ by $i$,

• direct-summing the rows and morphisms of the resulting diagram, collapsing
it to a single vertical complex:
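
In formulas, the collapsed complex can be sketched as follows. This assumes the sign convention in which the shifted column carries the differential $-d_L$; the convention of §4.1 may distribute the signs differently. The second display checks that the extra sign is exactly what makes two consecutive differentials compose to zero, using only $\alpha \circ d_L = d_M \circ \alpha$.

```latex
\[
MC(\alpha)^i = L^{i+1}\oplus M^i, \qquad
d^i_{MC}(\ell, m) = \bigl(-d_L^{i+1}(\ell),\ \alpha^{i+1}(\ell) + d_M^i(m)\bigr),
\]
\[
d^{i+1}_{MC}\circ d^i_{MC}(\ell,m)
 = \bigl(d_L^{i+2}d_L^{i+1}(\ell),\
   -\alpha^{i+2}d_L^{i+1}(\ell) + d_M^{i+1}\alpha^{i+1}(\ell) + d_M^{i+1}d_M^i(m)\bigr)
 = (0,0).
\]
```

Without the sign on $d_L$ the two middle terms would add rather than cancel.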

Note that, due to the shift, the diagram on the left has become anticommutative
in the process. The extra signs are necessary in order that $MC(\alpha)^\bullet$ be a complex.
Now we will apply the same procedure to the more general case


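\[
\cdots \longrightarrow M^{-3,\bullet} \xrightarrow{\,d_h\,} M^{-2,\bullet} \xrightarrow{\,d_h\,} M^{-1,\bullet} \xrightarrow{\,d_h\,} M^{0,\bullet} \longrightarrow 0 \longrightarrow \cdots .
\]

This corresponds to the commutative diagram

\[
\begin{array}{ccccccccc}
\cdots & \xrightarrow{\,d_h\,} & M^{-3,0} & \xrightarrow{\,d_h\,} & M^{-2,0} & \xrightarrow{\,d_h\,} & M^{-1,0} & \xrightarrow{\,d_h\,} & M^{0,0} \\
 & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} \\
\cdots & \xrightarrow{\,d_h\,} & M^{-3,-1} & \xrightarrow{\,d_h\,} & M^{-2,-1} & \xrightarrow{\,d_h\,} & M^{-1,-1} & \xrightarrow{\,d_h\,} & M^{0,-1} \\
 & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} & & \uparrow{\scriptstyle d_v} \\
\cdots & \xrightarrow{\,d_h\,} & M^{-3,-2} & \xrightarrow{\,d_h\,} & M^{-2,-2} & \xrightarrow{\,d_h\,} & M^{-1,-2} & \xrightarrow{\,d_h\,} & M^{0,-2} \\
 & & \uparrow & & \uparrow & & \uparrow & & \uparrow \\
 & & \vdots & & \vdots & & \vdots & & \vdots
\end{array}
\]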

surrounded by 0 above and to the right (omitted here not to clutter the diagram; the
upper indices of the differentials are sacrificed for the same reason); the subscript $v$
stands for ‘vertical’. As above, we shift $M^{-i,\bullet}$ by $i$, with the effect of changing
the sign of the differentials in the odd-shifted columns; we will let $\delta_v$ denote $d_v$
on even-degree columns and $-d_v$ on odd-degree columns. The resulting diagram is
anticommutative:
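
\[
\begin{array}{cccc}
 & & & M^{0,0} \\
 & & M^{-1,0} & M^{0,-1} \\
 & M^{-2,0} & M^{-1,-1} & M^{0,-2} \\
M^{-3,0} & M^{-2,-1} & M^{-1,-2} & M^{0,-3} \\
\vdots & \vdots & \vdots & \vdots
\end{array}
\]

(Within each column the arrows $\delta_v$ point upward; the arrows $d_h$ point diagonally upward, from each column to the next one on its right.)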



Taking direct sums of the objects and morphisms on each row, as in the case of the 
mapping cone, determines a new complex, called the total complex TC(M)* of the 
given complex of complexes. 

Double complexes capture this construction: 

Definition 8.5. A double (cochain) complex on an abelian category $\mathsf{A}$ is an array
of objects $M^{i,j}$ of $\mathsf{A}$, $i, j \in \mathbb{Z}$, endowed with morphisms

\[
d_h^{i,j} : M^{i,j} \to M^{i+1,j}, \qquad \delta_v^{i,j} : M^{i,j} \to M^{i,j+1}
\]

such that

• $d_h^{i+1,j} \circ d_h^{i,j} = 0$,

• $\delta_v^{i,j+1} \circ \delta_v^{i,j} = 0$,

• $\delta_v^{i+1,j} \circ d_h^{i,j} + d_h^{i,j+1} \circ \delta_v^{i,j} = 0$.

That is, the columns $M^{i,\bullet}$ and the rows $M^{\bullet,j}$ both form complexes, and the
whole diagram anticommutes. Dropping the position specification, the conditions
defining a double complex are summarized in the easier-to-parse prescription

• $d_h \circ d_h = 0$,

• $\delta_v \circ \delta_v = 0$,

• $\delta_v \circ d_h + d_h \circ \delta_v = 0$.

Double complexes, with or without appropriate boundedness conditions (such as 
requiring $M^{i,j} = 0$ for $i > 0$, $j > 0$, as in the case considered above), form a
category in an evident way. It is equally evident that this category is equivalent
to the correspondingly bounded category of complexes of complexes: every object
of $C^{\le 0}(C^{\le 0}(\mathsf{A}))$ determines a double complex by setting $\delta_v^{i,j} = (-1)^i d_v^{i,j}$ as above.
Changing the signs of odd rows rather than columns leads to an isomorphic double 
complex, Exercise 8.4; in fact, other choices may be convenient depending on the 
situation. 
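
The sign rule can be checked in one line. The sketch below assumes only that the original array commutes, that is, $d_v^{i+1,j}\circ d_h^{i,j} = d_h^{i,j+1}\circ d_v^{i,j}$, and that the signed vertical differential is $\delta_v^{i,j} = (-1)^i d_v^{i,j}$:

```latex
\[
\delta_v^{i+1,j}\circ d_h^{i,j} + d_h^{i,j+1}\circ\delta_v^{i,j}
 = (-1)^{i+1}\, d_v^{i+1,j}\circ d_h^{i,j} + (-1)^{i}\, d_h^{i,j+1}\circ d_v^{i,j}
 = (-1)^{i}\bigl(d_h^{i,j+1}\circ d_v^{i,j} - d_v^{i+1,j}\circ d_h^{i,j}\bigr) = 0 .
\]
```

The same computation goes through if the factor $(-1)^j$ is placed on the horizontal differentials instead, which is the variant compared in Exercise 8.4.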

From this viewpoint, the total complex $TC(M)^\bullet$ (also called the simple of the
double complex $M^{\bullet,\bullet}$) is obtained by direct-summing objects and morphisms of the
double complex along diagonals:

\[
\begin{array}{c}
M^{0,0} \\
\uparrow \\
M^{-1,0}\oplus M^{0,-1} \\
\uparrow \\
M^{-2,0}\oplus M^{-1,-1}\oplus M^{0,-2} \\
\uparrow \\
M^{-3,0}\oplus M^{-2,-1}\oplus M^{-1,-2}\oplus M^{0,-3} \\
\uparrow \\
\vdots
\end{array}
\]


Definition 8.6. The total complex $TC(M)^\bullet$ of a double complex $M^{\bullet,\bullet}$ is defined
by setting $TC(M)^k := \bigoplus_{i+j=k} M^{i,j}$, with differentials (written stenographically)
$d_{tot} = d_h + \delta_v$.

Claim 8.7. $(TC(M)^\bullet, d_{tot}^\bullet)$ is indeed a complex.

Proof. The question is whether the composition of two consecutive differentials 
vanishes, as it should. Insisting with our shorthand, 

\[
d_{tot} \circ d_{tot} = (d_h + \delta_v) \circ (d_h + \delta_v) = d_h \circ d_h + d_h \circ \delta_v + \delta_v \circ d_h + \delta_v \circ \delta_v = 0 .
\]

The reader will formalize this computation (Exercise 8.3). □ 
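
The role of the signs can be seen on a toy example. The sketch below is an illustration under stated assumptions, not a construction from the text: every object $M^{i,j}$ with $i, j \in \{-1, 0\}$ is taken to be a copy of $\mathbb{Z}$, the horizontal maps are multiplication by 2, and the vertical maps are multiplication by 3. The helper names (`matmul`, `sign`, `composite`) are hypothetical. The code composes the two differentials of the resulting total complex, once with $\delta_v = (-1)^i d_v$ and once with the unsigned $d_v$.

```python
def matmul(a, b):
    """Multiply two integer matrices given as lists of rows."""
    return [[sum(a[r][k] * b[k][c] for k in range(len(b)))
             for c in range(len(b[0]))] for r in range(len(a))]


def sign(i):
    """Return (-1)^i for any integer i (negative i included)."""
    return 1 if i % 2 == 0 else -1


def composite(use_signs):
    """Compose TC^{-2} -> TC^{-1} -> TC^0 for the toy double complex.

    TC^{-2} = M^{-1,-1}, TC^{-1} = M^{-1,0} (+) M^{0,-1}, TC^0 = M^{0,0}.
    Matrices act on column vectors; rows are indexed by target summands.
    """
    d_h, d_v = 2, 3

    def vertical(i):
        # Vertical differential on column i, signed or not.
        return (sign(i) if use_signs else 1) * d_v

    d_minus2 = [[vertical(-1)],      # M^{-1,-1} -> M^{-1,0}
                [d_h]]               # M^{-1,-1} -> M^{0,-1}
    d_minus1 = [[d_h, vertical(0)]]  # M^{-1,0} -> M^{0,0}, M^{0,-1} -> M^{0,0}
    return matmul(d_minus1, d_minus2)


print(composite(True))   # [[0]]
print(composite(False))  # [[12]]
```

With the signs the two paths around the square cancel; without them they add, and the direct sums along diagonals do not form a complex.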


It is clear from the definition that $TC(M)^\bullet$ agrees with the mapping cone,
when $M^{\bullet,\bullet}$ is concentrated in degrees $-1$ and $0$.

The total complex is defined even without any boundedness hypothesis, but 
then one has to choose whether to use direct sums or direct products as objects of 
the complex. Since only bounded complexes will be needed in the applications we 
will see, we will happily leave such complications aside and limit the discussion to 
the bounded case. 


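Example 8.8. Operations such as $\mathrm{Hom}_{\mathsf{A}}$ or $\otimes$ determine double complexes. For a
‘first quadrant’ example, let $L^\bullet$, resp., $M^\bullet$, be a complex in $C^{\le 0}(\mathsf{A})$, resp. $C^{\ge 0}(\mathsf{A})$:

\[
\cdots \longrightarrow L^{-2} \longrightarrow L^{-1} \longrightarrow L^{0} \longrightarrow 0 \longrightarrow \cdots ,
\]
\[
\cdots \longrightarrow 0 \longrightarrow M^{0} \longrightarrow M^{1} \longrightarrow M^{2} \longrightarrow \cdots .
\]

We can consider the complex in $C^{\ge 0}(C^{\ge 0}(\mathsf{Ab}))$:

\[
\cdots \longrightarrow 0 \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^\bullet, M^0) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^\bullet, M^1) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^\bullet, M^2) \longrightarrow \cdots
\]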
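that is, in ‘grid’ form (and omitting zeros to the left and below):

\[
\begin{array}{ccccccc}
\uparrow & & \uparrow & & \uparrow & & \\
\mathrm{Hom}_{\mathsf{A}}(L^{-2}, M^0) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{-2}, M^1) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{-2}, M^2) & \longrightarrow & \cdots \\
\uparrow & & \uparrow & & \uparrow & & \\
\mathrm{Hom}_{\mathsf{A}}(L^{-1}, M^0) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{-1}, M^1) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{-1}, M^2) & \longrightarrow & \cdots \\
\uparrow & & \uparrow & & \uparrow & & \\
\mathrm{Hom}_{\mathsf{A}}(L^{0}, M^0) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{0}, M^1) & \longrightarrow & \mathrm{Hom}_{\mathsf{A}}(L^{0}, M^2) & \longrightarrow & \cdots
\end{array}
\]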

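We could also consider the complex of complexes

\[
\cdots \longrightarrow 0 \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^{0}, M^\bullet) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^{-1}, M^\bullet) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(L^{-2}, M^\bullet) \longrightarrow \cdots
\]

and this would lead to the grid obtained by flipping the previous one about the
main diagonal. The two corresponding total complexes are then clearly isomorphic
(Exercise 8.4), and by a slight abuse of language they can both be denoted
$TC(\mathrm{Hom}_{\mathsf{A}}(L, M))^\bullet$. The degree-$i$ piece of this total complex, i.e.,

\[
\mathrm{Hom}_{\mathsf{A}}(L^{-i}, M^{0}) \oplus \mathrm{Hom}_{\mathsf{A}}(L^{-i+1}, M^{1}) \oplus \cdots \oplus \mathrm{Hom}_{\mathsf{A}}(L^{0}, M^{i}),
\]

parametrizes degree-preserving morphisms from objects of $L^\bullet$ to objects of $M[i]^\bullet$.
These are not cochain morphisms $L^\bullet \to M[i]^\bullet$, since the corresponding diagrams

\[
\begin{array}{ccccccccc}
M[i]^{-i} & \longrightarrow & M[i]^{-i+1} & \longrightarrow & \cdots & \longrightarrow & M[i]^{0} & \longrightarrow & \cdots \\
\uparrow & & \uparrow & & & & \uparrow & & \\
L^{-i} & \longrightarrow & L^{-i+1} & \longrightarrow & \cdots & \longrightarrow & L^{0} & \longrightarrow & 0
\end{array}
\]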

do not satisfy any commutativity hypothesis. The reader will verify (Exercise 8.5)
that, with suitable positions:

• the kernel of $d^i_{tot}$ consists of morphisms of cochain complexes $L^\bullet \to M[i]^\bullet$;

• the image of $d^{i-1}_{tot}$ consists of morphisms that are homotopy equivalent to 0;
and hence

• the cohomology of $TC(\mathrm{Hom}_{\mathsf{A}}(L, M))^\bullet$ parametrizes morphisms in the homotopy
category.

For (a trivial) example, $TC(\mathrm{Hom}_{\mathsf{A}}(L, M))^0 = \mathrm{Hom}_{\mathsf{A}}(L^0, M^0)$, and $d^0_{tot}(\alpha) = 0$
means that the images of $\alpha$ in $\mathrm{Hom}_{\mathsf{A}}(L^{-1}, M^0)$ and in $\mathrm{Hom}_{\mathsf{A}}(L^0, M^1)$
both vanish:

\[
\alpha \circ d_L^{-1} = 0, \qquad d_M^{0} \circ \alpha = 0 .
\]

This is precisely the condition needed for $\alpha$ to define a morphism of cochain
complexes $L^\bullet \to M^\bullet$. It follows that in this situation $H^0(TC(\mathrm{Hom}_{\mathsf{A}}(L, M))^\bullet)$ is simply
$\mathrm{Hom}_{C(\mathsf{A})}(L^\bullet, M^\bullet) = \mathrm{Hom}_{K(\mathsf{A})}(L^\bullet, M^\bullet)$ (there are no nontrivial homotopies in this
case).

The identification of the $i$-th cohomology of the total complex with the Hom-set
$\mathrm{Hom}_{K(\mathsf{A})}(L^\bullet, M[i]^\bullet)$ requires care with the signs (see footnote 34).

8.3. Exactness of the total complex. The cohomology of the total complex of 
a double complex can be computed by an extremely clever device called spectral 
sequence, which we will (briefly) encounter in §9.3. There are standard situations, 
however, in which one can conclude immediately that the total complex is exact, 
and this is at the root of the applications we will present. The statement is neat 
and memorable: 

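Theorem 8.9. Let $\mathsf{A}$ be an abelian category, and let

\[
(*)\qquad \cdots \longrightarrow M^{-3,\bullet} \longrightarrow M^{-2,\bullet} \longrightarrow M^{-1,\bullet} \longrightarrow M^{0,\bullet} \longrightarrow 0 \longrightarrow \cdots
\]

be a complex in $C^{\le 0}(C^{\le 0}(\mathsf{A}))$. Let $T^\bullet$ be the corresponding total complex. Then

• if (*) is exact, then $T^\bullet$ is exact;

• if each complex $M^{i,\bullet}$ is exact, then $T^\bullet$ is exact.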

The proof will yield a more precise statement on the vanishing of individual 
degrees of the cohomology of the total complex (Exercise 8.6). 

Mutatis mutandis, Theorem 8.9 holds for complexes in $C^{\ge 0}(C^{\ge 0}(\mathsf{A}))$, with microscopic
adjustments to the proof; we will take this for granted in the applications.
In fact, the boundedness hypothesis can be relaxed: one of the two given exactness
statements will still hold if only one of the two $C$ is bounded. The point is that the
proof relies on an inductive argument, which goes through as soon as it has a place 
to start. The reader is welcome to explore all these ramifications. 

In terms of double complexes, Theorem 8.9 says that the total complex of a
(suitably bounded) double complex is exact if the rows or the columns of the double 
complex are exact. 


Proof. It is enough to prove the second statement: the first one follows by flipping 
the double complex corresponding to (*) (cf. Exercise 8.4). 


34 A clever way out of the sign quagmire in this computation is to choose another way to
get a double complex out of $\mathrm{Hom}_{\mathsf{A}}(L^\bullet, M^\bullet)$. For example, replacing the vertical differentials $\delta_v^{i,j}$
by $(-1)^{i+j+1} d_v^{i,j}$ still makes the array anticommutative as needed and irons out annoying sign
mismatches elsewhere.
To prove the second statement, we assume that the columns in the staggered
(anticommutative) diagram

\[
\begin{array}{cccc}
 & & & 0 \\
 & & 0 & M^{0,0} \\
 & 0 & M^{-1,0} & M^{0,-1} \\
0 & M^{-2,0} & M^{-1,-1} & M^{0,-2} \\
M^{-3,0} & M^{-2,-1} & M^{-1,-2} & M^{0,-3} \\
\vdots & \vdots & \vdots & \vdots
\end{array}
\]

associated with (*) are exact and that we have an element

\[
m = (m^{0}, m^{-1}, \dots, m^{-i}) \in M^{-i,0} \oplus M^{-i+1,-1} \oplus \cdots \oplus M^{0,-i}
\]

such that $d^{-i}_{tot}(m) = 0$; we are expected to produce an element

\[
n = (n^{0}, n^{-1}, \dots, n^{-i-1}) \in M^{-i-1,0} \oplus M^{-i,-1} \oplus M^{-i+1,-2} \oplus \cdots \oplus M^{0,-i-1}
\]

such that $d^{-i-1}_{tot}(n) = m$.

The little induction that accomplishes this is conceptually very simple, but
notationally challenging. We believe that viewing it in action in a simple case (we
will choose $i = 2$) suffices to convey all that needs to be conveyed. Thus, assume
that $m = (m^{0}, m^{-1}, m^{-2})$ is in the kernel of $d_{tot}$; that is,

\[
\begin{cases}
d_h(m^{0}) + \delta_v(m^{-1}) = 0, \\
d_h(m^{-1}) + \delta_v(m^{-2}) = 0.
\end{cases}
\]

We choose $n^{0} = 0$. Since $M^{-2,\bullet}$ is exact, there exists $n^{-1} \in M^{-2,-1}$ such that
$\delta_v(n^{-1}) = m^{0}$. Since the diagram anticommutes,

\[
\delta_v \circ d_h(n^{-1}) = -d_h \circ \delta_v(n^{-1}) = -d_h(m^{0}) = \delta_v(m^{-1}).
\]

Therefore, $m^{-1} - d_h(n^{-1}) \in \ker \delta_v$, and since $M^{-1,\bullet}$ is exact, there exists
$n^{-2} \in M^{-1,-2}$ such that $\delta_v(n^{-2}) = m^{-1} - d_h(n^{-1})$. That is,

\[
d_h(n^{-1}) + \delta_v(n^{-2}) = m^{-1}.
\]

The next step is entirely analogous:

\[
\delta_v \circ d_h(n^{-2}) = -d_h \circ \delta_v(n^{-2}) = -d_h\bigl(m^{-1} - d_h(n^{-1})\bigr) = -d_h(m^{-1}) = \delta_v(m^{-2})
\]

(using the fact that $d_h \circ d_h = 0$); therefore $m^{-2} - d_h(n^{-2}) \in \ker \delta_v$, and since $M^{0,\bullet}$
is exact, there exists $n^{-3} \in M^{0,-3}$ such that $\delta_v(n^{-3}) = m^{-2} - d_h(n^{-2})$. That is,

\[
d_h(n^{-2}) + \delta_v(n^{-3}) = m^{-2}.
\]

With these choices, $d_{tot}(n)$ equals

\[
d_h(n^{0}) + \delta_v(n^{-1}) + d_h(n^{-1}) + \delta_v(n^{-2}) + d_h(n^{-2}) + \delta_v(n^{-3}) = 0 + m^{0} + m^{-1} + m^{-2} = m
\]

as needed.

The case for arbitrary i simply requires additional repetitions of the basic steps 
performed for i = 2. □ 
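
For the record, the general step can be sketched in the same shorthand. The indexing used below, $m^{-k} \in M^{-i+k,-k}$ and $n^{-k} \in M^{-i-1+k,-k}$, is read off from the displays in the proof; the auxiliary index $k$ is introduced here only for bookkeeping. The existence of $n^{-1}$ uses that $\delta_v(m^{0})$ lands in $M^{-i,1} = 0$.

```latex
\[
\text{Hypothesis } (d_{tot}(m)=0):\qquad d_h(m^{-k}) + \delta_v(m^{-k-1}) = 0 \qquad (0 \le k \le i-1).
\]
\[
\text{Start: } n^{0} = 0, \qquad n^{-1}\in M^{-i,-1} \text{ with } \delta_v(n^{-1}) = m^{0}.
\]
\[
\text{Step: if } d_h(n^{-k+1}) + \delta_v(n^{-k}) = m^{-k+1} \text{ for some } 1\le k\le i, \text{ then}
\]
\[
\delta_v\bigl(m^{-k} - d_h(n^{-k})\bigr)
 = \delta_v(m^{-k}) + d_h\bigl(\delta_v(n^{-k})\bigr)
 = \delta_v(m^{-k}) + d_h\bigl(m^{-k+1} - d_h(n^{-k+1})\bigr)
 = \delta_v(m^{-k}) + d_h(m^{-k+1}) = 0,
\]
\[
\text{so exactness of } M^{-i+k,\bullet} \text{ gives } n^{-k-1}\in M^{-i+k,-k-1} \text{ with }
d_h(n^{-k}) + \delta_v(n^{-k-1}) = m^{-k}.
\]
```

Running the step for $k = 1, \dots, i$ produces $n^{-2}, \dots, n^{-i-1}$, and the resulting equalities are exactly the components of $d_{tot}(n) = m$.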

Example 8.10. Here is a taste of how convenient Theorem 8.9 is. Let $P^\bullet$ be a
complex in $C^{\le 0}(\mathsf{A})$, where each $P^i$ is projective, and let $L^\bullet$ be an exact complex in
$C^{\ge 0}(\mathsf{A})$. Since $P^i$ is projective, $\mathrm{Hom}_{\mathsf{A}}(P^i, L^\bullet)$ is still exact. Therefore, the complex

\[
\cdots \longrightarrow 0 \longrightarrow \mathrm{Hom}_{\mathsf{A}}(P^{0}, L^\bullet) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(P^{-1}, L^\bullet) \longrightarrow \mathrm{Hom}_{\mathsf{A}}(P^{-2}, L^\bullet) \longrightarrow \cdots
\]

satisfies the hypothesis of the second statement given in Theorem 8.9 (in the symmetrically
bounded version). According to Theorem 8.9, the corresponding total
complex is exact; as seen in Example 8.8 (cf. Exercise 8.5), this says that
$\mathrm{Hom}_{K(\mathsf{A})}(P^\bullet, L[i]^\bullet) = 0$ for all $i$. In other words, every cochain morphism from a
bounded-above complex of projectives to a bounded-below exact complex is homotopic
to 0. This recovers Corollary 5.12, up to easy adjustments, taking care of the
boundedness hypothesis on $L^\bullet$.

8.4. Total complexes and resolutions. Example 8.10 illustrates well the power 
of results such as Theorem 8.9: the nuts-and-bolts-style arguments used in §5 are 
bypassed, and statements such as Corollary 5.12 are placed in a clarifying context (see footnote 35).
The same strategy will allow us to revisit the question examined in §8.1, letting a 
double complex draw the conclusion of Theorem 8.3. We will then finally be able 
to go back to the last remaining mystery concerning Tor and Ext, dealing with why 
we can compute these functors ‘by resolving either argument’. This will conclude 
a large circle of ideas, begun in Chapter VIII. 

These applications rely on one more remark concerning double complexes and 
their simples. The heavy notation poses a substantial obstacle in appreciating what 
follows; the reader should take comfort in our assurance that notation is the only 
obstacle here— the constructions are actually straightforward. 

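We have studied in §6.2 the problem of constructing complexes of projectives
(for example) that are quasi-isomorphic to a given complex $N^\bullet$: Theorem 6.6
accomplishes this. We have called the corresponding quasi-isomorphism a (projective)
resolution of $N^\bullet$. Now that we are working in $C^{\le 0}(C^{\le 0}(\mathsf{A}))$, this looks like an even
worse abuse of language: viewing a (bounded-above) complex $N^\bullet$ as an object of
the abelian category $\mathsf{A}' = C^{\le 0}(\mathsf{A})$, a (bounded-above) resolution of $N^\bullet$ ought to be
a complex $M^{\bullet,\bullet}$ in $C^{\le 0}(\mathsf{A}')$ endowed with a quasi-isomorphism

\[
\begin{array}{ccccccccccc}
\cdots & \longrightarrow & M^{-2,\bullet} & \longrightarrow & M^{-1,\bullet} & \longrightarrow & M^{0,\bullet} & \longrightarrow & 0 & \longrightarrow & \cdots \\
 & & \downarrow & & \downarrow & & \downarrow & & \downarrow & & \\
\cdots & \longrightarrow & 0 & \longrightarrow & 0 & \longrightarrow & N^{\bullet} & \longrightarrow & 0 & \longrightarrow & \cdots
\end{array}
\]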


35 But we strongly believe in the pedagogic usefulness of first going through the nuts and 
bolts! 
in $C^{\le 0}(\mathsf{A}')$. In other words, a resolution of $N^\bullet$ should be a complex $M^{\bullet,\bullet}$ whose
cohomology (computed in $C^{\le 0}(\mathsf{A}')$) is concentrated in degree 0 and agrees with $N^\bullet$.

We are going to verify that these two notions of ‘resolution’ of a complex are
compatible (so no language abuse occurs after all): taking the total complex of a
resolution $M^{\bullet,\bullet}$ of $N^\bullet$ (in the ‘new’ sense of the term) again gives a resolution of $N^\bullet$
(in the ‘old’ sense).

Before stating this more explicitly, consider more generally any cochain morphism
in $C^{\le 0}(C^{\le 0}(\mathsf{A}))$ (see footnote 36) from $M^{\bullet,\bullet}$ to a complex (of complexes) concentrated in
degree 0:


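\[
\begin{array}{ccccccccccc}
\cdots & \longrightarrow & M^{-2,\bullet} & \xrightarrow{\,d_h^{-2,\bullet}\,} & M^{-1,\bullet} & \xrightarrow{\,d_h^{-1,\bullet}\,} & M^{0,\bullet} & \longrightarrow & 0 & \longrightarrow & \cdots \\
 & & \downarrow & & \downarrow & & \downarrow{\scriptstyle \alpha^\bullet} & & \downarrow & & \\
\cdots & \longrightarrow & 0 & \longrightarrow & 0 & \longrightarrow & N^{\bullet} & \longrightarrow & 0 & \longrightarrow & \cdots
\end{array}
\]

This means that $\alpha^\bullet \circ d_h^{-1,\bullet} = 0$, so we can fold this diagram into a single complex of
complexes:

\[
M_N^{\bullet,\bullet}:\quad \cdots \longrightarrow M^{-2,\bullet} \xrightarrow{\,d_h^{-2,\bullet}\,} M^{-1,\bullet} \xrightarrow{\,d_h^{-1,\bullet}\,} M^{0,\bullet} \xrightarrow{\,\alpha^\bullet\,} N^{\bullet} \longrightarrow 0 \longrightarrow \cdots .
\]

That is, $M_N^{\bullet,\bullet}$ agrees with the complex obtained by shifting $M^{\bullet,\bullet}$ by one step to
the left, with $N^\bullet$ inserted in degree 0. For now, we are making no assumptions on
the cohomology of $M^{\bullet,\bullet}$.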

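We want to compare the various total complexes that can be defined in this
situation. We can map the total complex of $M^{\bullet,\bullet}$ to $N^\bullet$,

\[
\begin{array}{ccc}
0 & \longrightarrow & 0 \\
\uparrow & & \uparrow \\
M^{0,0} & \longrightarrow & N^{0} \\
\uparrow & & \uparrow \\
M^{-1,0} \oplus M^{0,-1} & \longrightarrow & N^{-1} \\
\uparrow & & \uparrow \\
M^{-2,0} \oplus M^{-1,-1} \oplus M^{0,-2} & \longrightarrow & N^{-2} \\
\uparrow & & \uparrow \\
\vdots & & \vdots
\end{array}
\]

obtaining a morphism of cochain complexes (see footnote 37)

\[
TC(\alpha)^\bullet : TC(M^{\bullet,\bullet}) \longrightarrow N^\bullet .
\]

(This morphism is of course just an instance of the evident functoriality of the
construction of the total complex.)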


36 Of course a parallel discussion can be carried out in $C^{\ge 0}(C^{\ge 0}(\mathsf{A}))$, and with other
boundedness possibilities. This is left to the reader as usual.

37 Note that, e.g., $M^{-1,-1} \to M^{0,-1} \to N^{-1}$ is the zero-morphism (because $\alpha^\bullet \circ d_h = 0$).
The morphism from $TC(M^{\bullet,\bullet})$ to $N^\bullet$ only sees $M^{0,\bullet}$, so it clearly is a morphism of cochain
complexes.
Or, we can shift the $M$-part of this diagram and view the last display as the
construction of the total complex of $M_N^{\bullet,\bullet}$:

\[
\begin{array}{c}
N^{0} \\
\uparrow \\
M^{0,0} \oplus N^{-1} \\
\uparrow \\
M^{-1,0} \oplus M^{0,-1} \oplus N^{-2} \\
\uparrow \\
\vdots
\end{array}
\]

This involves a sign change in the vertical differentials of the $M$-part, due to the
shift.


Claim 8.11. The total complex of $M_N^{\bullet,\bullet}$ is the mapping cone of $TC(\alpha)^\bullet$:
$TC(M_N)^\bullet = MC(TC(\alpha))^\bullet$.


Verifying this claim only amounts to chasing the definitions, so it can safely be 
left to the reader (Exercise 8.7). The following consequence compares the different 
notions of ‘resolution’, as promised a moment ago. 
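
At the level of objects the claim is a matter of counting indices. As a sketch, assume the convention $MC(\varphi)^k = X^{k+1} \oplus Y^k$ for a cochain morphism $\varphi\colon X^\bullet \to Y^\bullet$ (the names $X$, $Y$, $\varphi$ are placeholders introduced here), and recall that $M^{i,\bullet}$ sits in degree $i-1$ of $M_N^{\bullet,\bullet}$:

```latex
\[
TC(M_N)^k \;=\; N^k \oplus \bigoplus_{(i-1)+j=k} M^{i,j}
 \;=\; N^k \oplus \bigoplus_{i+j=k+1} M^{i,j}
 \;=\; TC(M)^{k+1}\oplus N^k
 \;=\; MC(TC(\alpha))^k .
\]
```

Comparing the differentials, including the signs introduced by the shift, is the actual content of Exercise 8.7.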

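Theorem 8.12. Let $\mathsf{A}$ be an abelian category, and denote by $\mathsf{A}'$ the category
$C^{\le 0}(\mathsf{A})$. Let $N^\bullet$ be an object of $\mathsf{A}'$, and let

\[
(*)\qquad \cdots \longrightarrow M^{-3,\bullet} \longrightarrow M^{-2,\bullet} \longrightarrow M^{-1,\bullet} \longrightarrow M^{0,\bullet} \longrightarrow 0 \longrightarrow \cdots
\]

be a complex in $C^{\le 0}(\mathsf{A}')$, with total complex $T^\bullet$.

• Assume that (*) is a resolution of $N^\bullet$ in $C^{\le 0}(\mathsf{A}')$. Then $T^\bullet$ is quasi-isomorphic
to $N^\bullet$ in $C^{\le 0}(\mathsf{A})$ (that is, it is a resolution of $N^\bullet$ in the sense used in §6.2
and following).

• Assume that the cohomology of each $M^{i,\bullet}$ is concentrated in degree 0. Then $T^\bullet$
is quasi-isomorphic to the complex of cohomology objects induced by (*):

\[
\cdots \longrightarrow H^0(M^{-2,\bullet}) \longrightarrow H^0(M^{-1,\bullet}) \longrightarrow H^0(M^{0,\bullet}) \longrightarrow 0 \longrightarrow \cdots .
\]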


The statement of Theorem 8.12 extends that of Theorem 8.9; its proof is a 
nearly immediate consequence of the latter. As we mentioned at the beginning 
of §8.3, these statements are vastly generalized by the machinery of spectral se- 
quences, which provide a general framework for the computation of the cohomology 
of the total complex of a double complex. We will recover Theorem 8.12 once we 
have learned a bit about spectral sequences in §9.3, but it is not hard (and it is a 
good exercise) to prove this statement by hand, as we proceed to do. 

Proof. The second statement follows from the first, by flipping the corresponding
double complex about the main diagonal.

The first statement follows from Theorem 8.9 and Claim 8.11. Indeed, let $M_N^{\bullet,\bullet}$
be the exact complex

\[
\cdots \longrightarrow M^{-2,\bullet} \longrightarrow M^{-1,\bullet} \longrightarrow M^{0,\bullet} \longrightarrow N^{\bullet} \longrightarrow 0 \longrightarrow \cdots ;
\]

by Theorem 8.9, $TC(M_N)^\bullet$ is exact; by Claim 8.11 this is the mapping cone of the
induced morphism $T^\bullet \to N^\bullet$, and it follows (Corollary 4.2) that this morphism is
a quasi-isomorphism, as needed. □


To appreciate the usefulness of Theorem 8.12, again in the context of our (hard) 
work in §6, note that it provides us with a convenient bridge between resolving 
objects (see Lemma 6.3) and resolving complexes (as in Theorem 6.6). Indeed, by 
Corollary 7.9 (which relies on the horseshoe lemma, therefore on resolving individual
objects of $\mathsf{A}$) every bounded-above complex $N^\bullet$ in an abelian category with enough
projectives may be viewed as the complex induced in $H^0$ from a complex

\[
\cdots \longrightarrow P^{-3,\bullet} \longrightarrow P^{-2,\bullet} \longrightarrow P^{-1,\bullet} \longrightarrow P^{0,\bullet} \longrightarrow 0
\]

where $P^{i,\bullet}$ is a projective resolution of $N^i$. The total complex of $P^{\bullet,\bullet}$ is then
a projective resolution of $N^\bullet$, by the second statement in Theorem 8.12. This
reproduces the main content of Theorem 6.6, bypassing the nastiest details of the 
proof given in §6. 

As mentioned in Remark 7.10, double complexes of projectives (or, working 
in $C^{\ge 0}$, injectives) resolving a given complex (and satisfying the additional condition
explained in Remark 7.10) are called Cartan-Eilenberg resolutions. Their systematic 
use can simplify the material we presented in §6 but at the price of dealing early 
with double complexes. 


8.5. Acyclic resolutions again and balancing Tor and Ext. Finally we go 
back to the last mysteries remaining concerning Tor and Ext. As a warm-up (and 
another illustration of the power of double complexes), go back to the question we 
studied in §8.1: we ended that subsection by lining up the complexes obtained by 
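applying an additive functor $\mathscr{F}$ to an $\mathscr{F}$-acyclic resolution $A^\bullet$ of an object $M$ and
to a projective resolution of the same object, as sides of an array. Corollary 7.9
tells us how to fill the rest of the array: start from

\[
(*)\qquad \cdots \longrightarrow A^{-2} \longrightarrow A^{-1} \longrightarrow A^{0} \longrightarrow M \longrightarrow 0 \longrightarrow \cdots ,
\]

which is exact by hypothesis; using Corollary 7.9, view (*) as the complex induced
in $H^0$ by an exact complex

\[
(**)\qquad \cdots \longrightarrow P^{-2,\bullet} \longrightarrow P^{-1,\bullet} \longrightarrow P^{0,\bullet} \longrightarrow P_M^{\bullet} \longrightarrow 0 \longrightarrow \cdots
\]

where $P^{i,\bullet}$ is a projective resolution of $A^i$ and $P_M^\bullet$ is a projective resolution of
$M$. Since (**) is exact, the rows of the double complex corresponding to (**) are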
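projective resolutions of the projective objects $P_M^j$. Now apply $\mathscr{F}$:

\[
\begin{array}{ccccccccc}
\cdots & \longrightarrow & \mathscr{F}(A^{-2}) & \longrightarrow & \mathscr{F}(A^{-1}) & \longrightarrow & \mathscr{F}(A^{0}) & \longrightarrow & \mathscr{F}(M) \\
 & & \uparrow & & \uparrow & & \uparrow & & \uparrow \\
\cdots & \longrightarrow & \mathscr{F}(P^{-2,0}) & \longrightarrow & \mathscr{F}(P^{-1,0}) & \longrightarrow & \mathscr{F}(P^{0,0}) & \longrightarrow & \mathscr{F}(P_M^{0}) \\
 & & \uparrow & & \uparrow & & \uparrow & & \uparrow \\
\cdots & \longrightarrow & \mathscr{F}(P^{-2,-1}) & \longrightarrow & \mathscr{F}(P^{-1,-1}) & \longrightarrow & \mathscr{F}(P^{0,-1}) & \longrightarrow & \mathscr{F}(P_M^{-1}) \\
 & & \uparrow & & \uparrow & & \uparrow & & \uparrow \\
\cdots & \longrightarrow & \mathscr{F}(P^{-2,-2}) & \longrightarrow & \mathscr{F}(P^{-1,-2}) & \longrightarrow & \mathscr{F}(P^{0,-2}) & \longrightarrow & \mathscr{F}(P_M^{-2}) \\
 & & \uparrow & & \uparrow & & \uparrow & & \uparrow \\
 & & \vdots & & \vdots & & \vdots & & \vdots
\end{array}
\]

(Dashed lines separate the top row and the rightmost column from the rest of the array.)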

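Since each $P_M^j$ is projective and each $A^i$ is $\mathscr{F}$-acyclic, every row and every column
of the part of this diagram within the dashed lines is exact. Therefore we can apply
Theorem 8.12 to the complex

\[
\mathscr{F}(P^{\bullet,\bullet}):\quad \cdots \longrightarrow \mathscr{F}(P^{-2,\bullet}) \longrightarrow \mathscr{F}(P^{-1,\bullet}) \longrightarrow \mathscr{F}(P^{0,\bullet}) \longrightarrow 0 \longrightarrow \cdots
\]

and discover that the total complex $TC(\mathscr{F}(P))^\bullet$ is quasi-isomorphic to both complexes
$C(\mathscr{F})(A^\bullet)$ and $C(\mathscr{F})(P_M^\bullet)$. It follows that

\[
H^{-i}(C(\mathscr{F})(A^\bullet)) \cong H^{-i}(C(\mathscr{F})(P_M^\bullet)) = L_i\mathscr{F}(M),
\]

which was the conclusion of Theorem 8.3.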

This example illustrates the general strategy of other applications of double 
complexes: the cohomologies of two complexes are shown to be isomorphic by 
showing that there is an exact double complex interpolating between them. This 
strategy finally justifies the claims that Tor and Ext functors can be computed by 
resolving either one of their arguments. 

Theorem 8.13. Let $M$, $N$ be modules over a commutative ring $R$, and let $P_M^\bullet$,
resp., $P_N^\bullet$, be projective resolutions of $M$, resp., $N$. Then

\[
H^{-i}(P_M^\bullet \otimes_R N) \cong H^{-i}(M \otimes_R P_N^\bullet) \quad (= \mathrm{Tor}_i^R(M, N)).
\]

The first term was our official definition of Tor, as originally given in § VIII.2.4 
(where we used free resolutions); see also Example 7.6. Theorem 8.13 makes good 
on our old promise (also made in § VIII.2.4) to show that the Tor functors could be 
computed by resolving the second factor rather than the first. 


Proof. Apply Theorem 8.12 to the complex

\[
(*)\qquad \cdots \longrightarrow P_M^{-2} \otimes_R P_N^\bullet \longrightarrow P_M^{-1} \otimes_R P_N^\bullet \longrightarrow P_M^{0} \otimes_R P_N^\bullet \longrightarrow 0 \longrightarrow \cdots .
\]

Since each $P_N^j$ is projective, the cohomology of this complex is concentrated in
degree 0 and equals $M \otimes_R P_N^\bullet$. It follows that

$TC(P_M^\bullet \otimes_R P_N^\bullet)^\bullet$ is quasi-isomorphic to $M \otimes_R P_N^\bullet$,

by the first statement in Theorem 8.12. On the other hand, since each $P_M^i$ is
projective, the complex $P_M^i \otimes_R P_N^\bullet$ is a resolution of $P_M^i \otimes_R N$; that is, the cohomology
of each $P_M^i \otimes_R P_N^\bullet$ is concentrated in degree 0 and equals $P_M^i \otimes_R N$. Thus, the
complex (*) induces the complex $P_M^\bullet \otimes_R N$ in $H^0$, and

$TC(P_M^\bullet \otimes_R P_N^\bullet)^\bullet$ is quasi-isomorphic to $P_M^\bullet \otimes_R N$,

by the second statement in Theorem 8.12. The result follows. □
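
A small numerical sanity check may help. The sketch below is an illustration under stated assumptions, not part of the argument: it takes $R = \mathbb{Z}$, $M = \mathbb{Z}/m$, $N = \mathbb{Z}/n$, and uses the free resolutions $0 \to \mathbb{Z} \xrightarrow{m} \mathbb{Z}$ of $M$ and $0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z}$ of $N$. Tensoring the first with $N$ gives $\mathbb{Z}/n \xrightarrow{m} \mathbb{Z}/n$; tensoring $M$ with the second gives $\mathbb{Z}/m \xrightarrow{n} \mathbb{Z}/m$. The function names are hypothetical. The code counts the elements of the kernel and the cokernel on each side by brute force, so it compares cardinalities only.

```python
from math import gcd


def kernel_and_cokernel_sizes(multiplier, modulus):
    """Sizes of the kernel and cokernel of x -> multiplier * x on Z/modulus."""
    elements = range(modulus)
    kernel = [x for x in elements if (multiplier * x) % modulus == 0]
    image = {(multiplier * x) % modulus for x in elements}
    return len(kernel), modulus // len(image)


def tor_sizes_resolving_first(m, n):
    """(|Tor_1|, |Tor_0|) read off Z/n --m--> Z/n, i.e. resolving M = Z/m."""
    return kernel_and_cokernel_sizes(m, n)


def tor_sizes_resolving_second(m, n):
    """(|Tor_1|, |Tor_0|) read off Z/m --n--> Z/m, i.e. resolving N = Z/n."""
    return kernel_and_cokernel_sizes(n, m)


for m, n in [(4, 6), (5, 7), (12, 18)]:
    first = tor_sizes_resolving_first(m, n)
    second = tor_sizes_resolving_second(m, n)
    # Both pairs should agree, and both entries should equal gcd(m, n).
    print(m, n, first, second, gcd(m, n))
```

In each case both sides give groups of order $\gcd(m, n)$ in degrees 0 and 1, as Theorem 8.13 predicts for this pair of modules.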

Theorem 8.14. Let $M$, $N$ be modules over a commutative ring $R$, and let $P_M^\bullet$,
resp., $Q_N^\bullet$, be a projective resolution of $M$, resp., an injective resolution of $N$.
Then

\[
H^{i}(\mathrm{Hom}_R(P_M^\bullet, N)) \cong H^{i}(\mathrm{Hom}_R(M, Q_N^\bullet)) \quad (= \mathrm{Ext}^i_R(M, N)).
\]

The proof of Theorem 8.14 is left to the reader (Exercise 8.9): Example 8.8 
and the strategy extensively discussed above will hopefully make this a very easy 
task. 

This statement completes the verification of all claims concerning Tor and Ext 
made in Chapter VIII, in the sense that the extent to which we have developed the 
general theory of homological algebra makes those seemingly magical claims now 
essentially evident. Hopefully, the same will apply to other encounters the reader
may have with simple applications of homological algebra.

In any case, this seems a fitting place to end the main body of this last chapter.
In §9 we will give a brief look at material that really lies beyond the scope of these 
notes. 


Exercises 


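8.1. ▷ Let $\mathsf{A}$, $\mathsf{B}$ be abelian categories, let $A^\bullet$ be an exact complex in $C(\mathsf{A})$, and
let $\mathscr{F}$ be a right-exact functor $\mathsf{A} \to \mathsf{B}$. Let $K^i$ be the kernel of $d_A^i$. Prove that

\[
H^i(C(\mathscr{F})(A^\bullet)) \cong \ker\bigl(\mathscr{F}(K^{i+1}) \to \mathscr{F}(A^{i+1})\bigr).
\]

[§8.1]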

8.2. Let $M$ be an object of an abelian category $\mathsf{A}$ with enough projectives, and let
$\mathscr{F} : \mathsf{A} \to \mathsf{B}$ be an additive functor. Let $A^\bullet$ be an $\mathscr{F}$-acyclic resolution of $M$, and
let $J^i := \operatorname{im} d_A^i$. Prove that

\[
L_i\mathscr{F}(M) = L_1\mathscr{F}(J^{-i+1})
\]

for all $i > 1$.

8.3. ▷ Verify carefully that $d_{tot} \circ d_{tot} = 0$ (cf. Claim 8.7). [§8.2]

8.4. ▷ We have obtained a double complex from a commutative array of objects $M^{i,j}$
and differentials $d_h^{\bullet,\bullet}$, $d_v^{\bullet,\bullet}$ by ‘changing the signs of every other column’. Prove
that changing the sign of every other row leads to an isomorphic double complex.
(In particular, the cohomology of the total complex is essentially unaffected by
flipping the original array.) [§8.2, §8.3]

8.5. ▷ With notation as in Example 8.8, prove that the $i$-th cohomology of the
total complex of $\mathrm{Hom}_{\mathsf{A}}(L^\bullet, M^\bullet)$ is isomorphic to $\mathrm{Hom}_{K(\mathsf{A})}(L^\bullet, M[i]^\bullet)$. (Warning:
These isomorphisms require careful sign adjustments.) [§8.2, §8.3]

8.6. ▷ Let $M^{\bullet,\bullet}$ be a double complex on an abelian category $\mathsf{A}$, such that (for
example) $M^{i,j} = 0$ for $i, j > 0$. Prove that $H^k(TC(M)^\bullet) = 0$ if the rows (or the
columns) of $M^{\bullet,\bullet}$ are exact at all $M^{i,j}$ with $i + j = k$. [§8.3]

8.7. ▷ Prove Claim 8.11. [§8.4]

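8.8. ¬ Let $\mathsf{A}$, $\mathsf{B}$, $\mathsf{C}$ be abelian categories, and let $\mathscr{F} : \mathsf{A} \to \mathsf{B}$, $\mathscr{G} : \mathsf{B} \to \mathsf{C}$ be additive
functors. Assume that $\mathsf{A}$ and $\mathsf{B}$ have enough projectives, that $\mathscr{F}$ sends projectives
of $\mathsf{A}$ to $\mathscr{G}$-acyclic objects of $\mathsf{B}$, and that $\mathscr{G}$ is right-exact.

Let $A$ be an object of $\mathsf{A}$, and let $M^\bullet : \cdots \to M^{-2} \to M^{-1} \to M^{0} \to 0$ be
a projective resolution of $A$. Let $P_{\mathscr{F}M}^{\bullet,\bullet}$ be a Cartan-Eilenberg resolution of the
complex $\mathscr{F}M^\bullet$ (as constructed in Corollary 7.9). Consider the complex

\[
(\dagger)\qquad \cdots \longrightarrow \mathscr{G}(P_{\mathscr{F}M}^{-2,\bullet}) \longrightarrow \mathscr{G}(P_{\mathscr{F}M}^{-1,\bullet}) \longrightarrow \mathscr{G}(P_{\mathscr{F}M}^{0,\bullet}) \longrightarrow 0
\]

obtained by applying $\mathscr{G}$ to $P_{\mathscr{F}M}^{\bullet,\bullet}$. Finally, let $T^\bullet$ be the total complex corresponding
to (†).

Prove that $H^{-n}(T^\bullet) \cong L_n(\mathscr{G} \circ \mathscr{F})(A)$.

(Hint: Prove that the cohomology of each term of (†) is concentrated in degree 0,
and apply Theorem 8.12.) [§7.2, 7.6, 9.10]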

8.9. ▷ Prove Theorem 8.14: Let $M$, $N$ be modules over a commutative ring $R$, and
let $P_M^\bullet$, resp., $Q_N^\bullet$, be a projective resolution of $M$, resp., an injective resolution
of $N$; prove that

\[
H^{i}(\mathrm{Hom}_R(P_M^\bullet, N)) \cong H^{i}(\mathrm{Hom}_R(M, Q_N^\bullet)),
\]

completing the justification for the definition of the Ext functors given in §VIII.6.4.
[§8.5]

8.10. If an abelian category $\mathsf{A}$ has enough projectives and injectives, we can define
functors $\mathrm{Ext}^i_{\mathsf{A}}$ as derived functors of $\mathrm{Hom}_{\mathsf{A}}$, by resolving either component (as we
did in $R$-Mod).

Prove that an object $P$ of $\mathsf{A}$ is projective if and only if $\mathrm{Ext}^1_{\mathsf{A}}(P, B) = 0$ for all
objects $B$ of $\mathsf{A}$, if and only if $\mathrm{Ext}^i_{\mathsf{A}}(P, B) = 0$ for all objects $B$ of $\mathsf{A}$ and all $i \ge 1$.

Prove that an object $Q$ of $\mathsf{A}$ is injective if and only if $\mathrm{Ext}^1_{\mathsf{A}}(A, Q) = 0$ for all
objects $A$ of $\mathsf{A}$, if and only if $\mathrm{Ext}^i_{\mathsf{A}}(A, Q) = 0$ for all objects $A$ of $\mathsf{A}$ and all $i \ge 1$.

8.11. ¬ Let $\mathsf{A}$ be an abelian category with enough injectives and projectives. The
projective dimension of an object $A$ of $\mathsf{A}$ is the minimum number $d = \mathrm{pd}(A)$ such
that there exists a projective resolution

\[
0 \longrightarrow P^{-d} \longrightarrow \cdots \longrightarrow P^{-1} \longrightarrow P^{0} \longrightarrow 0
\]

of $A$, if this number exists, or $\infty$ if $A$ has no finite projective resolution. Prove that
the following are equivalent:

(i) $\mathrm{pd}(A) \le d$.

(ii) If $0 \to K \to P^{-d+1} \to \cdots \to P^{0} \to 0$ is a resolution of $A$ and all $P^i$ are
projective, then $K$ is also projective.

(iii) $\mathrm{Ext}^n(A, B) = 0$ for all objects $B$ and all $n > d$.

(iv) $\mathrm{Ext}^{d+1}(A, B) = 0$ for all objects $B$.

(Hint: (ii) $\Rightarrow$ (i) $\Rightarrow$ (iii) $\Rightarrow$ (iv) $\Rightarrow$ (ii). For the last implication, use
‘dimension shifting’: adapt Exercise 7.12.) [8.12, 8.13]

8.12. ¬ Let $\mathsf{A}$ be an abelian category with enough injectives and projectives. The
injective dimension of an object $B$ of $\mathsf{A}$ is the minimum number $d = \mathrm{id}(B)$ such
that there exists an injective resolution

\[
0 \longrightarrow Q^{0} \longrightarrow Q^{1} \longrightarrow \cdots \longrightarrow Q^{d} \longrightarrow 0
\]

of $B$, if this number exists, or $\infty$ if $B$ has no finite injective resolution. Prove that
the following are equivalent:

(i) $\mathrm{id}(B) \le d$.

(ii) If $0 \to Q^{0} \to \cdots \to Q^{d-1} \to C \to 0$ is a resolution of $B$ and all $Q^i$ are
injective, then $C$ is also injective.

(iii) $\mathrm{Ext}^n(A, B) = 0$ for all objects $A$ and all $n > d$.

(iv) $\mathrm{Ext}^{d+1}(A, B) = 0$ for all objects $A$.

(See Exercise 8.11.) [8.13]

8.13. (See Exercises 8.11 and 8.12.) Let $\mathsf{A}$ be an abelian category with enough
injectives and projectives. Prove that the supremum of $\mathrm{pd}(A)$, $A \in \mathrm{Obj}(\mathsf{A})$, equals
the supremum of $\mathrm{id}(B)$, $B \in \mathrm{Obj}(\mathsf{A})$, and give an Ext interpretation for this number.

This is called the global dimension of $\mathsf{A}$. If $R$ is a ring, the global dimension of
$R$ is the global dimension of $R$-Mod. This invariant is of fundamental importance in
commutative algebra. The relation between this number and the Krull dimension
of $R$ is subtle and important.

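8.14. ▷ Let $\mathsf{A}$, $\mathsf{B}$, $\mathsf{C}$ be categories. A bifunctor $\mathscr{F} : \mathsf{A} \times \mathsf{B} \to \mathsf{C}$ is an assignment of
an object $\mathscr{F}(A, B)$ of $\mathsf{C}$ for every object $A$ of $\mathsf{A}$ and $B$ of $\mathsf{B}$, such that for all objects
$A$ in $\mathsf{A}$, $B$ in $\mathsf{B}$, $\mathscr{F}^A := \mathscr{F}(A, \_)$ is a (say, covariant) functor and $\mathscr{F}_B := \mathscr{F}(\_, B)$ is
a (say, contravariant) functor; further, for all morphisms $A_1 \to A_2$ in $\mathsf{A}$, $B_1 \to B_2$
in $\mathsf{B}$, the induced diagram

\[
\begin{array}{ccc}
\mathscr{F}(A_1, B_1) & \longrightarrow & \mathscr{F}(A_1, B_2) \\
\uparrow & & \uparrow \\
\mathscr{F}(A_2, B_1) & \longrightarrow & \mathscr{F}(A_2, B_2)
\end{array}
\]

is required to commute. (Arrows in this diagram should be directed according to
the covariance of $\mathscr{F}$ in its arguments.)

For example, $\mathrm{Hom}_{\mathsf{A}}(\_, \_)$ is a bifunctor in this sense from $\mathsf{A} \times \mathsf{A}$ to $\mathsf{Ab}$. For $R$
a commutative ring, $\_ \otimes_R \_$ is a bifunctor $R$-Mod $\times$ $R$-Mod $\to$ $R$-Mod, covariant in
both arguments.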

Now assume that $\mathsf{A}$, $\mathsf{B}$, $\mathsf{C}$ are abelian categories and that $\mathscr{F} : \mathsf{A} \times \mathsf{B} \to \mathsf{C}$ is a
bifunctor, such that $\mathscr{F}^A$ is additive and covariant, $\mathscr{F}_B$ is additive and contravariant,
and both are left-exact. Also, assume that $\mathsf{B}$ has enough injectives.

Say that an object $E$ of $\mathsf{A}$ is $\mathscr{F}$-exact if the functor $\mathscr{F}^E$ is exact. An ‘$\mathscr{F}$-exact
resolution’ of an object $A$ of $\mathsf{A}$ is a resolution in $C^{\le 0}(\mathsf{A})$ consisting of $\mathscr{F}$-exact
objects. The category $\mathsf{A}$ has ‘enough $\mathscr{F}$-exact objects’ if every object $A$ admits an
epimorphism $E \to A$ with $\mathscr{F}$-exact source. Assume that this is the case, so that
every object of $\mathsf{A}$ admits an $\mathscr{F}$-exact resolution.

Note that we are not assuming that $\mathsf{A}$ has enough projectives: in principle,
therefore, we should not be able to define a right-derived functor for $\mathscr{F}_B$. We are
going to look for an adequate replacement, using the fact that $\mathscr{F}_B$ is one component
of a bifunctor.

Since $\mathsf{B}$ has enough injectives, the functor $\mathscr{F}^A$ can be right-derived for every
object $A$. Define $T^i\mathscr{F}_B(A)$ to be $R^i\mathscr{F}^A(B)$; note that $T^0\mathscr{F}_B(A) = \mathscr{F}^A(B) =
\mathscr{F}(A, B) = \mathscr{F}_B(A)$.

• Prove that every exact sequence $0 \to A_1 \to A_2 \to A_3 \to 0$ in $\mathsf{A}$ induces a long
exact sequence

\[
0 \to \mathscr{F}_B(A_3) \to \mathscr{F}_B(A_2) \to \mathscr{F}_B(A_1) \xrightarrow{\,\delta\,} T^1\mathscr{F}_B(A_3) \to T^1\mathscr{F}_B(A_2) \to T^1\mathscr{F}_B(A_1) \xrightarrow{\,\delta\,} T^2\mathscr{F}_B(A_3) \to \cdots
\]

• Prove that the $T^i\mathscr{F}_B$ form a $\delta$-functor, in the sense of Exercise 7.13.

• Prove that if $E$ is an $\mathscr{F}$-exact object of $\mathsf{A}$, then $T^i\mathscr{F}_B(E) = 0$ for $i > 0$, and
deduce that the $T^i\mathscr{F}_B$ form a universal $\delta$-functor.

• Let $\cdots \to E^{-2} \to E^{-1} \to E^{0} \to A \to 0$ be an $\mathscr{F}$-exact resolution of $A$.
Then $T^i\mathscr{F}_B(A) \cong H^i(\mathscr{F}_B(E^\bullet))$. (Hint: Mimic the proof of Theorem 8.3.)

The upshot is that the interplay of the two functors $\mathscr{F}^A$ and $\mathscr{F}_B$ allows us to
define $T^i\mathscr{F}_B$ as if it were a right-derived functor, using $\mathscr{F}$-exact objects in place of
projectives. This gives a universal $\delta$-functor, which agrees with the standard right-derived
functor when $\mathsf{A}$ has enough projectives (by Exercise 7.15 and the uniqueness
of universal $\delta$-functors). [§8.1]


9. Further topics 

This last section touches upon issues that arise naturally in the context of this 
chapter and for which a (much) more extensive treatment would be needed. This 
will hopefully whet the reader’s appetite for more and encourage further study of 
homological algebra beyond the basics covered here. We will be somewhat detailed 
in §9.3, illustrating the spectral sequence associated to a double complex, since 
this can be done reasonably quickly (but delving into any other application of
spectral sequences would simply take us too far). For the other themes in this 
section (derived and triangulated categories) we will really do no more than a little 
propaganda. 


9.1. Derived categories. Derived categories made their appearance in §4.2 and 
have since served as a guiding concept motivating the natural development of the 
subject, but we have stopped short of constructing them in general. The closest we 
have come to a concrete description of the derived category is in the ‘poor man’s 
version’ of §6.3, which should actually be labeled the ‘rich man’s version’, since it 
relies on the fortunate presence of enough projectives/injectives. Also, the versions
$D^-(\mathsf{A})$, $D^+(\mathsf{A})$ considered there are necessarily bounded, as they rely on bounded
resolutions of objects and complexes in A. 

Derived categories may be defined without such restrictions. For an abelian 
category A, the objects of D(A) are taken to be the same as in C(A), that is, 
cochain complexes; boundedness conditions may be included here, if desired. As 
for morphisms, remember that the whole point of the derived category is to ‘invert 
quasi-isomorphisms’; the most direct way to do this is essentially the same as the 
process that allows us to ‘invert all nonzero elements of an integral domain’, leading 
to the construction of the field of fractions (see §V.4.2). This process, in a more 
general version dealing with inverting ‘multiplicative subsets’ of a ring, is called 
localization (Exercise V.4.7). Categories may be localized as well. For example, the 
homotopic category K(A) may be viewed as the localization of C(A) with respect 
to homotopy equivalences: these become isomorphisms in K(A). Localizing K(A) 
with respect to homotopy classes of quasi-isomorphisms yields D(A). 

In necessarily oversimplified terms, suppose you want to make a class of morphisms
of a category C invertible. In the new category, any ‘roof’ diagram

            Z
        α ↙   ↘ β
        A - - → B

with α in that class will determine a morphism β∘α^{-1} : A → B; that is, the dashed
arrow will have to exist as a morphism in the new category. These are the analogs
of the ‘fractions’ in the field of fractions; the localization of C may be defined by
taking the same objects as in C and setting morphisms in the new category to be
compositions of ‘roofs’, up to a suitable equivalence relation. For example, all roofs


            Z
        α ↙   ↘ α
        A       A

with α in the chosen class should be identified to one another and to id_A in the
localized category; the roof

            Z
        α ↙   ↘ id_Z
        A       Z

will work as the inverse of α in the localization.

As the reader can imagine, working out the construction in earnest requires 
dealing with nontrivial technical issues. These are more manageable when the 
class of morphisms to be inverted is ‘localizing’ (for example, the composition of 
two such morphisms should also be a morphism in the class); for example, this
condition allows us to represent the composition of two roofs as a roof. To mention
one set-theoretic difficulty, one should make sure that morphisms A → B in the
localization still form a set, even if the roofs connecting A to B may a priori form
a proper class. 

With all these caveats in mind, the localization process can be carried out 
on K(A) with respect to quasi-isomorphisms and does produce a category D(A) 
satisfying the universal property for derived categories described in §4.2. If A has 
(for example) enough projectives, then the bounded version D^-(A) is equivalent to
the homotopy category K^-(P) of complexes of projectives, as we have endeavored
to justify in §6.3. The latter should be viewed as a convenient way to ‘compute’
the former in favorable circumstances. 

Summarizing, a morphism L* → M* in the derived category D(A) of an abelian
category A can be represented as a roof diagram

                          N*
        quasi-isomorphism ↙   ↘
                        L*      M*

of (homotopy classes of) morphisms of cochain complexes, modulo a suitable
equivalence relation. Note that the construction is applied to K(A), rather than the
simpler-minded C(A): for one thing, one might as well start from K(A), since the
functor C(A) → D(A) will have to factor through the homotopic category
(Proposition 5.4). In any case it just so happens that, unfortunately, quasi-isomorphisms
do not form a localizing class of morphisms in C(A), while their homotopy classes
are localizing in K(A). So it goes.

Once we believe that this prescription does define a category — for example, that 
the composition of two roofs can be represented by a roof and that this composition 
is associative — then it is essentially clear that D(A) satisfies the universal property 
specified in §4.2. However, the real benefits of introducing the derived category lie 
deeper, for example in its structure as a triangulated category. A brief discussion 
of this concept follows. 

The reader should check that the composition of two roofs is indeed a roof
(Exercise 9.2); this may be easier after acquiring a little familiarity with
distinguished triangles. Also, note that the diagram in Exercise 9.2 does not commute
in C(A), but it does in K(A). This is an indication that it would be problematic to
go ‘directly’ from C(A) to D(A).

9.2. Triangulated categories. One reason for not treating derived categories to
any real extent in this book is that to do so would require us to define precisely the
notion of triangulated category. The same applies in fact to homotopic categories
of complexes. We have pointed out in §5.2 that these categories should not be
expected to be (and indeed are not) abelian, but they preserve enough structure
to make sense of, for example, long exact cohomology sequences. The fact that
derived functors ended up having long exact sequences associated to them (§7.4)
bears witness to this fact.

The essential ingredients of a triangulated (additive) category are a ‘translation’
functor, which in the case of K(A) or D(A) is realized as the shift functor L* ↦ L[1]*;
and a class of diagrams

    A → B → C --(+1)--> A

where the morphism labeled ‘+1’ acts as C → A[1]; these diagrams are called
distinguished triangles. A common alternative notation is

    A → B → C → A[1].

Distinguished triangles are required to satisfy several axioms. Among them,

    0 → A --(id_A)--> A --(+1)--> 0

is required to be distinguished for every A; every morphism α : A → B must be
a side of a distinguished triangle; triangles isomorphic to a distinguished triangle
(in the evident sense) must be distinguished; and distinguished triangles can be
‘rotated’:

    A --α--> B --β--> C --(+1)--> A

is distinguished if and only if

    B --β--> C → A[1] --(+1)--> B

is distinguished. There is more, including an infamous octahedral axiom (so named
since a popular way to state it invokes an octahedral diagram). The reader will
have no difficulties locating this information in the literature.

The category K(A) is triangulated: the translation functor is the basic shift of
a complex, and distinguished triangles are those isomorphic to

    L* --α*--> M* → MC(α)* --(+1)--> L*

That is, the ‘third vertex’ of a triangle with an assigned side α* : L* → M* is the
mapping cone of α*, and the unlabeled sides are the natural cochain morphisms
M* → MC(α)*, MC(α)* → L[1]* studied in §4.1.

It would be problematic to make the same choice in C(A), since while

    0 → L* --(id_L*)--> L* --(+1)--> 0

would be distinguished as needed, since L* is the mapping cone of the zero-morphism
0 → L*, the rotation of this triangle, i.e.,

    L* --(id_L*)--> L* → 0 --(+1)--> L*

would not seem to be, since the mapping cone of the identity is not 0. But the
mapping cone of the identity is homotopically equivalent to 0 (Exercise 9.3), so the
rotated triangle is indeed distinguished in K(A), as it should be, according to the
axioms. More generally, it is easy to see (Exercise 9.4) that if α* : L* → M* is a
cochain morphism, so that we have a distinguished triangle,

    L* --α*--> M* --β*--> MC(α)* --(+1)--> L*

then the mapping cone of β* is in fact homotopy equivalent to L[1]*; the rotation

    (*)    M* --β*--> MC(α)* → L[1]* --(+1)--> M*

is distinguished as required, as the reader should check.

Applying the cohomology functor to a distinguished triangle in K(A) yields an
exact triangle: indeed, as we have just seen, we may assume that the triangle is (*)
(up to isomorphism in K(A)), and taking cohomology gives the exact triangle

    H*(M*) → H*(MC(α)*) → H*(L[1]*) --(+1)--> H*(M*)

obtained in Proposition 4.1. In general, a cohomological functor on a triangulated
category is an additive functor to an abelian category, mapping distinguished trian- 
gles to exact triangles and hence inducing long exact sequences. Thus, cohomology 
is a cohomological functor on K(A); this will not surprise our reader. The functors 
Hom(A,_) and Hom(_, A) (to Ab) are cohomological functors on every triangulated 
category, for every object A. 
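
To make the phrase ‘inducing long exact sequences’ concrete, here is a sketch of the unrolling. It relies only on the rotation axiom quoted earlier; the letter H for the cohomological functor and the shorthand H^n(X) := H(X[n]) are notation introduced here for illustration only.

```latex
% H: a cohomological functor on a triangulated category (notation assumed here).
% Start from a distinguished triangle  A -> B -> C -> A[1]  and rotate it repeatedly:
%   B -> C -> A[1] -> B[1],    C -> A[1] -> B[1] -> C[1],   ...
% Each rotated triangle is again distinguished, so H is exact at each spot:
\[
\cdots \to H(A) \to H(B) \to H(C) \to H(A[1]) \to H(B[1]) \to H(C[1]) \to \cdots
\]
% With H^n(X) := H(X[n]) this is the familiar shape
\[
\cdots \to H^n(A) \to H^n(B) \to H^n(C) \to H^{n+1}(A) \to H^{n+1}(B) \to \cdots
\]
```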

The derived category D(A) and its bounded versions are triangulated categories,
with distinguished triangles described as above, that is, isomorphic (in D(A)) to
the basic triangles determined by mapping cones. The natural functor K(A) → D(A)
is a functor of triangulated categories, in the sense that it preserves the translation
functor and it sends distinguished triangles to distinguished triangles.

Isomorphism in the derived category is a less stringent notion than in the
homotopic category, so there are ‘more’ distinguished triangles in D(A) than in
K(A). For example, if

    0 → L* --α*--> M* --β*--> N* → 0

is a short exact sequence, as above, there is in general no distinguished triangle

    L* --α*--> M* --β*--> N* --(γ*; +1)--> L*

in K(A) (for any choice of γ*): indeed, N* need not³⁸ be homotopically equivalent
to MC(α)*. But such triangles do exist in D(A)!

In fact, we are now well-equipped to make (better) sense of the mysterious
remarks at the end of §3.4. The ‘special triangles’

    L* --α*--> M* --β*--> N* --(0; +1)--> L*

arising as in §3.4 from a short exact sequence

    0 → L* --α*--> M* --β*--> N* → 0

are not distinguished in K(A), and in general no replacement for the zero-morphism
will fix this problem. On the other hand, the reader can now verify (Exercise 9.5)
that there is a quasi-isomorphism MC(α)* → N*: this can be shown by computing
38 Why? Construct an example showing this. 







explicitly another mapping cone and applying Corollary 4.2. Thus, in the derived
category the standard distinguished triangle

    L* --α*--> M* → MC(α)* --(+1)--> L*

is isomorphic to a triangle

    L* --α*--> M* --β*--> N* --(γ*; +1)--> L*

which is therefore distinguished in D(A) (cf. Exercise 9.6). Applying the cohomology
functor to this triangle directly gives the exact triangle

    H*(L*) → H*(M*) → H*(N*) --(+1)--> H*(L*)

expressing the long exact cohomology sequence as in §3.4, without having to single
out the +1 morphism for special consideration. In retrospect, the zero-morphism 
we insisted on using when folding ‘special’ triangles in §3.4 turns out to be an 
artifact: placing the triangle in the derived category shows that there is a more 
informed choice for this morphism. This choice is not available in C(A) and is not 
even in K(A), but it is available in D(A) and leads transparently to the long exact 
sequence in cohomology. 

If ℱ : A → B is a functor between abelian categories, the induced functor
K(ℱ) : K(A) → K(B) is a functor of triangulated categories, and so are the bounded
variations K^±(ℱ) : K^±(A) → K^±(B). The derived functors Lℱ : D^-(A) → D^-(B) and
Rℱ : D^+(A) → D^+(B) encountered in §7.2 are also functors of triangulated categories.
This fact is responsible for the good properties of derived functors, such as the long
exact sequences studied in §7.4.


9.3. Spectral sequences. We mentioned in §8.3 that ‘spectral sequences’ may be 
used to compute the cohomology of a double complex. There unfortunately does
not seem to be an easy entry point into the subject of spectral sequences, which is
marred by intrinsic notational complexity. We will limit ourselves to a very scant
description of the concept and some information linking it back to §8.4. Our
motivation is that, in the literature, references to results such as Theorem 8.12 are often
replaced by sentences such as ‘by an immediate spectral sequence argument…’,
and we should try to explain to the reader what this means.

Given that we were thinking about triangles a moment ago, the following
construction may be helpful in appreciating the notion of spectral sequence. Let

    A --α--> A --β--> E --γ--> A

be an exact triangle³⁹ in an abelian category; we are placing no assumption on
the ‘degree’ of γ (and indeed, we are not assuming we have defined a translation
functor). Let d : E → E be the composition β∘γ; since γ∘β = 0, we have

    d∘d = β∘(γ∘β)∘γ = 0.

Thus we can take the homology of E with respect to d and define

    E′ := ker d / im d.

Further, we can let A′ := im α and let

• α′ : A′ → A′ be the restriction of α to A′;

• β′ : A′ → E′ be defined by β′(α(a)) := [β(a)];

• γ′ : E′ → A′ be defined by γ′([e]) := γ(e),

where [e] denotes the class of e ∈ ker d in E′. (Since β∘γ(e) = d(e) = 0, γ(e) ∈ ker β =
im α = A′.) All needed verifications (such as the independence on the representative
e of [e] in the third prescription) are straightforward, from the exactness of the given
triangle.
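
For the reader who wants to see the ‘straightforward’ verifications once, here is a sketch in element notation (as when chasing elements in a category of modules). It uses only the exactness relations im α = ker β, im β = ker γ, im γ = ker α of the given triangle, and d = β∘γ.

```latex
% Well-definedness of \beta' and \gamma', using only exactness of the given triangle.
\begin{align*}
&\text{$\beta(a)$ is a cycle:}
  && d(\beta(a)) = \beta\bigl((\gamma\circ\beta)(a)\bigr) = 0;\\
&\text{$\beta'$ does not depend on $a$:}
  && \alpha(a)=\alpha(a') \implies a-a'\in\ker\alpha=\operatorname{im}\gamma
     \implies \beta(a-a')\in\operatorname{im}(\beta\circ\gamma)=\operatorname{im} d;\\
&\text{$\gamma'$ does not depend on $e$:}
  && e-e' = d(x)=\beta(\gamma(x)) \implies
     \gamma(e)-\gamma(e') = (\gamma\circ\beta)(\gamma(x)) = 0.
\end{align*}
```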


Claim 9.1. The new triangle

    A′ --α′--> A′ --β′--> E′ --γ′--> A′

is again exact.


The reader will have no difficulty proving this statement (Exercise 9.7). 

The datum of an exact triangle as above is called an exact couple, which we find 
confusing since a triangle has three vertices (but it is true that only two objects are 
involved here); the new exact triangle (couple) obtained in Claim 9.1 is the derived 
couple, which we find even more confusing since there is no derived category or 
functor in sight. Exact couples arose in topology (they are due to W. Massey), and 
the terminology reflects their origin. 

Of course we can turn the crank at will and get derived couple after derived
couple: let A_1 = A, E_1 = E, etc., and inductively define A_{i+1} := A_i′, E_{i+1} := E_i′,
etc. For example, β_{i+1} is obtained by ‘rewinding’ the morphism α a total of i times
and then applying the original morphism β: we can do this since A_{i+1} is the image
of α^i.

39 That is, im α = ker β, im β = ker γ, and im γ = ker α.

Thus, one exact triangle produces a whole sequence of triangles and in particular
a sequence of objects E_i, each endowed with a differential d_i, such that
E_{i+1} ≅ ker d_i / im d_i.
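
A small example may help in seeing the crank turn. The couple below is an illustration chosen here, not one taken from the text: all three objects are Z/4Z and all three morphisms are multiplication by 2, so every image and every kernel is {0, 2} and the triangle is exact.

```latex
% Illustrative exact couple (an assumption of this sketch, not from the text):
%   A = E = \mathbb{Z}/4,   \alpha = \beta = \gamma = multiplication by 2.
\begin{align*}
d_1 &= \beta\circ\gamma = 4 = 0,
  & E_2 &= \ker d_1/\operatorname{im} d_1 = \mathbb{Z}/4,\\
A_2 &= \operatorname{im}\alpha = \{0,2\},
  & \alpha_2 &= \alpha|_{A_2} = 0,\\
\beta_2(2) &= \beta_2(\alpha(1)) = [\beta(1)] = [2],
  & \gamma_2([e]) &= \gamma(e) = 2e,\\
d_2([e]) &= \beta_2(\gamma_2([e])) = [2e],
  & E_3 &= \ker d_2/\operatorname{im} d_2 = \{0,2\}/\{0,2\} = 0.
\end{align*}
```

So the first differential vanishes while the second does not: nothing happens at the first turn of the crank, and everything cancels at the second.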

Definition 9.2. A spectral sequence {(E_i, d_i)}_{i=1,2,...} is a sequence of objects E_i
and morphisms d_i : E_i → E_i in an abelian category, such that d_i∘d_i = 0 and
E_{i+1} ≅ ker d_i / im d_i. ⌟


Thus, exact couples are a way to produce spectral sequences. There is a sense
in which we can turn the crank ‘infinitely many times’; in fact, this can be done
with any spectral sequence. To see this, let

    Z_1 = E_1,    B_1 = 0,

so that E_1 ≅ Z_1/B_1; inductively, assume we have defined Z_i ⊆ Z_1, B_i ⊆ Z_i so
that E_i ≅ Z_i/B_i, and let Z̄_{i+1} = ker d_i, B̄_{i+1} = im d_i; define Z_{i+1}, B_{i+1} as the
corresponding subobjects of Z_i. Then

    E_{i+1} ≅ Z̄_{i+1}/B̄_{i+1} ≅ Z_{i+1}/B_{i+1},

realizing E_{i+1} as a subquotient⁴⁰ of E_1, and

    B_1 ⊆ ··· ⊆ B_i ⊆ B_{i+1} ⊆ ··· ⊆ Z_{i+1} ⊆ Z_i ⊆ ··· ⊆ Z_1.

Define⁴¹

    B_∞ := ∪_i B_i,    Z_∞ := ∩_i Z_i,    E_∞ := Z_∞/B_∞.

This ‘ultimate’ subquotient E_∞ is the limit of the spectral sequence; it is common
to say that the spectral sequence E_r abuts to E_∞. By definition, if d_r = d_{r+1} =
··· = 0, then Z_∞ = Z_r and B_∞ = B_r, so that E_∞ ≅ E_r; in this case we say that
the sequence collapses at E_r.


The reader will verify (Exercise 9.8) that if the spectral sequence arises from
an exact couple, as above, then

    Z_∞ = γ^{-1}(∩_r im α^r),    B_∞ = β(∪_r ker α^r),

so the limit E_∞ has a concrete realization in this case.
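
As a sanity check of these two formulas, consider the following illustrative couple (chosen here, not taken from the text): A = E = Z/4Z, with α, β, γ all equal to multiplication by 2. It is exact, since every image and every kernel equals {0, 2}.

```latex
% Illustrative exact couple: A = E = \mathbb{Z}/4, \alpha = \beta = \gamma = 2.
\begin{align*}
\alpha^2 = 0 &\implies \bigcap_r \operatorname{im}\alpha^r = 0,
  \qquad \bigcup_r \ker\alpha^r = \mathbb{Z}/4,\\
Z_\infty &= \gamma^{-1}(0) = \ker\gamma = \{0,2\},\\
B_\infty &= \beta(\mathbb{Z}/4) = \{0,2\},\\
E_\infty &= Z_\infty/B_\infty = 0.
\end{align*}
```

This agrees with turning the crank by hand on the same couple: the first differential β∘γ is 0, the second is multiplication by 2 on Z/4Z, and the third term is already 0.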

Typically, a spectral sequence is as interesting as its limit, especially if some
fortunate circumstance causes its collapse very soon; it is not too uncommon to
have situations in which E_∞ ≅ E_2. If all goes well, this translates into a useful
relation between the given ‘input’ E_1 and the interesting ‘output’ E_∞.


40 That is, as a quotient of a subobject. 

41 Here we are implicitly working in a category R-Mod of modules over a ring, containing
the given abelian category A (invoking the Freyd-Mitchell theorem). The infinite unions and
intersections used here make sense in R-Mod; the limits and the corresponding E_∞ are therefore
defined as objects in R-Mod and in general not as objects of A. But if, e.g., the sequence collapses
at E_r, then B_∞ = B_r, Z_∞ = Z_r, and E_∞ exists in A. In practice, this difficulty will play no role
in the considerations that follow.

Where is my double complex? We promised that spectral sequences could be
used to ‘compute’ the cohomology of the total complex of a double complex. We
will now see how a double complex gives rise to an exact couple and hence to a
spectral sequence. This is a particular case of a useful mechanism producing an
exact sequence from a filtration.

A ‘descending filtration’ of M consists of a sequence of subobjects:

    M ⊇ ··· ⊇ M_m ⊇ M_{m+1} ⊇ ··· .

For example, the series of subgroups of a group G considered in §IV.3.1 are
filtrations of G. If M is an object of an abelian category, a filtration as above determines
an associated graded object

    gr(M) := ⊕_m gr(M)_m,    gr(M)_m := M_m / M_{m+1}.

We will assume this is an object of the same category for simplicity, although
this is not necessarily the case (abelian categories are not necessarily closed under
infinite direct sums); since only finitely many objects will be involved in any specific
computation, this plays no role.

Start with a double complex M*,*; we will assume that M^{i,j} = 0 for i < 0,
j < 0:

    [Diagram: the array of objects M^{i,j} with i, j ≥ 0, with horizontal differentials d_h
    and vertical differentials δ_v.]

As always, zero-objects to the left and below the given part of the diagram are
implicit; we are assuming (as in Definition 8.5) that the diagram anticommutes, so
that the differential of the total complex TC(M)* is simply d_h + δ_v. Analogous
results can be obtained if M^{i,j} = 0 for i > 0, j > 0 and in fact under more relaxed
hypotheses; such considerations are left to the reader.

Let T* = TC(M)* denote the total complex: therefore, T^k = ⊕_{i+j=k} M^{i,j}.

This complex admits (at least) two filtrations: a horizontal filtration and a vertical
filtration. We will focus on the vertical one, and again it is understood that a
parallel discussion would hold for the horizontal one. The vertical filtration T*_m is
defined by chopping off the terms to the left of a vertical bar i = m in the diagram.

    [Diagram: the same array, with the columns to the left of i = m removed.]

That is, we set

    T^k_m := ⊕_{i+j=k, i≥m} M^{i,j},

still with differential d_h + δ_v. We will denote the corresponding graded object
by gr_v^*(T), to record the fact that this arises from the vertical filtration. (The *
reminds us that this is still a complex.) Explicitly, the term of ‘filtration degree’ m
in gr_v^*(T) is

    gr_v^k(T)_m = T^k_m / T^k_{m+1} = (⊕_{i≥m} M^{i,k−i}) / (⊕_{i≥m+1} M^{i,k−i}) ≅ M^{m,k−m}.
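
To fix ideas, here is the vertical filtration written out in total degree k = 2 for a double complex with M^{i,j} = 0 when i < 0 or j < 0. This is only an unwinding of the definitions, in the notation T^k_m for the degree-k part of the m-th filtration step.

```latex
% Total degree k = 2 in a first-quadrant double complex M^{i,j}.
\begin{align*}
T^2_0 &= M^{0,2}\oplus M^{1,1}\oplus M^{2,0}, &
T^2_1 &= M^{1,1}\oplus M^{2,0}, &
T^2_2 &= M^{2,0}, &
T^2_3 &= 0,\\
\operatorname{gr}^2_v(T)_0 &\cong M^{0,2}, &
\operatorname{gr}^2_v(T)_1 &\cong M^{1,1}, &
\operatorname{gr}^2_v(T)_2 &\cong M^{2,0}.
\end{align*}
```

Each graded piece picks out exactly one entry of the antidiagonal i + j = 2, which is why the filtration degree m can later double as the column index i.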

The differential d_h + δ_v induces the differential δ_v on this graded piece, since d_h is
zero modulo the next piece of the filtration. The modules on the original array can
therefore be labeled as follows:

        ⋮               ⋮               ⋮               ⋮
    gr_v^2(T)_0     gr_v^3(T)_1     gr_v^4(T)_2     gr_v^5(T)_3     ···
    gr_v^1(T)_0     gr_v^2(T)_1     gr_v^3(T)_2     gr_v^4(T)_3     ···
    gr_v^0(T)_0     gr_v^1(T)_1     gr_v^2(T)_2     gr_v^3(T)_3     ···

Here the entry in column i and row j is gr_v^{i+j}(T)_i ≅ M^{i,j}, and pieces with the
same degree in gr_v^*(T) lie along the same antidiagonal.

The filtration on T* also determines a filtration on the cohomology of T*: we
can take H*(T*)_m to be the image in H*(T*) of H*(T*_m). Thus, we also have
a graded object gr_v H*(T*). The relation between H*(gr_v^*(T)) and gr_v H*(T*) is
subtle: this relation is what spectral sequences will help us understand.

The monomorphisms T*_{m+1} ⊆ T*_m define a monomorphism ⊕_m T*_{m+1} → ⊕_m T*_m,
of which gr_v^*(T) is the cokernel: we have an exact sequence

    (†)    0 → ⊕_m T*_{m+1} → ⊕_m T*_m → gr_v^*(T) → 0.

As we know since §3.3, this determines an exact triangle (couple!)

    ⊕_m H*(T*_{m+1}) --α--> ⊕_m H*(T*_m) --β--> H*(gr_v^*(T)) --γ--> ⊕_m H*(T*_{m+1}).

Note that α decreases the filtration degree by 1, leaving the cohomology degree k
unchanged; β leaves both unchanged; and γ increases both by 1. This is particularly
clear if one draws a small slice of the sequence (†), sufficient to see the connecting
morphism in action on a piece of given filtration and cohomology degrees:

    0 → T^{k+1}_{m+1} → T^{k+1}_m → gr_v^{k+1}(T)_m → 0
             ↑                ↑               ↑
    0 →  T^k_{m+1}    →   T^k_m   →   gr_v^k(T)_m   → 0

Definition 9.3. The spectral sequence of the double complex M*,* (with respect to
the vertical filtration) is the spectral sequence determined by this exact couple. ⌟

We will denote this spectral sequence by _vE, with the absurdly positioned index
v recording that _vE arises from the vertical filtration. The horizontal filtration
leads likewise to a spectral sequence _hE.

All terms _vE_r of the spectral sequence are subquotients of the first one, _vE_1 =
H*(gr_v^*(T)).

Remark 9.4. The differential d_r = β_r∘γ_r acts by increasing the cohomological
degree k by 1 and the filtration degree m by r: indeed, γ_r is simply induced by γ,
so it increases both k and m by 1 (as observed above); β_r is obtained by applying
β after undoing (r − 1) times the effect of α (as pointed out after Claim 9.1), so it
does not change k but increases m by (r − 1). ⌟

Interest in the sequence rests on the interpretation for its limit in the statement
that follows, which also summarizes the situation.

Theorem 9.5. Let M*,* be a double complex in an abelian category. Assume that
M^{i,j} = 0 for i < 0, j < 0, and let T* be the total complex of M*,*. Then, with
notation as above, there exists a spectral sequence {(_vE_i, d_i)}_i such that

    _vE_1 ≅ H*(gr_v^*(T)),    _vE_∞ ≅ gr_v H*(T*).

In other words, ‘turning the crank’ moves the gr from inside the cohomology
to outside of it. The limit _vE_∞ does not quite compute the cohomology of the
total complex, as we glibly announced in §8.3, but it computes the graded object
determined by a filtration on the cohomology of the total complex, and this is good
enough for many applications.

Proof. The limit _vE_∞ can be computed directly from the exact couple
(Exercise 9.8), as a subquotient of _vE_1: it is the quotient Z_∞/B_∞, where

    Z_∞ = γ^{-1}(∩_r im α^r),    B_∞ = β(∪_r ker α^r).

The computation is streamlined by focusing on a specific degree k for the
cohomology and m for the grading. The corresponding part in _vE_1 is H^k(T*_m/T*_{m+1}), and
α acts as H^k(T*_m) → H^k(T*_{m−1}). Note that

    if r ≫ 0, then H^k(T*_{m+r}) = 0,    H^k(T*_{m−r}) = H^k(T*),

under our boundedness hypothesis: indeed, T^j_i = 0 if i > j, and T*_i = T* for i ≤ 0.
By the first of these observations, the piece sent by α^r to H^k(T*_{m+1}) is 0 for r ≫ 0,
and it follows that ∩_r im α^r = 0. Therefore,

    Z_∞ = ker γ = im β.

By the second observation, ker α^r stabilizes to ker(H^k(T*_m) → H^k(T*)) for each k,
hence

    B_∞ = β(⊕_m ker(H*(T*_m) → H*(T*))).

Denote by ν_m : H*(T*_m) → H*(T*) the morphisms induced in cohomology by the
monomorphisms T*_m → T* and by ν = ⊕_m ν_m their direct sum. We have morphisms

    _vE_1 <--β-- ⊕_m H*(T*_m) --ν--> ⊕_m H*(T*),

and we have obtained

    _vE_∞ = im β / β(ker ν).

By the innocent Exercise 2.12,

    im β / β(ker ν) ≅ im ν / ν(ker β);

by the exactness of the original couple,

    ker β = im α = ⊕_m im(H*(T*_{m+1}) → H*(T*_m)),

hence

    ν(ker β) = ⊕_m im ν_{m+1}.

Therefore

    _vE_∞ ≅ ⊕_m (im ν_m / im ν_{m+1}) = gr_v H*(T*),

as stated.

□
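
The isomorphism between the two quotients used in the proof can be seen as follows in a category of modules. This is a sketch of one possible argument, and we do not claim it is the one intended in Exercise 2.12; here β and ν stand for any two homomorphisms with a common source A, with targets denoted E and N for the purpose of this sketch.

```latex
% \beta : A \to E and \nu : A \to N are homomorphisms with common source A.
% The composite  A \to \operatorname{im}\beta \to \operatorname{im}\beta/\beta(\ker\nu)
% is surjective, and a lies in its kernel iff \beta(a)=\beta(x) for some x \in \ker\nu,
% i.e. iff a \in \ker\beta + \ker\nu.  The same holds with \beta and \nu exchanged, so
\[
\frac{\operatorname{im}\beta}{\beta(\ker\nu)}
\;\cong\; \frac{A}{\ker\beta+\ker\nu}
\;\cong\; \frac{\operatorname{im}\nu}{\nu(\ker\beta)} .
\]
```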


Theorem 9.5 is a substantial generalization of Theorems 8.9 and 8.12. To see
that it implies Theorem 8.12, note again that

    gr_v^k(T) = ⊕_m T^k_m / T^k_{m+1} ≅ ⊕_m M^{m,k−m}

with its ‘vertical’ differential δ_v. The _vE_1 term of the spectral sequence is the
cohomology of this complex, that is, the cohomology of the columns of the original
double complex, suitably shifted. If the cohomology of each column M^{i,*} is
concentrated in degree 0, then _vE_1 is simply the complex

    (*)    ··· → 0 → H^0(M^{0,*}) → H^0(M^{1,*}) → H^0(M^{2,*}) → ···

hence _vE_2 is the cohomology of this complex, and higher differentials of the spectral
sequence are 0 by degree considerations. It follows that the sequence collapses
at _vE_2, and by Theorem 9.5 this says that gr_v(H*(T*)) is isomorphic to the
cohomology of (*). In this case we clearly have gr_v(H*(T*)) ≅ H*(T*), so the conclusion
reproduces the corresponding statement in Theorem 8.12. This is what is meant
by the incantation ‘by an immediate spectral sequence argument’ we mentioned in
the beginning of this subsection.
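
The ‘degree considerations’ invoked above can be spelled out as follows. This is a sketch using only Remark 9.4 and the fact that the piece of the first term in cohomological degree k and filtration degree m is the cohomology of the m-th column in degree k − m.

```latex
% (k, m) = (cohomological degree, filtration degree).
% Column cohomology concentrated in degree 0  <=>  the first term vanishes unless k - m = 0.
% By Remark 9.4,  d_r : (k, m) \mapsto (k+1, m+r),  so along d_r the quantity k - m changes by
\[
(k+1) - (m+r) \;-\; (k-m) \;=\; 1 - r .
\]
% For r \ge 2 this is nonzero: a differential leaving a spot with k - m = 0 lands where
% k - m = 1 - r \neq 0, and one arriving there starts where k - m = r - 1 \neq 0.
% Every term of the sequence is a subquotient of the first one, so both spots are zero
% and d_r = 0 for all r \ge 2.
```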

This reasoning and other applications of the spectral sequence of a double
complex are easier to understand if the various degrees and shifts are visualized in
a more compelling way. View the original double complex

       ⋮           ⋮           ⋮           ⋮
       ↑           ↑           ↑           ↑
    M^{0,2}     M^{1,2}     M^{2,2}     M^{3,2}     ···
       ↑           ↑           ↑           ↑
    M^{0,1}     M^{1,1}     M^{2,1}     M^{3,1}     ···
       ↑           ↑           ↑           ↑
    M^{0,0}     M^{1,0}     M^{2,0}     M^{3,0}     ···

as an _vE_0 term in the sequence. The horizontal differentials are dotted out to
highlight the fact that _vE_1 is obtained by taking the cohomology of the columns;
this gives

    H^2(T*_0/T*_1) → H^3(T*_1/T*_2) → H^4(T*_2/T*_3) → H^5(T*_3/T*_4) → ···
    H^1(T*_0/T*_1) → H^2(T*_1/T*_2) → H^3(T*_2/T*_3) → H^4(T*_3/T*_4) → ···
    H^0(T*_0/T*_1) → H^1(T*_1/T*_2) → H^2(T*_2/T*_3) → H^3(T*_3/T*_4) → ···

For example, the part in degree 2 of H^3(gr_v^*(T)) is the cohomology of the second
column M^{2,*} computed at gr_v^3(T)_2 = M^{2,1}, so we placed the corresponding object
H^3(T*_2/T*_3) in position (2, 1). The differential of _vE_1 increases both the
cohomological degree and the filtration degree by 1, as indicated. Such diagrams are somewhat
hard to parse. To simplify dealing with the indices, it is common to assign degrees
(i, j) to the terms in _vE_1 according to the indices of the objects M^{i,j} = M^{m,k−m}
at which the cohomology is computed; that is, set

    _vE_1^{i,j} = H^{i+j}(T*_i / T*_{i+1}).

Then the same array is drawn as

    _vE_1^{0,2} → _vE_1^{1,2} → _vE_1^{2,2} → _vE_1^{3,2} → ···
    _vE_1^{0,1} → _vE_1^{1,1} → _vE_1^{2,1} → _vE_1^{3,1} → ···
    _vE_1^{0,0} → _vE_1^{1,0} → _vE_1^{2,0} → _vE_1^{3,0} → ···

The next term _vE_2 is obtained by taking the cohomology of _vE_1. Its differential
acts by increasing m by 2 and k by 1, that is, i by 2 and j by −1. We can draw
this as

    [Diagram: the array of terms _vE_2^{i,j}, with arrows from position (i, j) to position (i+2, j−1).]

In general, _vE_r is the cohomology of _vE_{r−1}, and its differential increases m by r
and k by 1 (Remark 9.4); that is, (i, j) ↦ (i+r, j−(r−1)). Pictorially, differentials
at different stages act like this:

    [Diagram: three arrays labeled _vE_1, _vE_2, _vE_3, with arrows (i, j) → (i+1, j),
    (i, j) → (i+2, j−1), and (i, j) → (i+3, j−2), respectively.]

and so on. This picture of a ‘rotating’ differential may be the image that most people
have in mind when they think of a spectral sequence. The argument sketched
above, deriving Theorem 8.12 from Theorem 9.5, is probably clearer from this
point of view: if the cohomology of the columns is concentrated in degree 0, then
_vE_1 is concentrated along the bottom row of the array. It is then clear that all
differentials d_i for i ≥ 2 will have to be zero, since either their source or their target
is zero. Thus the sequence collapses at _vE_2 and yields the cohomology of the total
complex by Theorem 9.5. The other situation considered in Theorem 8.12 can be
analyzed by using the sequence obtained from the horizontal filtration.

Of course, spectral sequences are not limited to these simple applications. The
Grothendieck spectral sequence ‘computes’ the derived functor of the composition
of two functors⁴²: for example, if ℱ : A → B and 𝒢 : B → C are two right-exact
functors and ℱ sends projectives to projectives, then there is a spectral sequence
whose E_2 term collects the compositions L_p𝒢 ∘ L_qℱ and that abuts to L_{p+q}(𝒢∘ℱ).
For instance, if f : R → S is a homomorphism of commutative rings (so that
S may be viewed as an R-module), A is an R-module, and B is an S-module
(and hence an R-module, via f), then there is a ‘change-of-ring spectral sequence’
Tor^S_p(Tor^R_q(A, S), B) ⇒ Tor^R_{p+q}(A, B). In topology, the Serre spectral sequence can
be used to compute the homology of a fibration in terms of the homology of the base
and of the fiber. It would be completely futile for us to attempt to do any justice to
the range of applications of spectral sequences: the set of mathematicians X such
that there is an important X-spectral sequence includes (but is not limited to)
Adams, Atiyah, Barratt, Bloch, Bockstein, Bousfield, Cartan, Connes, Eilenberg,
Federer, Frölicher, Green, Grothendieck, Hirzebruch, Hochschild, Hodge, Hurewicz,
van Kampen, Kan, Künneth, Leray, Lichtenbaum, Lyndon, May, Miller, Milnor,
Moore, Novikov, de Rham, Quillen, Rothenberg, Serre, Steenrod, …

Many important spectral sequences may be derived as particular instances of
the Grothendieck spectral sequence mentioned above. The reader will have the
pleasure of seeing how this spectral sequence is put together by working out
Exercise 9.10.
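
As a small illustration of how such a sequence is used in practice, consider the change-of-ring spectral sequence quoted above under one extra assumption, which is ours and not the text's: that S is flat as an R-module. The sketch below only records what the sequence says in that special case.

```latex
% Assumption for this sketch: S is a flat R-module.
\[
\operatorname{Tor}^R_q(A,S) = 0 \ \text{ for } q>0,
\qquad \operatorname{Tor}^R_0(A,S) = A\otimes_R S ,
\]
% so the E_2 term is concentrated where q = 0; every later differential changes q,
% hence vanishes, the sequence collapses at E_2, and
\[
\operatorname{Tor}^S_p(A\otimes_R S,\ B) \;\cong\; \operatorname{Tor}^R_p(A,B)
\quad\text{for all } p .
\]
```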


Exercises 


9.1. Let A be an abelian category. Since the objects of K(A) and D(A) are simply
cochain complexes, a ‘universal Euler characteristic’ χ is defined for bounded
complexes in these categories (see Exercise 3.15). Prove that if

    A → B → C --(+1)--> A

is a distinguished triangle in K(A) or D(A), then χ(B) = χ(A) + χ(C).

9.2. ▷ Let A be an abelian category, and suppose two complexes L*, M* are
connected by an ‘upside-down roof’ (a ‘trough’?)

    L*          M*
       ↘      ↙ quasi-isomorphism
          N*

42 For a different viewpoint on this question, see Remark 7.4.

Prove that they are also connected by a regular roof: there exists a complex Ñ*
and morphisms Ñ* → L* and Ñ* → M* such that the diagram

                         Ñ*
    quasi-isomorphism ↙      ↘
                    L*          M*
                       ↘      ↙ quasi-isomorphism
                          N*

commutes in K(A). Deduce that the composition of two roofs may be consolidated
into a single roof in K(A). (Hint: Let α* : L* → N*, β* : M* → N* be the given
morphisms, with β* a quasi-isomorphism. Consider the composition γ* : L* →
N* → MC(β)*, and let Ñ* := MC(γ)[−1]*. We have Ñ^i = L^i ⊕ M^i ⊕ N^{i−1}, with
morphisms to L^i, M^i given, respectively, by (ℓ, m, n) ↦ ℓ, (ℓ, m, n) ↦ −m; these
define morphisms of cochain complexes. The morphisms Ñ^i → N^{i−1} defined by
(ℓ, m, n) ↦ −n define the homotopy showing that the diagram commutes in K(A).
To verify that Ñ* → L* is a quasi-isomorphism, use the fact that MC(β)* is exact,
and rotate a triangle.) [§9.1]

9.3. ▷ Let α* : L* → M* be an isomorphism of cochain complexes. Prove that
MC(α)* is homotopy equivalent to 0. [§9.2, 9.4]

9.4. ▷ Let α* : L* → M* be a cochain morphism, and let β* : M* → MC(α)* be
the natural morphism. Prove that MC(β)* is homotopy equivalent to L[1]*. (Hint:
Do Exercise 9.3 first.) [§9.2]

9.5. ▷ Let

    0 → L* --α*--> M* --β*--> N* → 0

be a short exact sequence of cochain complexes on an abelian category.

• Prove that there is a cochain morphism γ* : MC(α)* → N* through which
β* : M* → N* factors.

• Prove that MC(γ)* is an exact complex. (Hint: Chase elements.)

• Conclude that MC(α)* is quasi-isomorphic to N*.

[§9.2]

9.6. ▷ Let

    0 → L* → M* → N* → 0

be a short exact sequence of cochain complexes on an abelian category A. Prove
that there is a morphism N* → L[1]* in the derived category D(A), inducing the
connecting morphism H^i(N*) → H^{i+1}(L*) in cohomology. [§9.2]

9.7. ▷ Prove Claim 9.1. [§9.3]

9.8. ▷ Let (E_r, d_r)_{r=1,2,...} be a spectral sequence arising from an exact couple

    A --α--> A --β--> E --γ--> A

Prove that the limit E_∞ is isomorphic to γ^{-1}(∩_r im α^r) / β(∪_r ker α^r). (Hint: In
fact, Z_{r+1} = γ^{-1}(im α^r) and B_{r+1} = β(ker α^r).) [§9.3]

9.9. Let _vE_r be the spectral sequence of a first-quadrant double complex from the
vertical filtration, as in §9.3. Prove that _vE_r^{i,j} ≅ _vE_∞^{i,j} for r > j + 1.

9.10. > Let A, B, C be abelian categories, and let & : A —* B, # : B — » C be 
additive functors. Assume that A and B have enough projectives, that & sends 
projectives of A to ^-acyclic objects of B, and that 'S is right-exact. 

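Let A be an object of A, and let $M^\bullet: \cdots \to M^{-2} \to M^{-1} \to M^0 \to 0$ be
a projective resolution of A. Let $P^{\bullet,\bullet}_{\mathscr F M}$ be a Cartan-Eilenberg resolution of the
complex $\mathscr F M^\bullet$ (as constructed in Corollary 7.9). Consider the complex

(†) $\cdots \to \mathscr G(P^{\bullet,-2}_{\mathscr F M}) \to \mathscr G(P^{\bullet,-1}_{\mathscr F M}) \to \mathscr G(P^{\bullet,0}_{\mathscr F M}) \to 0$

obtained by applying $\mathscr G$ to $P^{\bullet,\bullet}_{\mathscr F M}$.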

This is precisely the set-up of Exercise 8.8. Now take the double complex associated
with (†), and construct the spectral sequence corresponding to the horizontal
filtration. Prove that the $E_2$ term of this sequence has terms of the form

$$L_p\mathscr G(L_q\mathscr F(A)).$$

(Use Exercise 7.6 to compute the ${}_hE_1$ term.)

This is the Grothendieck spectral sequence mentioned at the end of §9.3. Prove
that it abuts to $L_\bullet(\mathscr G \circ \mathscr F)(A)$.

(Use the ‘horizontal’ version of Theorem 9.5, together with the computation of
the cohomology of the total complex from Exercise 8.8.) [§7.2, 7.6, §9.3]
∃, 2

∀, 3
», 275 
— , 11 

564 

11, 564 

see natural transformation 
$2^A$, see power set

$\sqrt{I}$, see radical of an ideal
⊕, see direct sum

⊗, see tensor product

a.c.c., see ascending chain condition 
Abel, Niels Henrik, 477 
Abel-Ruffini theorem, 477 
abelian category, 156, 559, 564 
abelian group, 45, 62 
finite, classification, 234 
finitely generated, 82, 234, 327 
abstract nonsense, 18 
abuts (spectral sequence), 688 
action, 64 
effective, 109 
faithful, 109 
free, 109 
of a group, 108 
of a ring, 156 
orbit of, 110
transitive, 110 
acyclic object, 662 
additive 
category, 561 
functor, 485, 561, 613 
adjoint functors, 492, 498, 642 
preserve limits/colimits, 493 
adjoint matrix, 331, 369 
affine 

algebraic set, 408 
space, 406 
algebra, 159 
commutative, 159 
division, 159 
finite, 171, 405 
finite type, 171, 405 
finitely generated, 171 
free commutative, 168 
Rees, 164, 528 
algebraic 

closure, 285, 400, 445 
element of an extension, 391 
geometry, 406 
number, 394 

algebraic closure, 285, 400 
of finite fields, 445 
algebraic set 
affine, 408 
projective, 533 


algebraically closed 
C is, 152, 286, 468 
algebraically independent set, 399 
Alperin, Roger, 114 
alternating group, 220, 473 
$A_n$ is simple for $n \ge 5$, 222
conjugacy in, 221 
is generated by 3-cycles, 223 
alternating multilinear map, 524 
annihilator, 544 
ideal of a module, 342 
Archimedes, 428 
Argand, Jean-Robert, 286 
Artin, Emil, 401 
Artinian ring, 250 
Artin-Rees lemma, 535 
Artin-Schreier 
extension, 481 
map, 482 
Artinian ring, 250 
ascending chain condition, 244 
for principal ideals, 248, 253, 275 
associate elements in a ring, 246 
associated 
graded object, 689 
prime, 347 
associative law, 42 
automorphism, 29, 49, 67, 192 
inner, 69, 86 
of a field extension, 390 
axiom of choice, 6, 262, 265, 307, 403 

Baer, Reinhold 
criterion, 549 
Baker, Alan, 302 
balanced bilinear map, 515 
Banach, Stefan, 262 
Banach-Tarski paradox, 262 
basis, 306 
ordered, 308 
standard, 316 
transcendence, 400 
Betti numbers, 336 
bifunctor, 679 
bijective, 11 
bilinear map, 501 
nondegenerate, 544 
nonsingular, 544 
bimodule, 517 
binary operation, 42 
Birkhoff-Vandiver theorem, 453
block matrix, 322 
Boole, George, 144 
Boolean ring, 144, 156, 300 
Burnside, William 
theorem, 214 
calculus, 433

cancellation, 45, 46, 121, 122
holds in integral domains, 122
canonical decomposition
in Grp, 97
in Ring, 141
in R-Mod, 162
in Set, 15

in abelian categories, 570
canonical form 
Jordan, 379 
rational, 375 

Cartan-Eilenberg resolution, 651, 675
category, 18
Ab, 62, 68
Fld, 385
G-Mod, 655
Grp, 58
G-Set, 111
k-Vect, 158
R-Alg, 159
Ring, 129
R-Mod, 158
Set, 20 
Set’, 21 
Seq(A), 603 
abelian, 156, 559, 564
additive, 561
comma, 24
coslice, 24
derived, 609, 681
discrete, 21, 489
equivalence of, 487
of affine algebraic sets, 488
of complexes, 594
of functors, 497
opposite, 26, 38
preadditive, 574
slice, 24
small, 18, 22
subcategory 
full, 26, 595 

triangulated, 602, 610, 618, 650, 683 
with coproducts, 37 
with products, 36 
Cauchy, Augustin Louis 
theorem, 103, 195 
Cayley, Arthur
graph, 74, 106
theorem, 110, 472

Cayley-Hamilton theorem, 365, 369, 376
center

of a group, 189
of a ring, 137, 159
centralizer, 137, 190, 191
chain, 261

change of basis, 319 


characteristic 
of a field, 386, 435 
of a ring, 141 

characteristic ideal of a module, 356
characteristic polynomial, 362 

is a multiple of the minimal polynomial, 
376 

Chinese remainder theorem (CRT), 235, 
291 

class formula, 187, 190 
classification 

of finite abelian groups, 234 
of finitely generated abelian groups, 82,
327 

of finitely generated modules over PIDs, 
349 
cochain 
complex, 591 
of a group, 657
coefficient, 125 
cofactors of matrices, 331 
cofree module, 551 
cohomological functor, 685 
cohomology, 174, 178, 553, 592 
as functor, 596 
long exact sequence, 600
of groups, 655 
coimage, 572 

isomorphic to image, 573 
coinvariant of a group action, 658 
cokernel, 104 

categorical definition, 491, 561
in Ab, 104
in Ring, 137
in R-Mod, 167
universal property, 104, 166
colimit, 490, 498
column space of a matrix, 333 
comma category, 24 
commutative, 45 
ring, 121 
commutator, 210 
of two subsets, 226 
companion matrix, 374 
completion of a ring, 498 
complex, 174, 335, 591 
cochain, 591 
double, 667 
exact, 175, 592 
in an abelian category, 576 
of complexes, 665
split exact, 621, 627
total, 606 

composite field, 393, 438 
composition 

factors, 207, 313 
series, 206, 313 
cone

mapping, 605, 623, 624 
of a functor, 489 
congruence, 54 
conjugacy class, 190 
in $S_n$, 216

conjugation, 110, 189 
connecting morphism, 180, 580 
conservative functor, 520 
constant, 125 
constructible 
geometric figures, 417 
numbers, 421 
regular polygons, 469 
content of a polynomial, 269 
contravariant functor, 484 
coordinate ring of an algebraic set, 413 
coproduct, 36 
fibered, 39, 173 
in an abelian category, 567 
of abelian groups, 62 
of algebras, 511 
of groups, 62, 100 
of modules, 165 
of rings, 133, 512 
universal property, 36 
cosets of a subgroup, 91 
coslice category, 24 
covariant functor, 484 
Cramer, Gabriel 
rule, 328, 332, 365 
cycle, 215 
notation, 215 
cyclic group, 54, 67, 103 
Z/pZ* is a, 67, 240 
infinite, 67, 70 
subgroup of, 82 

the multiplicative group of a finite field is 
a, 239, 442 

cyclic module, 174, 341 
cyclotomic 
field, 448, 480 

polynomial, 67, 289, 426, 446 
cylinder 
mapping, 615 

D-brane, 610 
D-module, 534 

d.c.c., see descending chain condition 
Dedekind, Richard, 256, 298 
degree, 125 
in a graded ring, 527 
inseparable, 439, 440 
separable, 436 
delta functor, 660, 680 
derivative of a polynomial, 433 
derived 


couple, 687 
functor, 642, 645 
composition, 645, 695 
left, 645 
right, 646 

derived category, 609, 681 
poor man’s, 635 
derived series of a group, 211 
descending chain condition, 250, 314 
determinant 
of a matrix, 328 
of a module, 526 
of an endomorphism, 361 
diagonalizable 

linear transformation, 368, 380 
real symmetric matrices are, 381 
diagram, 10 
chase, 180 
commutative, 10, 19 
in a category, 20 
differential 
form, 530 
of a complex, 592 
operator, 534 
dihedral group, 52, 225 
presentation, 57, 106 
dimension 
global, 679 
injective, 679 

Krull, of a ring, 153, 250, 354, 679 
of a vector space, 311 
projective, 678 
direct limit, 402, 491 
direct product, 61, 226 
direct sum, 62, 76, 77, 164 
in an abelian category, 570 
directed set, 491, 498 
Dirichlet, Peter Gustav Lejeune 
theorem on primes in arithmetic 
progressions, 454, 480 
discrete valuation ring, 260, 278 
discriminant, 473, 478, 482 
disjoint union, 4, 16 
distinguished triangle, 606, 683 
divisible 
module, 550 
division 
algebra, 159 
ring, 123, 128 
double complex, 667 
dual 

basis, 539 
double, 542 
group, 241 
kills torsion, 542 
module, 537 
Pontryagin, 555 
vector space, 537
DVR, 260, 278, 416 

echelon form, 325 
of a matrix, 325 
effaceable functor, 660 
effective action, 109 
eigenspace, 367, 379 

eigenvalue of a linear transformation, 365 
multiplicity of, 366, 367, 379 
eigenvector, 367 

Eisenstein, Ferdinand Gotthold Max 
criterion, 288 
element, 1 

of an object of an abelian category, 584 
elementary 
divisors, 237, 354 
operations, 320 
symmetric functions, 472 
endomorphism, 19, 119, 134, 359 
characteristic polynomial of an, 362 
determinant of an, 361 
trace of an, 362 
enough 

injectives, 550, 620 
projectives, 548, 620 
epimorphism, 29 

in an additive/abelian category, 563 
of groups, 104 
of modules, 167 
of rings, 132, 398 
of sets, 14 
split, 178, 547 
equalizers, 490 
equivalent categories, 487 
equivariant function, 111 
essentially surjective functor, 488 
Euclid, 261 

Euclidean algorithm, 257 
Euclidean domain, 255, 323, 341, 347 
=> PID, 256
Euler, Leonhard, 298
$\phi$-function, 58, 70, 87, 107, 301
characteristic 
of a complex, 335 
universal, 337, 605, 695 
theorem, 107 
exact 

couple, 687 

functor, 495, 588, 603, 613, 643 
faithfully, 499 
left- or right-, 495, 653 
sequence, 175, 228 

in an abelian category, 576 
of Ext, 552 
of pointed sets, 582 
of Tor, 509 


short, 176 

split, 177, 229, 621, 627 
triangle, 601, 684 
exactness property 
of ⊗, 507
of Hom, 537
of $T^\ell_R$, $S^\ell_R$, $\Lambda^\ell_R$, 531
of adjoint functors, 495 
of functors, 495 
of the duality functor, 539, 541 
exponential map, 64 
Ext, 551, 647 
long exact sequence, 552 
may be computed by resolving either 
argument, 553, 677 
why it is called Ext, 556 
extension 
algebraic, 391 
Artin-Schreier, 481 
degree of an, 386 
field, 163, 283 
finite, 386 

finitely generated, 393 
Galois, 458 
normal, 431 
of groups, 228 
of modules, 184, 555, 556 
of scalars, 518 
quadratic, 422 

contained in cyclotomic field, 482 
radical, 475 
separable, 436 
simple, 387, 449 
solvable, 475 
split, 177, 229, 231 
exterior 
algebra, 529 
power, 524 

factorial ring (UFD), 248 
factorization into irreducibles, 248 
unique, 248 
faithful 
action, 109 
faithfully exact, 499 
functor, 488, 587 
Feit-Thompson theorem, 212, 214 
Fermat, Pierre de 
last theorem, 280 
little theorem, 103, 107, 439 
primes, 427, 454 

theorem on sums of squares, 298, 299 
Ferrers diagram, 216 
fiber, 13 
square, 567 

fibered products and coproducts, 39, 173 
in an abelian category, 567, 577 
field, 123

algebraic closure, 285, 400, 401, 445
algebraically closed, 152, 285
characteristic of, 386, 435
composite, 393
cyclotomic, 448, 480
extension, 163, 283, 386
algebraic, 391
degree of a, 386
finite, 386

finite => algebraic, 391
finitely generated, 393
Galois, 458
normal, 431 
quadratic, 422, 482 
radical, 475 
separable, 436 
simple, 387, 449 
solvable, 475 
finite, 441 
fixed, 455 
Galois, 442 
intermediate, 392 
of fractions, 270 
of quotients, 270 
of rational functions, 273 
perfect, 435 

prime subfield of, 279, 386
splitting, 429 
filtration, 689 
finite field, 441 

finite separable extensions are simple, 450 
five-lemma, 185, 590 
short, 184, 589 
fixed field, 455 
flag of subspaces, 383 
flat 

criterion for flatness, 513, 514 
free =>, 508
is a local property, 513
module, 507, 552, 662
modules over local rings are free, 514
projective =>, 548
forgetful functor, 485 
right-adjoint to free, 493 
four-lemma, 184, 589
Fraenkel, Abraham, 1
free 

abelian group, 75 
action, 109 
algebra, 167 

functor, left-adjoint to forgetful, 493 
group, 70

universal property, 71 
locally, 556 
module, 167, 305
basis of, 306 


homomorphisms of, 314 
is flat, 508 
is projective, 548 
product, 62 

resolution of a module, 343 
freshman's dream, 439 
Freyd-Mitchell embedding theorem, 156,
559, 588, 591
Frobenius, Georg
homomorphism, 435 
full 

functor, 488, 588 
subcategory, 26, 595
function, 9 
composition, 10 
identity, 9
set-function, 19 
functor, 61 
δ-functor, 660, 680
additive, 485, 561, 613
adjoint, 492, 498, 642
bifunctor, 679 
category, 497 
cohomological, 685 
cohomology, 596 
cone of a, 489 
conservative, 520
contravariant, 484
covariant, 484
derived, 509, 642, 645
effaceable, 660

equivalence of categories, 488 
exact, 495, 588, 603, 613, 643
Ext, 551 

faithful, 488, 587
forgetful, 485, 493
full, 488, 588
fully faithful, 488
natural 

isomorphism, 492 
transformation, 492, 636 
of points, 487, 581 
representable, 487, 497 
Tor, 509

fundamental theorem 
of algebra, 286, 468 
of arithmetic, 255 
on symmetric functions, 471 
criterion, 477

extension, 391, 454, 458, 657
field, 442

group of a polynomial, 477
group of an extension, 159
inverse problem, 473 
theory, 454 

Gauss, Carl Friedrich, 286, 302
elimination, 322
lemma, 270, 273

Gaussian elimination, 322, 347, 376
over Euclidean domains, 323
Gaussian integers, 294
gcd, 252

general linear group, 321 
germ of a rational function, 415 
global dimension, 679 
golden ratio, 425 
graded 
algebra, 527 
homomorphisms, 528 
module, 413, 527 
ring, 413, 527 

Gram-Schmidt process, 371 
graph, 9

Grassmannian, 326, 545
Grayson, Dan, 266
greatest common divisor, 252
Grothendieck, Alexander, 413
group, 337, 605 
spectral sequence, 695, 697 
group, 29, 42 
p-group, 188, 190
abelian, 45, 62
action, 64, 108
invariants, 655
alternating, 220, 473
automorphism
inner, 86, 193
center of a, 189
class formula, 190
cohomology, 655
cyclic, 54, 67, 82, 103
described by generators and relations, 99
dihedral, 52, 225
dual, 241

finitely generated, 82 
finitely presented, 99 
free, 70 

free abelian, 75 
general linear, 321 
generated by a subset, 52, 81 
Grothendieck, 337, 605 
homology, 658
homomorphism, 59
infinite cyclic, 67, 70
modular, 63, 95, 114
nilpotent, 213, 233
object, 115
of order $p^2$, 191
of order $p^2q$, 214
of order $p^3$, 240
of order pq, 201, 232
of order 8, 204, 240
of permutations, 50 
of units in a ring, 123 
opposite, 113 
orthogonal, 370 
presentation, 99 
quaternionic, 88, 128, 234 
quotient, by a normal subgroup, 93 
quotient, by an equivalence relation, 90 
ring, 127, 655 
simple, 196, 205 
solvable, 211, 475 
symmetric, 50, 478 
trivial, 42, 61 
unitary, 370 
groupoid, 29, 41 

Hamilton, William Rowan, 128 
Hasse, Helmut, 256 
Hauptidealsatz, 259 
Hausdorff, Felix 
maximal principle, 264 
height of a prime ideal, 259 
hermitian 
matrix, 371
product, 370
Hilbert, David
Nullstellensatz, 153, 171, 285, 400, 404
‘theorem 90’, 467, 658
Zahlbericht, 467
Hom

functors, 486 

is left-exact, not right-exact, 537 
is right-adjoint, 536
homogeneous 

element of a graded ring, 527 
ideal, 528 

homological algebra, 174, 495, 559 
homology and cohomology, 178, 335, 592 
computing Ext, 553 
computing Tor, 509 
homology of groups, 658 
as $\mathrm{Ext}^0$, 552
of field extensions, 386 
of fields, 385 
of groups, 59 
of rings, 129 

homomorphism of groups, 53 
between morphisms of complexes, 611
Hurewicz, Witold, 492
icosahedral group, 223
ideal, 138 

<=> kernel, 141 
annihilator, 342 
homogeneous, 528
intersection, 146
maximal, 150
prime, 150 

minimal, 250, 267, 348 
principal, 145 
product, 146, 269 
radical, 409 
sum, 145 

identity element, 42 
image 

isomorphic to coimage, 573 
of a morphism, in an abelian category, 
572 

indeterminate, 125 
induction, 265 
principle of, 265 
transfinite, 263 
injective, 29 

R-Mod has enough injectives, 548, 550 

abelian group, 550 

dimension, 679 

enough injectives, 550, 620 

function, 11, 12, 14 

module, 546 

object of an abelian category, 619 
resolution, 629 
resolution of a module, 551 
inner product, 370 
inseparable 
degree, 439, 440 
element of an extension, 436 
polynomial, 433 
purely, 439 
integral domain, 122 
Invariant Basis Number property, 310 
invariant factors, 237, 354 
invariants of a group action, 655 
inverse, 42
is unique, 43 

inverse Galois problem, 473 
inverse limit, 489 
irreducible, 247 


=> prime in UFDs, 253 
algebraic set, 415 
factor, 251 
module, 163 
irrelevant ideal, 533 
isomorphism, 27 

<=> monomorphism and epimorphism,
in abelian categories, 566 
of sets, 11 
quasi-, 607 

theorems, 97, 101, 142, 162 

Jacobi, Carl Gustav Jacob 
identity, 312 
Jacobson, Nathan, 120 
radical, 267 
Jordan block, 378 
Jordan canonical form, 379 
Jordan-Hölder theorem, 206

kernel, 80, 132 

<=> ideal, in Ring, 141 
<=> monomorphism, in an abelian
category, 564

<=> normal subgroup, in Grp, 95
<=> submodule, in R-Mod, 161
categorical definition, 490, 561 
universal property, 81, 166 
Koszul, Jean-Louis 
complex, 348, 535 

Kronecker-Weber theorem, 480, 482 
Krull, Wolfgang, 498 
dimension, 153, 250, 354 
Hauptidealsatz, 259 
theorem, 265 
Kummer, Ernst, 468 
theory, 465 

Lagrange, Joseph Louis 
four-square theorem, 299, 303 
interpolation, 290 
resolvent, 465 
theorem, 103 
Lamé, Gabriel, 280
of intermediate fields of an extension, 460
of subgroups, 84 
of a quotient, 100 
Laurent polynomial, 127, 656 
leading coefficient, 147, 245 
lemma 

Artin-Rees, 535 
five-, 185, 590 
four-, 184, 589 
Gauss’s, 270 
horseshoe, 648 

Nakayama’s, 164, 339, 357, 514 
Schur's, 163

snake, 179, 510, 553, 579, 597 
Yoneda, 497, 591 

Zorn’s, 262, 265, 307, 311, 403, 549 
Lichiardopol, Nicolas, 453 
Lie, Sophus 
algebra, 127, 312 
bracket, 312 
limit, 489 
direct, 445, 491 
injective, 491 
inverse, 489 
projective, 489
linear 
map, 158 
maps, similar, 361 
polynomial, 284 
transformation, 359 
linearly dependent/independent, 306 
Liouville, Joseph 
theorem, 152 
local 

parameter, in a DVR, 260 
ring, 278, 339, 357, 514, 556 
localization, 270, 415, 681 
as a functor, 496 
is exact, 500 
of a module, 277, 496 
of a ring, 277 
locally factorial, 279 
locally free, 556 
<=> flat, 556 
<=> projective, 556 
long exact sequence 
in (co)homology, 600 
of (co)homology, 180 
of derived functors, 647, 651 
of Ext, 552 
of Tor, 509 
Lüroth, Jacob
theorem, 400 

Mac Lane, Saunders, 120 
mapping 

cone, 605, 623, 624, 665 
in topology, 607 
cylinder, 615 
in topology, 615 
Marcolli, Matilde, 419 
Massey, William, 687 
matrices, 43, 315
adjoint, 331 
block, 322 
cofactors of, 331 
companion, 374 

corresponding to homomorphisms, 316 


diagonalizable, 368, 380 
echelon form, 325 
elementary, 321 
elementary operations, 320 
equivalent, 320 
hermitian, 371 

inverse, through determinants, 332 
inverse, through Gaussian elimination, 
338 

Jordan canonical form of, 379 
minor of, 330 
normal, 383 
of change of basis, 319 
orthogonal, 370 
rational canonical form of, 375 
ring of, 121, 316 
self-adjoint, 371 
similar, 360 
symmetric, 371 
transpose, 329, 377, 541 
triangular, 86 
unitary, 370 
maximal ideal, 150 

every proper ideal is contained in a, 265 
McKay, James, 195 
Milne, James S., 473 
minimal polynomial 

divides the characteristic polynomial, 376 
of a linear transformation, 365, 397 
of an element of a field extension, 388 
minor of a matrix, 330 
Möbius transformation, 114
modular group, 63, 95, 114 
module, 157 

Z-module <==> abelian group, 158 
algebra, 159 

associated to a linear transformation, 372 
bimodule, 517 
cofree, 551 
divisible, 550 
dual, 537 
finite, 171 
finite length, 314 
finitely generated, 169, 342 
over a PID, 240, 349, 354 
finitely presented, 342 
flat, 507 
free, 167, 306 
injective, 546 
irreducible, 163 
localization, 277, 496 
Noetherian, 170 
over a group, 655 
projective, 173, 546 
quotient, 161 
reflexive, 542 
simple, 163, 174 
submodule, 160
torsion, 340, 355 
torsion-free, 340 
monic polynomial, 147 
monoid, 120, 126, 169 
monomorphism, 29 
in an additive/abelian category, 563 
of groups, 84 
of modules, 167 
of rings, 132 
of sets, 14 
split, 178, 547 
Morita, Kiiti 
equivalent rings, 522 
morphism, 18 
trivial, 64 

multilinear map, 522 
alternating, 524 
symmetric, 524 
multiplication table, 48 
multiplicative subset, 277 
multiplicatively closed set, 277 
multiplicity 

of a root of a polynomial, 281 
of an eigenvalue, algebraic, 366, 379 
of an eigenvalue, geometric, 367, 379 
multiset, 2, 10, 27, 207, 251 

Nakayama, Tadashi 
lemma, 164, 339, 357, 514 
natural 

isomorphism, 492 
transformation, 492 
natural projection, 15 
natural transformation, 636 
nilpotent, 128, 144, 279, 382, 409 
group, 213, 233 
nilradical, 144, 155, 267, 409 
and associated primes, 348 
nine-lemma, 185 
Noether, Emmy 
normalization theorem, 416 
Noetherian 
module, 170 
ring, 145, 244, 267 
non-zero-divisor, 122 
nondegenerate bilinear map, 544 
nonsingular bilinear map, 544 
norm 
in H, 136 

in Q(v^), 154, 260 
in Z[v/=i], 301 
in $\mathbb Z[\sqrt{-5}]$, 251
in $\mathbb Z[i]$, 295

of an element of a field extension, 398, 
440, 467 

is transitive, 440 


normal 

bundle, 543, 556 
extension, 431 
subgroup, 88 
<==> kernel, 95 
normalizer, 191
nullity, 334 

Nullstellensatz, 153, 171, 404 
strong, 410 
weak, 409 
homogeneous, 533 

object 
final, 31 
initial, 31 
terminal, 31, 33
zero-, 61, 64, 159, 561 
octahedral axiom, 683 
operator, 359 

opposite category, 26, 38, 484, 620 

orbit, 110 

order 

of a group, 47 
of a power series, 250 
of an element, 46 
ordered basis, 308 
ordered pairs, 5 
orthogonal 
complement, 370 
group, 370 
vectors, 370 

orthonormal set of vectors, 370 

$\mathrm{PSL}_2(\mathbb Z)$, 95, 114
p-adic 

integers, 498 
numbers, 498 
partial fractions, 276 
partition, 7 

of a positive integer, 216, 224, 240 
Pavlovian reaction, 97, 176, 505, 537, 566 
perfect field, 435 
permutation, 50 
conjugate, 217 
cycle notation, 215 
even, odd, 219 
type of, 216 
PID, 145, 254 
=> UFD, 254

characterization of, 259, 344, 347, 353,
358 

classification of finitely generated 
modules over a, 240, 354 
divisible over a injective, 550 
pivot, 325 
platonic solids, 57 
Plücker, Julius
embedding, 532
pointed set, 24
polynomial, 125
constant, 125
content of a, 269
cyclotomic, 67, 289, 426, 446
derivative of a, 433
function, 131, 282
inseparable, 433
irreducible over a UFD, 275
Laurent, 127, 656
monic, 147
primitive, 268
root, 281
separable, 433
very primitive, 268 

with noncommuting indeterminates, 169, 
531 

Pontryagin, Lev Semenovich
dual, 555

poset = partially ordered set, 261
power 

series, 126, 498 
order, 250 

ring, Euclidean domain, 259 
ring, Noetherian, 250
ring, not necessarily a UFD, 273
set, 4, 18, 127
preadditive category, 574 
presentation 
of a group, 99 
of a module, 342
presheaf, 485, 575
on a topological space, 486
prime

=> irreducible, 247
associated, of a module, 347 
element in an integral domain, 247 
ideal, 150 

minimal, 250, 267, 348
nonzero ideals are maximal in a PID, 151
subfield, 279, 386 
primitive 

polynomial, 268 
very, 268 

prime divisor, 453
root of 1, 446

principal ideal domain, see PID 
principle of induction, 263 
product, 5, 489 
direct, 61, 226
fibered, 39, 173
free, 62 

in an abelian category, 567 
of groups, 61 
of modules, 165 
of rings, 133 


semidirect, 230 
universal property, 35 
projection, 172 
formula, 521 
projective 

R-Mod has enough projectives, 548
and splitting sequences, 547
dimension, 678
enough projectives, 548, 620
flat =>, in some cases, 548
module, 173, 546

modules over local rings are free, 358, 556
object of an abelian category, 619
resolution, 629
resolution of a module, 548
projective space, 326, 413, 416
algebraic set in, 417, 533
coordinates in, 416 
pull-back, 173 

in an abelian category, 567 
pure tensor, 504 
purely inseparable, 439
push-out, 173 

in an abelian category, 567 

quantifier, 2 

quasi-isomorphism, 592, 607
quaternion, 128, 136, 234, 302
integral, 302
norm, 136

quaternionic group, 88, 128, 234
quotient, 4, 7 
group, 90, 93 

in an abelian category, 564, 588 
module, 161
ring, 140 

universal property, 33 

Rabinowitsch, J. L. 
trick, 110 

radical of an ideal, 409 
rank 

of a linear map, 334 
of a matrix, 334 
of a module, 311, 354 
rational 

canonical form, 375 
function, 387 
root test, 282 
rational function, 273 
reduced ring, 144, 412, 512 
Rees, David 
algebra, 164, 528 
refinement 

of a normal series, 209 
reflexive module, 542, 555 
regular polygon, constructibility of a, 418,
424, 426, 449, 469 
regular sequence, 348, 535 
relation, 6 

compatible with a group structure, 90 
equivalence, 7 
order, 261 

representable functor, 497 
resolution, 509, 592, 607, 629 
Cartan-Eilenberg, 651, 675 
fully projective, 651 
injective, 551, 629 
of a complex, 632, 672 
of a module, 343 
projective, 548, 629 
restriction 
of scalars, 518 

of sections of a presheaf, 496 
ring, 119 
Artinian, 250
Boolean, 144, 156, 300 
center of a, 137, 159 
characteristic of, 141 
commutative, 121 
discrete valuation, 260, 278 
division, 123, 128 
domain with factorizations, 248 
Euclidean domain, 255 
factorial, 248 
graded, 413, 527 
homomorphism, 129 
ideal of, 138 
integral domain, 122 
Krull dimension of a, 153, 250, 354, 679 
local, 278, 339, 357, 514, 556 
localization, 270, 277, 415 
Noetherian, 145, 244, 267 
of Laurent polynomials, 127 
of matrices, 121 

of power series, 126, 250, 259, 273, 498 
of quaternions, 128, 136, 302 
PID, 145
polynomial, 124 
quotient, 140 
reduced, 144, 412, 512 
simple, 144 
subring, 132 
UFD, 248 
zero-, 121 
rng, 120, 139 
roof diagram, 681, 682 
root 

of 1, 445 

of a polynomial, 281 
primitive, 446 
row space of a matrix, 333 
Ruffini, Paolo, 477


$\mathrm{SO}_3(\mathbb R)$, 85

is a quotient of $\mathrm{SU}_2(\mathbb C)$, 106
$\mathrm{SU}_2(\mathbb C)$, 106

and quaternions, 136, 234 
is simply connected, 86 
scalar, 308 
extension, 518 
restriction, 518 
scheme, 413 
Schreier’s theorem, 209 
Schubert, Hermann 
cells, 326 
Schur, Issai, 473 
decomposition, 383 
form, 383 
lemma, 163 
section, 13 
of a presheaf, 496 
self-adjoint matrix, 371 
semidirect product, 230 
semigroup, 126 
separable 
closure, 439 
degree, 436 

element of an extension, 436 
extension, 436 
polynomial, 433 
series of subgroups, 205, 689 
abelian, 211 
composition, 206 
cyclic, 211 
derived, 211 
equivalent, 207 
normal, 205 
Serre, Jean-Pierre 
problem, 556 
set, 1 

-function, 19 
indexed, 10 
multiset, 10 
of parts, 4 
pointed, 24 
power, 4, 18, 127 
Shafarevich, Igor, 473 
sheaf, 486, 496, 575 
cohomology, 575 
shift functor, 596 
short exact sequence, 176 
of complexes, 597 
similar 

endomorphisms, 361
matrices, 360 
simple 

group, 196, 205 
of order 60, 205, 222 
module, 163, 174 
of a double complex, 667 
ring, 144
singleton, 2, 24 
slice category, 24 
Smith normal form, 324 
snake lemma, 179, 510, 553, 579, 597 
solvability of polynomial equations, 474 
solvable 
extension, 475 
for $n \ge 5$, $S_n$ is not, 224
group, 211, 475
Spec R, 151, 413, 486
spectral 

decomposition, 367, 380 
theorem, 383 

spectral sequence, 670, 686 
Grothendieck, 695, 697 
of a double complex, 691 
picture, 694 
spectrum 
maximal, 155 
of a ring, 151, 413, 486 
of an operator, 365 
split 

epimorphism, 178, 547 
exact complex, 621, 627 
exact sequence, 177 
monomorphism, 178, 547 
splitting 
field, 429 

sequence, 177, 229, 547 
stabilizer, 111, 187
standard basis, 316
Stark, Harold, 302
Steenrod, Norman, 18
straightedge-and-compass constructions, 
417 

subgroup 
p-Sylow, 196 
centralizer, 190, 191 
characteristic, 202 
commutator, 83, 96, 210, 226 
cosets of, 91 
cyclic, 82 

generated by a subset, 81 
index of, 102 
normal, 88 
normalizer, 191 
of a free group, 83 
of cyclic group, 82 
stabilizer, 111, 187 
transitive, of $S_n$, 225
submodule, 160 
<=> kernel, 161 
generated by a subset, 169 
subobject classifier, 27 
subring, 132 
surjective function, 12 


Sylow theorems 
first, 196 
second, 198 
third, 199 
symmetric 
algebra, 529 
functions 
elementary, 472 
fundamental theorem, 471 
group, 50, 214, 478 
conjugacy in, 217 
is generated by transpositions, 219 
matrix, 371 
multilinear map, 524 
power, 524 
symmetry, 52 

systems of linear equations, 327 

Tarski, Alfred, 262 
tensor 
algebra, 529 
as $\mathrm{Tor}_0$, 509

by free modules is exact, 507 
commutes with direct sums, 505 
is associative, 523 
is left-adjoint to Hom, 505, 517
is right-exact, not left-exact, 505, 507 
power, 523 
product, 501 
pure, 504 
theorem 

Abel-Ruffini, 477 
Baer’s criterion, 549 
Birkhoff-Vandiver, 453
Burnside’s, 214 
Cauchy's, 195 
abelian case, 107 
Cayley’s, 110, 472 
for rings, 135 

Cayley-Hamilton, 365, 376 
characterizations of Galois extensions, 
457 

Chinese remainder, 291 
classification of finite abelian groups, 

237, 480 

classification of finitely generated 
modules over a PID, 354 
cohomology of total complexes, 674 
constructibility of the regular n-gon, 469 
derived functors are computed by acyclic 
resolutions, 663 
Dirichlet’s, 454, 480 
Euler’s, 107 

exactness of the total complex, 670 
Ext may be computed by resolving either 
argument, 677 
Feit-Thompson, 212, 214 
Fermat’s
last, 280 

little, 103, 107, 439 
on sums of squares, 298 
first isomorphism 
for groups, 97 
for modules, 162 
for rings, 142 
fixed point, 188

Freyd-Mitchell embedding, 156, 559, 588,
591 

fundamental 
of algebra, 285, 468 
of arithmetic, 255 
of Galois theory, 460, 461 
on symmetric functions, 471 
Hilbert’s 

‘90’, 467, 482, 658 
basis, 172, 245, 407

Nullstellensatz, 153, 171, 405, 409, 410
Jordan-Hölder, 206, 313
Kronecker-Weber, 480, 482
Krull

Hauptidealsatz, 259 
Krull's, 265 
Lagrange's, 103 

Lagrange's four-square, 299, 303
Liouville’s, 152
long exact sequence
in cohomology, 600
of derived functors, 651
Lüroth's, 400

Noether’s normalization, 416 
of nonsolvability by radicals, 474 
on constructibility by straightedge and 
compass, 422 

realization of the derived category, 635 
Schreier’s, 209
second isomorphism 
for groups, 101 
for modules, 162 
for rings, 142 

spectral, for normal operators, 383 
Sylow’s 
first, 196 
second, 198 
third, 199 
third isomorphism 
for groups, 101 
for modules, 162 
for rings, 142 

Tor may be computed by resolving either 
argument, 676 

Wedderburn's (little), 124, 137, 204, 441, 
453 

well-ordering, 263 
Wilson’s, 70, 225 


Tor, 509, 646
long exact sequence, 509
may be computed by resolving either
argument, 509, 676
why it is called Tor, 513
torsion 
element, 340 
module, 340, 355 
submodule, 340 
total complex 

conditions for exactness, 670 
of a double complex, 668 
total complex of a double complex, 606 
totient function, 87 
trace 

of a matrix, 362 

of an element of a field extension, 398, 
440, 467 
is transitive, 440 
of an endomorphism, 362 
transcendence 
basis, 400
degree, 400
transcendental
π is, 394, 418, 426
element of an extension, 391 
number, 394 
purely, extension, 400 
transitive 
action, 110 

subgroup of S n , 225, 478 
transitivity of norm and trace, 440 
transpose of a matrix, 329, 377, 541
transposition, 219 
triangle 

distinguished, 602, 606, 650, 683
exact, 601, 684 
triangular matrix, 86 
triangulated category, 602, 610, 618, 650, 
683 

UFD, 248 
R[x] is if R is, 273
Z[x] is a, 276
characterization of, 253
every PID is a, 254
is not a local property, 279
locally factorial, 279

primes of height 1 are principal in a, 259
unique factorization domain, see UFD 
unit, 123 

unitary group, 370 
universal 
identity, 332 
property, 31, 489 
upper bound, 262 
valuation

Dedekind-Hasse, 256
discrete, 260
Euclidean, 256
variety, 407, 415
vector, 308, 316
bundle, 358
space, 158, 308
basis, 306
dimension, 311
dual, 537
Venn, John
diagram, 1
versal, 281, 388

Wedderburn, Joseph

(little) theorem, 124, 137, 204, 441, 453
wedge power, 524
Weil, André
conjectures, 369
well-defined, 16
well-ordering, 262
principle, 83, 262
theorem, 263
Weyl, Hermann
algebra, 534
Wilson, John
theorem, 70, 225
Witt, Ernst, 453
word, 72

word problem, 100 

woset = well-ordered set, 262 

Yoneda, Nobuo
Ext, 557 
lemma, 497 
Young diagram, 216 

Zariski, Oscar
topology, 408, 497
Zermelo, Ernst, 1
Zermelo-Fraenkel, 1, 262
zero-divisor, 122

and associated primes, 348
non-, 122

zero-morphism, in an abelian category, 561,
574

zero-object, 61, 64, 159, 561
Zorn, Max

lemma, 262, 265, 307, 311, 403, 549